After my previous journal, I spent a few days debating on whether I should take statistics or precalculus first. As you may have guessed from the title, I chose statistics, which I finally began on Saturday.
I’ve also completed my analytics classes. We concluded the course with integer optimization, which I’ll speak of briefly below.
The last thing I’ll discuss today is some potential online courses related to data science that I found, and the unlikely place I discovered them.
Statistics
I took statistics in college and really enjoyed it. However, I have been bitter about that specific class since taking it. I was never one of those kids who freaked out if they didn’t get an A in a class. Those kids used to annoy me. That is, until college. Then I became one of those kids! Getting a B was awful. An A- made me insane because a minus affected our GPA at my college.
Back to that statistics class. I received an A on all of my quizzes and tests. You might even say I was an exemplary student, if you so desire. Except for my attendance. I was a couple minutes late to class on no more than 5 occasions and I missed a couple lectures during the semester. My fault entirely, but I never missed an assignment.
My professor, in all her wisdom, docked points off your grade if you were ever tardy or absent from the class. Quite a few points, actually. In fact, she lowered my letter grade for the course from an A to a B solely due to my attendance.
I get that I had no excuse for my tardiness, but I have to throw down the bullshit flag here. You shouldn’t ever be late for a job or for grade school / high school classes, but this is college. I paid for the class. If someone doesn’t want to attend a class, that’s their prerogative. Attendance alone shouldn’t impact your grade. Obviously if you miss assignments or tests because of it, then of course it should matter. But I didn’t and I don’t feel that I deserve that B.
My GPA no longer has any impact whatsoever on my life, and mine was pretty good, but it still bothers me even today.
Anyway, I began statistics on Khan Academy several days ago. I’ve only completed the first lesson thus far, but it was an exceptionally long lesson.
When I say lesson, I’m referring to all of the lectures and quizzes within a section. For example, exponential & logarithmic functions in algebra II, or the unit circle definition of sine, cosine, and tangent in trigonometry. Each lesson has multiple lectures covering all the topics within that heading.
The first lesson of statistics is displaying and describing data. Fun stuff!
It actually is kind of fun. We only covered the basics of statistics so far: mean, median, mode, range, standard deviation, variance, interquartile range, different types of plots, etc.
It’s funny that I’ve spent two months running statistical analyses in The Analytics Edge and I didn’t know how to read a box plot. I had completely forgotten what it meant after all these years.
This refresher was nice before we delve into more advanced topics. I’m also happy with my decision to take statistics first. I think it will help me in my upcoming analytics and data science classes.
I will say, however, that one of the first quizzes in the class asked a lot of questions about topics we hadn’t covered up until that point. I am incapable of ignoring any of the quizzes, so I continued working on it until I got the required 5 questions in a row correct. I feel that this quiz should have been toward the end of the lesson, rather than the beginning.
Other than that, the class has been great and Sal Khan does a fantastic job teaching all of the math topics. I couldn’t be happier that Khan Academy exists and I can’t imagine taking math anywhere else.
Integer Optimization
We covered integer optimization during the last week of The Analytics Edge on edX. It’s similar to linear optimization, which we studied last week, except it only uses integers (whole numbers).
I’m still using OpenOffice, but it can be completed with Microsoft Excel or LibreOffice. I have Excel, but it was easier to follow along in the lectures with one of the other two programs.
I’m becoming more comfortable with optimization. I still need to work on it, but it’s making more sense to me. My mind puts up a wall when I try to use spreadsheets, which I’m slowly breaking down. I’m still not sure why I dislike them so much. They’re useful and not all that difficult to utilize.
In the lectures, we learned how integer optimization is used to quickly and efficiently schedule sport teams and hospital operation rooms, as well as increasing probability matches on eHarmony.
For operation rooms, you need to know how many rooms each department needs, how often they need them, the minimum and maximum number, etc. You write these constraints, along with other data and objective, into the spreadsheet. It then calculates how to divide up the available operation rooms each week.
It’s actually pretty cool and simplifies complex problems that used to take weeks or months by hand (in the event of sports team schedules).
Now that analytics is over, I’ll begin organizing all of my notes for the course so I can use all of the techniques I learned in the future. I only hope my future classes are as good as this one and the ones on Khan Academy.
Options for Future Study
I’ve been browsing through edX and Coursera for a new class to take to replace The Analytics Edge.
I realized during my search one of the reasons why I’m not as fond of Coursera and the reason why they don’t allow you to access quizzes and assignments without paying for the class. At least for the classes I tried, anyhow.
Coursera is a for-profit organization, while edX is nonprofit. This isn’t an issue in and of itself. There are many institutions and companies that are designed for profit, most in fact. In the educational world, however, it’s my opinion that this can cause a conflict of interest.
The educational entity is more interested in making money than they are with providing quality instruction. I read a study that shows on average (yay statistics), nonprofit institutions charge less for tuition and they spend five times more per student than for-profit schools. That’s huge!
Not having access to assignments or quizzes greatly diminishes the level of learning. edX doesn’t require you to pay unless you want a certificate. Otherwise, all of the content is freely available. That’s why I’m leery to return to Coursera, but I have found a few classes that seem promising.
It seems that every time I find a class that sounds interesting, I read horrible reviews. I don’t base my life on reviews, but if a staggering number of students rate a class poorly, it gives me cause to question my choice to take it.
Since that continued to occur, and there are so many options in the field of data science, I decided to google it to see if there are any forums or discussions on good classes.
This led me to Reddit of all places. I’m not really into Reddit and I rarely go there, but I was surprised how great of a resource it became. People gave their insight into many of the classes I was considering, and some I had no knowledge of.
Current Candidates:
Conclusion
I’m sorry for writing so much today. I couldn’t stop! Hopefully there was something of interest to you here. Comment below if you know of any other classes or if you’ve had experience with one of the ones above. Thank you for following me and I’ll see you next week!
Author and hobby digital artist. Barry aspires to become a data scientist and better himself as a person.