My interest in data science started around 2015. I’d been working in the marketing department of a real estate company for a year when I discovered (or, I should say, rediscovered) my love for numbers, analysis and telling stories. For as long as I can remember, I have loved math and science, but I also loved creating things, whether it was art by way of drawings and paintings, writing, or music by way of the piano.
I attended Southern Tech, later known as Southern Polytechnic State University and now known as Kennesaw State University. After two years of messing around, I dropped out. Because learning is such a big part of my DNA, I enrolled shortly afterward in a computer programming class at Chattahoochee Tech. Despite doing well, I only stayed for one semester.
Learning Data Science
As data science became a buzzword, I dug in, reading as many articles as I could get my hands and eyes on. I soon realized that this field was something bigger than just ‘data analyst’ or ‘statistics’. I wanted to get my feet wet, so I took a few Coursera, Udemy, Udacity and edX courses. I learned a little of the R programming language, a little Python, some advanced Excel techniques, statistical analysis, regression, and data visualization.
The more I learned, the more confused I became about which direction I wanted to go. There were hundreds of data science-related courses, with more coming out each week. At one point I was enrolled in five or six courses at the same time. Some I’d finish, most I wouldn’t. I found my heart racing with excitement as I opened each new ‘You want to Learn Data Science Today?’ email announcement. I loved visualizing data and knew my way around charts, but I liked the analysis part too. I pulled my hair out.
Did I Learn the Wrong Thing?
The nagging problem I had with the data science-related courses was not learning the material; it was figuring out how to approach data science as a whole. In other words, I wasn’t sure whether I was on the right path for me. My biggest fear was that I’d rack up 67 certifications as a data doodah and find them utterly useless by next Christmas. I came across a data science boot camp at Georgia Tech last summer, and I liked the curriculum, as it seemed to cover all the bases. But I wasn’t sure it was a good idea to fork over a sizeable amount for the 24-week program, as I had been unemployed for several months before getting my current gig. Then came David Venturi. This self-learning rock star saved my learning life. He wrote an article about how he dropped out of a computer science program and put together a customized data science program for himself.
This was like a breath of fresh air. This was the proverbial fork in the road that had the light shining on only one path: to create my own data science curriculum from the online learning platforms I’d grown to love and become addicted to. I came across an article written by Harrison Jansma that provided a general curriculum guide with a warning that ‘this is intended to be high-level, and not just a list of courses to take or books to read’:
- Python Programming
- Statistics & Linear Algebra
  - A prerequisite for machine learning and data analysis. If you already have a solid
- Numpy, Pandas, & Matplotlib
- Machine Learning
- Production Systems
I was inspired to make my own program:
- Linear algebra and multivariate calculus. You can learn linear algebra for free at Khan Academy.
- Regression, using both linear and nonlinear models where appropriate. You can learn about linear regression at Coursera.
- Probability theory, including Bayes’ Law and the Central Limit Theorem. You can learn about probability and data at Coursera.
- Numerical analysis, including time series analysis and forecasting. You can learn about time series forecasting at Udacity.
- Core machine learning methods, including clustering, decision trees, and k-NN. You can learn machine learning for free via Stanford University’s course on Coursera.
- Python data analytics.
Your Data Science Education is as Extensive as you Make It
Even after crafting my own data science boot camp, I had no assumptions about what I could, should or would get from it. For all I knew, I would just be gaining some knowledge in an area that interested me but that I wouldn’t get hired in. Or my no-degree-having self could indeed get started somewhere in data science, doing something I like. Still, I understand that my program is basically boot camp style, and if I wanted to go deeper, I would need to devote a lot more time to learning.
Although my current job title doesn’t include the words ‘data science’, I work in a field (Search Engine Optimization) that is becoming increasingly aligned with data science. SEOs must contend with a lot of data. We deal mostly with URLs, keywords, and web user behavior data, and all of it needs to be cleaned and analyzed.
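To give a rough idea of what cleaning URL data can look like, here is a minimal Python sketch using only the standard library. The `clean_url` function, the `TRACKING_PARAMS` set and the example.com URLs are all made up for illustration; they are assumptions, not part of any particular SEO tool, and a real project would need its own rules.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical set of tracking parameters to drop; adjust for your own data.
TRACKING_PARAMS = {"utm_source", "utm_medium", "utm_campaign", "gclid"}


def clean_url(url: str) -> str:
    """Normalize a URL so that variants of the same page collapse to one key."""
    parts = urlsplit(url.strip())
    # Scheme and host are case-insensitive, so lowercase them.
    scheme = parts.scheme.lower()
    host = parts.netloc.lower()
    # Treat "/page" and "/page/" as the same path, but keep the root "/".
    path = parts.path.rstrip("/") or "/"
    # Drop tracking parameters and keep the rest in their original order.
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in TRACKING_PARAMS
    ]
    # Drop the fragment, since it is not sent to the server.
    return urlunsplit((scheme, host, path, urlencode(kept), ""))


# Made-up examples: three spellings of what is really one page.
raw_urls = [
    "https://Example.com/Listings/?utm_source=newsletter",
    "https://example.com/Listings",
    " https://example.com/Listings/#photos ",
]
unique_pages = sorted({clean_url(u) for u in raw_urls})
print(unique_pages)
```

Once the variants collapse to a single key, counting visits per page becomes much more reliable. Note that this sketch leaves the path's letter case alone, because some servers treat paths as case-sensitive.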
When a client wants to know why their website saw a decline in September 2018 compared to September 2017, the SEO must be good at seeing patterns in the data, analyzing them, and telling the result as a story that the client can understand and relate to. Many of the data science articles I’ve read say that there are not enough qualified people in the field right now, and employment projections suggest that job openings will continue to outnumber candidates.