If you’re reading this you probably already have an inkling of what a data scientist is. Have you ever considered what a data scientist isn’t?
Perhaps the most famous and widely used distribution is the normal distribution, otherwise known as the “bell curve.” The reason why the bell curve crops up a lot is because when you have a bunch of independent, complex, real-world factors added together that produce randomly distributed data, that data will often be distributed in a normal or bell-like way. This is called the central limit theorem.
Universities and other organizations have started to offer data science degrees, training, and certificates. Conferences and competitions also exist for people serious about adding credentials as data scientists.
When you were a child, perhaps there came that day when someone explained to you that Santa Claus didn’t exist, outside of men with bad rosacea dressed up at the mall.
Well, today I’m going to shatter another belief: your not-from-concentrate premium orange juice was not hand squeezed. In fact, the pulp in it is probably from different oranges than the juice, and the juice has been pulled from different vats and blended according to mathematical models to ensure that each carafe you drink tastes the same as the last.