Analytics Camp 2012 Data Science
Melinda Thielbar discussing data science in the 1pm session.
Talks about using a distribution curve of % of customers versus $ Sales as an example.
Most underestimate mean vs median.
What does a data scientist do? Each part of being a data scientist actually is a scientific discipline in itself.
Many get caught up with having a single number that describes something, but it is the complexity of the data within the curve that is the important part.
Many times filtering of complex data will exclude information that may provide trends.
You can have success in analyzing data that others are excluding.
Lots of discussion on interactions of data scientists and other scientists with other disciplines as well as with each other. Can people from different worlds interact with respect? Can they communicate?
Data scientists understand randomness and understand it in all its forms and where it comes from.
A better data set is better than a bigger dataset. Discussion of experimental design. Different algorithms and tools can be used to reduce the number of data points. There is a lot of stuff in control theory.