Value is in the eye of the beholder…
I probably receive anywhere from 20 to 40 messages every week or so from individuals with questions about data literacy, specific data packages, or data visualization with Tableau. There are a lot of questions that I hear that I never thought to ask. I never asked for permission. Many people ask should I learn Python or R? What is the best book to read to learn Tableau? These are personal and specific to individual interests and deserving of exploration. Mainly because like anything, your mileage may vary.
And then, there is the tired question about what is the difference between a data analyst, data scientist, data engineer, statistician, blah, blah, blah…I will leave you to explore this well worn debate elsewhere. There also is a bit of a territorial debate where member of these distinct groups poke fun at the other less worthy participants.
Here is what I learned. The brightest and best want to bring you along. They encourage, share resources, and are welcoming.
A recent podcast featuring Xiao-Li Meng, the Whipple V. N. Jones Professor of Statistics, and the Founding Editor-in-Chief of Harvard Data Science Review stopped me in my tracks. A year ago I was reading through his work and wrote a few blogs about his ideas. Curious about a pending part two to the discussion, Statistical Paradises and Paradoxes in Big Data(I): Law of Large Populations, Big Data Paradox, and the 2016 US Presidential Election, I sent him an email.
He promptly responded. Apologized about the delay in the Part II, sent me a copy of the article above and several others he thought might be of interest (along with access instructions). Meet him in the podcast below and learn about his role as Editor-In-Chief of The Harvard Data Science Review.
<a href="https://medium.com/media/8b412af03b6fcce48673d32e43212fce/href">https://medium.com/media/8b412af03b6fcce48673d32e43212fce/href</a>
The Mission and Scope of the HDSR is captured by this quote below:
As an open access platform of the Harvard Data Science Initiative, Harvard Data Science Review (HDSR) features foundational thinking, research milestones, educational innovations, and major applications. It aims to publish contents that help to define and shape data science as a scientifically rigorous and globally impactful multidisciplinary field based on the principled and purposed production, processing, parsing and analysis of data.
Read the tag-line and I dare you not to be excited if you are even slightly data adjacent,
A microscopic, telescopic, and kaleidoscopic view of data science
Here is one of the articles referenced in the journal section brilliantly labeled as CORNUCOPIA: impact, innovation, and knowledge transfer.
(A)Data in the Life: Authorship Attribution in Lennon-McCartney Songs
Apologies for the interactive experience that will likely derail your tasks at hand as you explore the features below. If the link brings you to 3 vizzes, pick number 3 to activate the visualization below.
I have been taught that items should be bundled in threes so here is one more. I recently read a great article in Medium, Anaconda is bloated — Set up a lean, robust data science environment with Miniconda and Conda-forge by Ted Petrou. Here is how this was a lifesaver. I work with large datasets in healthcare and medicine. The last thing I needed was a clunky giant installation of valuable but likely not needed programs on my MacBook. Thanks to Ted and his willingness to walk me through a few hiccups I am up and running. If you are a Python person and value just-in-time instruction I suggest you check out his workshops (a few free ones are there for a test drive) over on DUNDER DATA.
Although I completed an executive online course from Columbia School of Engineering in Applied Analytics — once I returned to the real world I wanted expertise at a granular level in the packages and libraries that I actually use daily. Ted is so clear and patient and provides the detailed explanations of why things work a certain way and how to customize your experience specific for your data needs.
<a href="https://medium.com/media/f43d95193ef5093e6f3e69700766b359/href">https://medium.com/media/f43d95193ef5093e6f3e69700766b359/href</a>
Follow along at data & donuts for more discussions on data literacy, analytics, and telling stories from large datasets…I am sharing information from the university and technical community college courses I teach as well as workshops, presentations, and areas of interest (data adjacent). Best way to get in touch is still through LinkedIn or twitter.
Bonny P McClain (data literacy and analytics)