Over the past year and a few months, I’ve had a chance to lead a few different data science teams working on different kinds of hypotheses. The engineering process view that the so-called agile methodologies bring to data science teams is something that has been written about. However, one’s own experiences tend to be different, […]Read more "Lessons from Agile in Data Science"
Although the data science and big data buzzwords have been bandied about for years now, and although artificial intelligence has been talked about for decades, the two fields are irrevocably inter-related and interdependent. For one thing, the wide interest in data science started just as we were beginning to leverage distributed data storage and computation […]Read more "The Expert System Anachronism in the Data Science and AI Divergence"
Data scientists are new age explorers. Their field of exploration is rife with data from various sources. Their methods are mathematics, linear algebra, computational sciences, statistics and data visualisation. Their tools are programming languages, frameworks, libraries and statistical analysis tools. And their rewards are stepping stones, better understanding and insights. The data science process for […]Read more "Hypothesis Generation: A Key Data Science Challenge"
Thanks to a question on Quora, I’ve had the chance to explore the skewness of samples from symmetric distributions, prior to and after odd-exponent transformations. While the answer is posted there, I’d like to explore related odd transformations here and their effect on the skewness. A simple experiment below reveals the […]Read more "Exploring Skewness for Odd-Exponent Transformations of Symmetric Distributions"
A decade ago, Microsoft looked very different from the Microsoft we see today – it has been a remarkable transformation. One of the areas where MS have made a big push is machine learning and data analytics. Although the CRAN repository is going strong with >10,000 packages as of today, the MRAN repository (Microsoft’s Managed […]Read more "Azure ML Studio and R"
I’m given to spurts of activity on Quora. Over the past year, I’ve had the opportunity to answer several questions there on the topics of data science, big data and data engineering. Some answers here are career-specific, while others are of a technical nature. Then there are interesting and nuanced questions that are always a […]Read more "Quora Data Science Answers Roundup"
Early in December 2016, I spoke at the Strata+Hadoop World 2016 Singapore conference on sensor data analysis approaches, specifically, time series analysis. My company, The Data Team, were represented at Strata+Hadoop World at the innovator’s pavilion. It was a wonderful learning experience for me at the conference, and I have the following key take-aways: There […]Read more "Video: Talk from Strata+Hadoop World 2016 Singapore"