Author: Vincent Granville
Source for picture: here
The first part of this list was published here. These are articles that I wrote in the last few years. The whole series will feature articles related to the following aspects of machine learning:
 Mathematics, simulations, benchmarking algorithms based on synthetic data (in short, experimental data science)
 Opinions, for instance about the value of a PhD in our field, or the use of some techniques
 Methods, principles, rules of thumb, recipes, tricks
 Business analytics
 Core Techniques
My articles are always written in simple English and accessible to professionals with typically one year of calculus or statistical training, at the undergraduate level. They are geared towards people who use data but are interesting in gaining more practical analytical experience. Managers and decision makers are part of my intended audience. The style is compact, geared towards people who do not have a lot of free time.
Despite these restrictions, stateoftheart, ofthebeatenpath results as well as machine learning trade secrets and research material are frequently shared. References to more advanced literature (from myself and other authors) is provided for those who want to dig deeper in the interested topics discussed.
1. Machine Learning Tricks, Recipes and Statistical Models
These articles focus on techniques that have wide applications or that are otherwise fundamental or seminal in nature.
 One Trillion Random Digits
 New Perspective on the Central Limit Theorem and Statistical Testing
 Simple Solution to Feature Selection Problems
 ScaleInvariant Clustering and Regression
 Deep Dive into Polynomial Regression and Overfitting
 Stochastic Processes and New Tests of Randomness – Application to Cool Number Theory Problem
 A Simple Introduction to Complex Stochastic Processes – Part 2
 A Simple Introduction to Complex Stochastic Processes
 High Precision Computing: Benchmark, Examples, and Tutorial
 Logistic Map, Chaos, Randomness and Quantum Algorithms
 Graph Theory: Six Degrees of Separation Problem
 Interesting Problem for Serious Geeks: Selfcorrecting Random Walks
 9 Offthebeatenpath Statistical Science Topics with Interesting Applications
 Data Science Method to Discover Large Prime Numbers
 Nice Generalization of the KNN Clustering Algorithm – Also Useful for Data Reduction
 How to Detect if Numbers are Random or Not
 How and Why: Decorrelate Time Series
 Distribution of Arrival Times of Extreme Events
 Why Zipf’s law explains so many big data and physics phenomenons
2. Free books

Statistics: New Foundations, Toolbox, and Machine Learning Recipes
Available here. In about 300 pages and 28 chapters it covers many new topics, offering a fresh perspective on the subject, including rules of thumb and recipes that are easy to automate or integrate in blackbox systems, as well as new modelfree, datadriven foundations to statistical science and predictive analytics. The approach focuses on robust techniques; it is bottomup (from applications to theory), in contrast to the traditional topdown approach.
The material is accessible to practitioners with a oneyear collegelevel exposure to statistics and probability. The compact and tutorial style, featuring many applications with numerous illustrations, is aimed at practitioners, researchers, and executives in various quantitative fields.

Applied Stochastic Processes
Available here. Full title: Applied Stochastic Processes, Chaos Modeling, and Probabilistic Properties of Numeration Systems (104 pages, 16 chapters.) This book is intended for professionals in data science, computer science, operations research, statistics, machine learning, big data, and mathematics. In 100 pages, it covers many new topics, offering a fresh perspective on the subject.
It is accessible to practitioners with a twoyear collegelevel exposure to statistics and probability. The compact and tutorial style, featuring many applications (Blockchain, quantum algorithms, HPC, random number generation, cryptography, Fintech, web crawling, statistical testing) with numerous illustrations, is aimed at practitioners, researchers and executives in various quantitative fields.
To receive a weekly digest of our new articles, subscribe to our newsletter, here.
About the author: Vincent Granville is a data science pioneer, mathematician, book author (Wiley), patent owner, former postdoc at Cambridge University, former VCfunded executive, with 20+ years of corporate experience including CNET, NBC, Visa, Wells Fargo, Microsoft, eBay. Vincent is also selfpublisher at DataShaping.com, and founded and cofounded a few startups, including one with a successful exit (Data Science Central acquired by Tech Target). He recently opened Paris Restaurant, in Anacortes. You can access Vincent’s articles and books, here.