Presentation: Predictability In ML Applications



10:35am - 11:25am


In the context of building predictive models, predictability is usually considered a blessing. After all, that is the goal: build the model with the highest predictive performance. The rise of ‘big data’ has in fact vastly improved our ability to predict human behavior, thanks to the introduction of much more informative features. In practice, however, the picture is more nuanced. In many applications, the same outcome can be observed for very different reasons. In such mixed scenarios, the model will automatically gravitate to the reason that is easiest to predict, at the expense of the others. This holds even if the predictable scenario is far less common or relevant. We present a number of such scenarios: clicks on ads being performed ‘intentionally’ vs. ‘accidentally’, online forms being filled out by people or by fraudulent bots, and finally consumers visiting store locations vs. their phones pretending to be there. The combination of different and highly informative features can have a significantly negative overall impact on the usefulness of predictive modeling.
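
The 'intentional' vs. 'accidental' click scenario can be illustrated with a toy sketch. It is not taken from the talk: the segment names and every count below are invented for illustration, and the "model" is just the observed click rate per segment, standing in for any learner that optimizes click prediction. The model never sees why a click happened, only that it happened.

```python
from collections import defaultdict

# Hypothetical impression counts: (segment, impressions, clicks, click cause).
# All names and numbers are made up for illustration only.
SEGMENTS = [
    ("flashlight_app_users", 1_000, 20, "accidental"),
    ("in_market_shoppers", 10_000, 100, "intentional"),
    ("everyone_else", 89_000, 178, "intentional"),
]


def build_log(segments):
    """Expand segment counts into one (segment, clicked, cause) row per impression."""
    rows = []
    for name, impressions, clicks, cause in segments:
        rows += [(name, 1, cause)] * clicks
        rows += [(name, 0, None)] * (impressions - clicks)
    return rows


def fit_click_rates(rows):
    """Fit the simplest possible click model: observed click rate per segment.

    The cause column is deliberately ignored; the model only sees the label.
    """
    shown = defaultdict(int)
    clicked = defaultdict(int)
    for segment, label, _cause in rows:
        shown[segment] += 1
        clicked[segment] += label
    return {segment: clicked[segment] / shown[segment] for segment in shown}


def accidental_share(rows, segment=None):
    """Fraction of clicks that are accidental, optionally within one segment."""
    causes = [c for s, y, c in rows if y == 1 and (segment is None or s == segment)]
    return sum(c == "accidental" for c in causes) / len(causes)


rows = build_log(SEGMENTS)
rates = fit_click_rates(rows)
ranking = sorted(rates, key=rates.get, reverse=True)

print("segments ranked by predicted click rate:", ranking)
print("accidental share of all clicks: %.3f" % accidental_share(rows))
print("accidental share in top-ranked segment: %.3f" % accidental_share(rows, ranking[0]))
```

In this made-up data, accidental clicks are under 7% of all clicks, yet the segment the model scores highest consists entirely of them, because that is where the click label is easiest to predict. Targeting the top of the ranking would therefore optimize for the less relevant cause.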

Speaker: Claudia Perlich

Chief Scientist at Dstillery

Claudia Perlich currently acts as chief scientist at Dstillery (previously m6d) and in this role designs, develops, analyzes, and optimizes the machine learning that drives digital advertising. She has published more than 50 scientific articles and holds multiple patents in machine learning. She has won many data mining competitions and best paper awards at KDD and is acting as General Chair for KDD 2014. Before joining m6d in February 2010, Perlich worked in the Predictive Modeling Group at IBM’s T. J. Watson Research Center, concentrating on data analytics and machine learning for complex real-world domains and applications. She holds a PhD in information systems from NYU and an MA in computer science from Colorado University and teaches in the Stern MBA program at NYU. @claudia_perlich


Conference for Professional Software Developers