This document summarizes a presentation about machine learning on Hadoop data lakes. It introduces the two speakers: Michal Iwanowski, Product Director at DeepSense.io, and Piotr Niedzwiedz, CTO at DeepSense.io. It then discusses challenges with machine learning algorithms and technologies for big data, including techniques like one-hot encoding, hashing, and online learning. Finally, it proposes a model benchmarking tool and DS Studio architecture to address limitations of existing tools for flexible data transformation and full big data support.