SQL is not enough
As Data Scientists, we always have to roll up our sleeves to perform data processing/engineering jobs in data science pipelines.
It's widely accepted that SQL is core skill for performing ETL. However, I feel Python is equally important for its flexibility and thriving community, it's like Swiss Army Knife in your pocket!!
Every data scientist using Python would easily know how to apply Pandas, Numpy, Matplotlib, Seaborn, Plotly, Sklearn and other Stats modules on tabular data.
However, as Data Science field is converging with Software Engineering, I would add 10 more modules/tricks in the list to write production ready code.
At times few things are very easy to do in SQL however I personally feel most of the times Python wins, it has extensive library of modules to perform complex data manipulations.
Share your favorite python modules to help others! Thank you!
#Python #Learning #DataScience #DataEngineering