DataScience

Python Data Science: Analysis, Wrangling, and Visualization

23 tutorials beginner / intermediate

Data science in Python revolves around a powerful stack: pandas for data manipulation, NumPy for numerical computing, Matplotlib and Seaborn for visualization, and scikit-learn for modeling. Whether you are cleaning messy CSVs, joining datasets, computing statistics, or building dashboards, these tools form the foundation.

This collection covers the full data science workflow from loading and wrangling data through analysis, visualization, and integration with databases and modern tools like Polars and PySpark.

Tutorials marked with the cert badge include a final exam that awards a certificate of completion you can download and share.

01 DataFrames and Core Tools 6 tutorials

Python for Data Science

Overview of the Python data science ecosystem and the role each library plays.

Beginner read()

DataFrames in Python

Understanding DataFrame structure, creation, indexing, and basic operations.

Beginner read()

Working with pandas DataFrames

Practical pandas operations: filtering, grouping, aggregation, and transformation.

Intermediate read()

Joining Data Structures with pandas

Merge, join, concat, and combining DataFrames from multiple sources.

Intermediate read()

NumPy Multidimensional Arrays and Matrices

NumPy array creation, operations, broadcasting, and linear algebra fundamentals.

Intermediate read()

Python Array Computation Libraries

Comparing NumPy, CuPy, JAX, and other array computation options.

Intermediate read()

02 Data Wrangling and Processing 4 tutorials

Data Wrangling with Python

Cleaning, reshaping, and preparing real-world data for analysis.

Intermediate read()

Data Normalization in Python

Min-Max scaling, Z-score standardization, robust scaling, and scikit-learn pipelines.

Intermediate read()

How to Parse CSV in Python

Reading and writing CSVs with the csv module, pandas, and handling edge cases.

Beginner read()

Understanding Pipelines in Python

Building data processing pipelines for reproducible, maintainable analysis workflows.

Intermediate read()

03 Visualization and Statistics 5 tutorials

Seaborn in Python

Statistical visualization with Seaborn: distributions, relationships, categories, and custom styling.

Intermediate read()

Python statsmodels

Statistical modeling, hypothesis testing, regression analysis, and time series with statsmodels.

Intermediate read()

Analyzing Financial Data with Python

Working with financial datasets: time series, returns, moving averages, and risk metrics.

Intermediate read()

Python Financial Data Smoothing

Smoothing techniques for noisy financial data: moving averages, exponential smoothing, and filters.

Intermediate read()

Using Python in Power BI: The Complete, No-Nonsense Guide

All three integration modes (data source, Power Query transformation, visual), the PythonScriptWrapper mechanics, Service runtime constraints, the May 2026 deprecation, Microsoft Fabric Semantic Link, and four real-world use case walkthroughs. Approximately 2.5 hours. Certificate of completion available.

Intermediate cert read()

04 Databases and Modern Tools 8 tutorials

Python Data Science: Analysis, Wrangling, and Visualization

Python for Data Science

DataFrames in Python

Working with pandas DataFrames

Joining Data Structures with pandas

NumPy Multidimensional Arrays and Matrices

Python Array Computation Libraries

Data Wrangling with Python

Data Normalization in Python

How to Parse CSV in Python

Understanding Pipelines in Python

Seaborn in Python

Python statsmodels

Analyzing Financial Data with Python

Python Financial Data Smoothing

Using Python in Power BI: The Complete, No-Nonsense Guide

SQL with Python

cursor.execute() in Python Database Programming

Python oracledb Guide

Understanding Polars in Python

PyArrow: Columnar Engine for Python Data

PySpark Window Functions

Partition Columns in Python

Python vs R Programming