author: Parul Pandey id: course-1 summary: Master the essentials of Pandas and NumPy. categories: python status: Published home_url: https://parulnith.github.io/courses/
Welcome! In this module, we cover data manipulation basics - tools every Data Scientist needs. We will focus on the “Holy Trinity” of Python Data Science:
Duration: 5:00
NumPy is the backbone of almost all data libraries in Python. It allows for fast operations on arrays.
import numpy as np
# Create a 1D array
arr = np.array([1, 2, 3, 4, 5])
# Perform vector operations
print(arr * 2)
Duration: 10:00
Pandas introduces the DataFrame, a powerful tool for handling tabular data.
# Loading a Dataset
import pandas as pd
# Creating a simple dataframe
data = {
'Name': ['Alice', 'Bob', 'Charlie'],
'Age': [25, 30, 35],
'City': ['New York', 'London', 'Paris']
}
df = pd.DataFrame(data)
print(df)
Duration: 8:00 Before using advanced libraries, it’s important to understand matplotlib.
import matplotlib.pyplot as plt
plt.plot([1, 2, 3, 4], [10, 20, 25, 30])
plt.title("Simple Growth Chart")
plt.show()
Duration: 10:00
To get hands-on experience, we have prepared an interactive Marimo notebook. This allows you to run code directly in your browser without installing anything.
You’ve successfully:
Learned how to use NumPy arrays.
Created your first Pandas DataFrame.
Visualized basic data points.
Next Step: Head over to Course 2: Visual Storytelling to learn how to make these charts look professional.