Data Science Tutorials
Data Science provides a comprehensive curriculum for aspiring data scientists and analysts. Our tutorials cover statistical analysis, machine learning, and data visualization, suitable for both beginners and intermediate learners. Through interactive labs and hands - on coding exercises, you'll gain practical experience with real - world datasets. Our data science playground allows you to apply your skills in a dynamic online environment.
Other Skill Trees
LinuxDevOpsCybersecurityKali LinuxDevOps EngineerCybersecurity EngineerDatabaseRed Hat Enterprise LinuxCompTIADockerCKS TrainingCKAD TrainingCKA TrainingKubernetesDevSecOpsLFCS TrainingPythonGitShellNmapWiresharkHydraJavaSQLitePostgreSQLMySQLRedisMongoDBGolangC++CJenkinsAnsiblePandasNumPyscikit-learnMatplotlibWeb DevelopmentHTMLCSSJavaScriptReact
NumPy Broadcasting
Broadcasting is a powerful feature in NumPy that allows arrays with different shapes to be used in arithmetic operations. It provides a way to vectorize array operations and improve computational efficiency. This lab will guide you through the basics of broadcasting in NumPy.
NumPyPython
Pandas Introduction and Setup
In this lab, you will get started with Pandas, a powerful data analysis library in Python. You will learn how to verify its installation, import it, create a basic Series, access its elements, and inspect its properties.
Pandas
NumPy Copies and Views
In this lab, you will learn the basics of working with NumPy arrays. NumPy is a powerful library for numerical computing in Python. It provides efficient data structures and functions for performing mathematical operations on arrays.
NumPyPython
Pandas Sorting Data
In this lab, you will learn the essential techniques for sorting data in a Pandas DataFrame. You'll explore sorting by single and multiple columns, controlling the sort order, and managing the DataFrame's index after sorting operations.
Pandas
Pandas Grouping and Aggregating
In this lab, you will learn the fundamentals of data grouping and aggregation using the Pandas library. You'll practice using groupby() to create groups and apply various aggregation functions.
Pandas
Pandas Basic Data Cleaning
In this lab, you will learn the fundamental techniques for cleaning data using the Pandas library, including handling missing values, removing duplicates, and correcting data types.
Pandas
Pandas Reading External Data
In this lab, you will learn the fundamentals of reading external data into a Pandas DataFrame. You will use the powerful `read_csv` function and its key parameters to handle various real-world CSV file formats.
Pandas
Scikit-learn Installation and Setup
In this lab, you will learn how to verify your scikit-learn installation, import necessary modules, and load a sample dataset to get started with machine learning in Python.
scikit-learn
Scikit-learn Data Preprocessing
In this lab, you will learn the fundamental data preprocessing techniques in scikit-learn, including feature scaling with StandardScaler and target encoding with LabelEncoder, using the classic Iris dataset.
scikit-learn
Pandas Creating DataFrames
In this lab, you will learn the fundamental ways to create Pandas DataFrames, including from dictionaries, and how to customize their columns and indexes.
Pandas
Pandas Descriptive Statistics
In this lab, you will learn how to compute various descriptive statistics for a Pandas DataFrame, including mean, median, min/max, and more.
Pandas
NumPy Universal Functions
In this lab, we will explore the basics of NumPy Universal Functions (ufuncs). Ufuncs are functions that operate on ndarrays in an element-by-element fashion, supporting array broadcasting, type casting, and other standard features. We will learn about the different methods of ufuncs, broadcasting rules, type casting rules, and how to override ufunc behavior.
NumPyPython
NumPy Structured Arrays
In this lab, we will learn about structured arrays in NumPy. Structured arrays are ndarrays whose datatype is a composition of simpler datatypes organized as a sequence of named fields. They are useful for working with structured data, such as tabular data, where each field represents a different attribute of the data.
NumPyPython
Pandas Selecting Data
In this lab, you will learn the fundamental techniques for selecting and subsetting data from Pandas DataFrames, including selecting columns, rows, and specific slices of data.
Pandas
Matplotlib Subplots Creation
In this lab, you will learn how to create and customize multiple subplots in a single figure using Matplotlib, a powerful plotting library in Python. You will practice creating subplots, plotting data on them, and adjusting layouts.
Matplotlib
Scikit-learn Data Loading and Exploration
In this lab, you will learn the fundamentals of loading and exploring datasets in scikit-learn using the classic Iris dataset. You will practice accessing data, targets, and feature names, and perform a simple visualization.
scikit-learn
Scikit-learn KNN Classification
In this lab, you will learn how to use scikit-learn to build a K-Nearest Neighbors (KNN) classifier, train it on the Iris dataset, and make predictions on new data.
scikit-learn
Scikit-learn Cross-Validation
In this lab, you will learn how to perform cross-validation using scikit-learn to evaluate the performance of a machine learning model more robustly.
scikit-learn
- Prev
- 1
- 2
- 3
- 4
- 5
- 6
- ...
- 165
- Next