Introduction
In this lab, we will learn how to use the pandas DataFrame get() method in Python. The get() method retrieves an item from the DataFrame for a given key, such as a column name. If the key is not found, it returns a default value (None unless you specify otherwise) instead of raising a KeyError.
Import the necessary libraries
To start, we need to import the pandas library to use the DataFrame object.
import pandas as pd
Create a DataFrame
Let's create a simple DataFrame to work with. We will use the pd.DataFrame() constructor, passing a dictionary that maps each column name to a list of values.
df = pd.DataFrame({"Name": ["Navya", "Vindya"], "Age": [25, 24], "Education": ["M.Tech", "Ph.d"], "YOP": [2019, None]})
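Before calling get(), it can help to inspect what was created. The following is a minimal sketch that rebuilds the same DataFrame and prints its contents and shape; the variable name missing_yop is introduced here only for illustration.

```python
import pandas as pd

# Same data as in the lab step above.
df = pd.DataFrame({
    "Name": ["Navya", "Vindya"],
    "Age": [25, 24],
    "Education": ["M.Tech", "Ph.d"],
    "YOP": [2019, None],
})

# Show the full table and its dimensions (rows, columns).
print(df)
print(df.shape)

# The None in "YOP" is stored as a missing value; count how many there are.
missing_yop = int(df["YOP"].isna().sum())
print(missing_yop)
```

Note that the second row has no year of passing, so that cell is treated as missing data.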
Get a Single Column
To retrieve a single column from the DataFrame, pass the column name as the key to the get() method. The result is a pandas Series.
age_column = df.get("Age")
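As a self-contained sketch of this step, the example below retrieves the "Age" column and checks that the result matches ordinary bracket indexing. The name same_as_brackets is an illustrative helper, not part of the lab.

```python
import pandas as pd

df = pd.DataFrame({
    "Name": ["Navya", "Vindya"],
    "Age": [25, 24],
    "Education": ["M.Tech", "Ph.d"],
    "YOP": [2019, None],
})

# get() with an existing column name returns that column as a Series.
age_column = df.get("Age")
print(type(age_column).__name__)
print(age_column.tolist())

# For a key that exists, get() gives the same result as df["Age"].
same_as_brackets = age_column.equals(df["Age"])
print(same_as_brackets)
```

The difference from df["Age"] only shows up when the key is missing: bracket indexing raises a KeyError, while get() returns the default.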
Get Multiple Columns
To retrieve multiple columns from the DataFrame, pass a list of column names to the get() method. The result is a new DataFrame containing only those columns.
columns = df.get(["Name", "Education"])
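The sketch below illustrates the list form and one behavior worth knowing: if any name in the list is not a column, get() returns the default for the whole call rather than a partial result. The "Skills" column is assumed not to exist, and the variable name partial is illustrative.

```python
import pandas as pd

df = pd.DataFrame({
    "Name": ["Navya", "Vindya"],
    "Age": [25, 24],
    "Education": ["M.Tech", "Ph.d"],
    "YOP": [2019, None],
})

# A list of existing column names returns a DataFrame with those columns.
columns = df.get(["Name", "Education"])
print(type(columns).__name__)
print(list(columns.columns))

# "Skills" is not a column, so the whole lookup falls back to the default (None).
partial = df.get(["Name", "Skills"])
print(partial)
```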
Specify a Default Value
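If the specified key is not found in the DataFrame, get() returns None by default. To return something else, pass it as the default parameter. Note that the default is returned exactly as given: in the example below, the result is the string "Key not found", not a column.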
skills_column = df.get("Skills", default="Key not found")
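To make the fallback behavior concrete, here is a small sketch comparing a call without a default, a call with a string default, and a call whose default is itself a Series. The names no_default and fallback_series, and the "unknown" placeholder value, are assumptions made for illustration.

```python
import pandas as pd

df = pd.DataFrame({
    "Name": ["Navya", "Vindya"],
    "Age": [25, 24],
    "Education": ["M.Tech", "Ph.d"],
    "YOP": [2019, None],
})

# Without a default, a missing key yields None.
no_default = df.get("Skills")
print(no_default)

# With a default, the default object itself is returned unchanged.
skills_column = df.get("Skills", default="Key not found")
print(skills_column)

# A Series can serve as the default when later code expects column-like data.
fallback_series = df.get(
    "Skills",
    default=pd.Series(["unknown"] * len(df), index=df.index),
)
print(fallback_series.tolist())
```

Using a Series as the default keeps the return type consistent, which can spare downstream code from checking whether it received a column or a plain value.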
Summary
In this lab, we learned how to use the get() method on a pandas DataFrame in Python. We saw how to retrieve single and multiple columns, and how to specify a default value to be returned when the key is not found. Because it does not raise an error for a missing key, get() is useful when a column may or may not be present.