Introduction
This lab demonstrates how to precompute the k nearest neighbors before using them in KNeighborsClassifier. KNeighborsClassifier can compute the nearest neighbors internally, but precomputing them can have several benefits, such as finer parameter control, caching for multiple use, or custom implementations. Here we use the caching property of pipelines to cache the nearest neighbors graph between multiple fits of KNeighborsClassifier.
VM Tips
After the VM startup is done, click the top left corner to switch to the Notebook tab to access Jupyter Notebook for practice.
Sometimes, you may need to wait a few seconds for Jupyter Notebook to finish loading. The validation of operations cannot be automated because of limitations in Jupyter Notebook.
If you face issues during learning, feel free to ask Labby. Provide feedback after the session, and we will promptly resolve the problem for you.
Skills Graph
%%%%{init: {'theme':'neutral'}}%%%%
flowchart RL
sklearn(("`Sklearn`")) -.-> sklearn/DataPreprocessingandFeatureEngineeringGroup(["`Data Preprocessing and Feature Engineering`"])
sklearn(("`Sklearn`")) -.-> sklearn/UtilitiesandDatasetsGroup(["`Utilities and Datasets`"])
sklearn(("`Sklearn`")) -.-> sklearn/ModelSelectionandEvaluationGroup(["`Model Selection and Evaluation`"])
sklearn(("`Sklearn`")) -.-> sklearn/CoreModelsandAlgorithmsGroup(["`Core Models and Algorithms`"])
ml(("`Machine Learning`")) -.-> ml/FrameworkandSoftwareGroup(["`Framework and Software`"])
sklearn/DataPreprocessingandFeatureEngineeringGroup -.-> sklearn/pipeline("`Pipeline`")
sklearn/UtilitiesandDatasetsGroup -.-> sklearn/datasets("`Datasets`")
sklearn/ModelSelectionandEvaluationGroup -.-> sklearn/model_selection("`Model Selection`")
sklearn/CoreModelsandAlgorithmsGroup -.-> sklearn/neighbors("`Nearest Neighbors`")
ml/FrameworkandSoftwareGroup -.-> ml/sklearn("`scikit-learn`")
subgraph Lab Skills
sklearn/pipeline -.-> lab-49072{{"`Caching Nearest Neighbors`"}}
sklearn/datasets -.-> lab-49072{{"`Caching Nearest Neighbors`"}}
sklearn/model_selection -.-> lab-49072{{"`Caching Nearest Neighbors`"}}
sklearn/neighbors -.-> lab-49072{{"`Caching Nearest Neighbors`"}}
ml/sklearn -.-> lab-49072{{"`Caching Nearest Neighbors`"}}
end