How to build a source distribution of a Python package

PythonPythonBeginner
Practice Now

Introduction

This tutorial will guide you through the process of building a source distribution for your Python package. Whether you're developing a reusable module or a full-fledged application, understanding how to package and distribute your Python code is a crucial skill. By the end of this tutorial, you'll be able to create a professional-grade Python package that can be easily shared and installed by others.


Skills Graph

%%%%{init: {'theme':'neutral'}}%%%% flowchart RL python(("`Python`")) -.-> python/ModulesandPackagesGroup(["`Modules and Packages`"]) python/ModulesandPackagesGroup -.-> python/importing_modules("`Importing Modules`") python/ModulesandPackagesGroup -.-> python/creating_modules("`Creating Modules`") python/ModulesandPackagesGroup -.-> python/using_packages("`Using Packages`") python/ModulesandPackagesGroup -.-> python/standard_libraries("`Common Standard Libraries`") subgraph Lab Skills python/importing_modules -.-> lab-397946{{"`How to build a source distribution of a Python package`"}} python/creating_modules -.-> lab-397946{{"`How to build a source distribution of a Python package`"}} python/using_packages -.-> lab-397946{{"`How to build a source distribution of a Python package`"}} python/standard_libraries -.-> lab-397946{{"`How to build a source distribution of a Python package`"}} end

Understanding Python Packages

What is a Python Package?

A Python package is a collection of Python modules organized in a hierarchical structure, which allows for better code organization, reusability, and distribution. Packages provide a way to group related modules together, making it easier to manage and maintain your code.

Modules vs. Packages

A Python module is a single Python file that contains definitions and statements. Modules can be imported and used in other Python scripts or programs. A package, on the other hand, is a directory that contains one or more Python modules, as well as an __init__.py file, which is used to define the package's structure and behavior.

Advantages of Using Packages

  1. Modularity: Packages allow you to organize your code into logical units, making it easier to manage and maintain.
  2. Reusability: Packages enable you to reuse code across different projects, promoting code reuse and efficiency.
  3. Namespace Management: Packages help manage namespace conflicts by providing a hierarchical structure for your modules.
  4. Distribution: Packages can be easily distributed and installed, making it simpler to share your code with others.

Anatomy of a Python Package

A Python package typically consists of the following elements:

  • __init__.py: This file is required for a directory to be recognized as a package. It can contain initialization code, package-level variables, and more.
  • module1.py: A Python module file that is part of the package.
  • module2.py: Another Python module file that is part of the package.
  • subpackage/: A subdirectory that contains additional modules or subpackages.
graph TD package[my_package] --> init[__init__.py] package --> module1[module1.py] package --> module2[module2.py] package --> subpackage[subpackage/] subpackage --> submodule[submodule.py]

Importing Packages and Modules

To use a package or module, you need to import it. Here's an example:

## Importing the entire package
import my_package

## Importing a specific module from the package
from my_package import module1

## Importing a specific function from a module
from my_package.module2 import my_function

By understanding the concept of Python packages, you can organize your code effectively and distribute your work as reusable components.

Packaging Your Python Module

Understanding the Setup.py File

The setup.py file is the heart of a Python package distribution. It contains metadata about your package, such as the name, version, author, description, and dependencies. Here's an example setup.py file:

from setuptools import setup, find_packages

setup(
    name='my_package',
    version='1.0.0',
    description='A sample Python package',
    author='LabEx',
    packages=find_packages(),
    install_requires=[
        'numpy>=1.19.2',
        'pandas>=1.1.3',
    ],
)

Creating a Source Distribution

To create a source distribution of your Python package, you can use the setuptools library. Follow these steps:

  1. Create the setup.py file in the root directory of your package.

  2. Open a terminal and navigate to the root directory of your package.

  3. Run the following command to create the source distribution:

    python setup.py sdist

    This will create a dist directory in your package's root directory, containing a .tar.gz file, which is the source distribution of your package.

Anatomy of a Source Distribution

A source distribution of a Python package typically contains the following files:

  • my_package/: The directory containing your package's modules and subpackages.
  • my_package.egg-info/: Metadata about your package, including the PKG-INFO file.
  • setup.py: The file used to build and distribute your package.
  • MANIFEST.in: An optional file that specifies additional files to include in the distribution.
graph TD dist[dist/] --> tarball[my_package-1.0.0.tar.gz] root[my_package/] --> init[__init__.py] root --> module1[module1.py] root --> module2[module2.py] root --> setup[setup.py] root --> manifest[MANIFEST.in] root --> egginfo[my_package.egg-info/] egginfo --> pkginfo[PKG-INFO]

By creating a source distribution of your Python package, you can easily share your code with others and make it available for installation.

Distributing Your Python Package

Uploading to PyPI

The Python Package Index (PyPI) is the official repository for Python packages. To distribute your package on PyPI, follow these steps:

  1. Create an account on the PyPI website.

  2. Install the twine package, which is used to upload your package to PyPI:

    pip install twine
  3. Run the following commands to upload your package:

    python setup.py sdist
    twine upload dist/*

    This will create the source distribution and then upload it to PyPI.

Installing Your Package

Once your package is available on PyPI, users can install it using the pip command:

pip install my_package

This will download and install the latest version of your package from PyPI.

Versioning and Semantic Versioning

When distributing your package, it's important to use semantic versioning to indicate the level of changes in your package. Semantic versioning follows the format MAJOR.MINOR.PATCH, where:

  • MAJOR version changes indicate incompatible API changes.
  • MINOR version changes indicate new features or functionality added in a backwards-compatible manner.
  • PATCH version changes indicate bug fixes or other minor changes.

This helps users understand the level of changes in your package and plan their upgrades accordingly.

Hosting Your Own Package Repository

If you don't want to distribute your package on PyPI, you can host your own package repository. There are several options for this, such as:

By hosting your own package repository, you can have more control over the distribution and versioning of your package.

By following these steps, you can successfully distribute your Python package to the wider community and make it available for installation.

Summary

In this comprehensive Python tutorial, you've learned the essential steps to build a source distribution for your Python package. From understanding the fundamentals of Python packaging to distributing your package, you now have the knowledge and tools to package and share your Python code with the world. By following these best practices, you can ensure that your Python package is easily accessible, installable, and maintainable by your users.

Other Python Tutorials you may like