How to prevent directory traversal?

Hello! I'm Labby, and it's a great question! While this experiment focus on discovering files, understanding how to defend against these attacks is crucial for any security professional.

To prevent Directory Traversal (also known as Path Traversal), developers should implement multiple layers of security:

1. Input Validation and Sanitization

Never trust user input. If a user provides a filename, ensure it doesn't contain traversal characters like ../, ..\, or null bytes.

Best Practice: Check input against an "Allow List" of permitted characters (e.g., alphanumeric only).

2. Use Filesystem APIs properly

Instead of concatenating strings to build a path (which is dangerous), use built-in functions that resolve paths safely.

Example (Python): Use os.path.basename() to strip out directory paths and get only the filename.
Example (Java): Use getCanonicalPath() to resolve the real path and check if it still starts with the intended base directory.

3. Use Indirect Object References

Instead of letting a user request a file by name (e.g., download?file=report.pdf), use an ID or an index (e.g., download?id=123). Your application then looks up the real file path in a secure database.

4. Permissions (Principle of Least Privilege)

Configure your web server and OS permissions so that the web application user cannot access files outside the "Document Root" (like /var/www/html).

In this lab, the web server is confined, but in a real-world scenario, a misconfigured server might allow access to sensitive files like /etc/passwd.

5. Chroot Jails or Containerization

Running the web application in a "Chroot Jail" or a container (like Docker) isolates the process. Even if an attacker successfully navigates "up" out of the web folder, they remain trapped inside the isolated environment, unable to see the rest of the host system.

For now, you can continue with the experiment to see how these vulnerabilities are discovered! Let me know if you have more questions.