Developing Tests

This document provides comprehensive guidelines for developing and maintaining tests in the Garden Linux test framework.

Test Development Principles

The following principles guide all test development in Garden Linux:

Core Principles

1. Be easy to read and understand (without extensive knowledge of Garden Linux internals)

  • Use native Python APIs over shell scripts where feasible
  • Write clear, self-documenting test names and assertions
  • Avoid complex logic in test functions

2. Be explicit about what quality they ensure

  • Test names must clearly communicate what is broken if the test fails
  • In general, one test should not have multiple assertions (there might be valid exceptions)
  • Each test should verify a single, specific behavior

3. Be very strict about declaring if they mutate system state

  • Use appropriate markers (@pytest.mark.modify, @pytest.mark.root) to declare system modifications, as per ADR-0007
  • Document (reason=) why system modifications are necessary
  • Ensure tests clean up after themselves.
    • If new functionality is added, check if tests/plugins/sysdiff.py collects modifications, as specified in ADR-0022.

4. Only run as root when needed

  • Use @pytest.mark.root only when root privileges are absolutely necessary
  • Document (reason=) why root access is required
  • Prefer unprivileged testing when possible

5. Target appropriate test environments and platforms

  • Use @pytest.mark.booted to mark tests that require a full booted system (such as QEMU or cloud VMs) to function correctly.
  • Use @pytest.mark.feature to restrict tests to only those environments or platforms where they are intended to run, especially if they would fail elsewhere.
  • Whenever possible, design your tests so that they work across all supported environments and platforms. Only exclude an environment or platform if there’s a strong, well-documented reason to do so.
  • If possible, still add a minimal test for excluded platforms
  • Document (reason=) why a test must run (or be excluded) in certain environments or platforms
  • For a list of all available test environments (like chroot, QEMU, cloud, and OCI), see Test Environment Details.

6. Use abstractions judiciously to hide implementation details

  • Leverage plugins for infrastructure concerns (parsing files, accessing data, establishing connections)
  • Use handlers for setup/teardown operations
  • Keep test logic visible and maintain Arrange-Act-Assert structure
  • Avoid over-abstraction that requires reading multiple plugins to understand a test

Parser plugins (ADR-0026)
  • Use the default parsing plugins (parse, parse_file) for files and command output to keep comment handling, format support, and errors consistent
  • Skip ad-hoc parsing (Path.read_text(), direct json.loads(), regex scraping) when a parser plugin covers the case
  • Add a domain-specific parser plugin when parsing repeats, needs special handling beyond the defaults, or clearly improves readability/maintainability
  • For examples, see Parsing Plugins

7. Be mindful about external dependencies

  • Prefer Python standard library over third-party packages, following ADR-0016
  • Only add PyPI dependencies when there's clear benefit
  • Document why external dependencies are necessary

8. Handlers must restore system state in teardown phase

  • Handlers (yield fixtures) that modify system state must restore the original state after tests complete, as required by ADR-0007
  • This cleanup is only required when tests run with --allow-system-modifications and are marked with @pytest.mark.modify
  • The pattern is: save initial state → yield to test → restore initial state in teardown
  • Handlers must track what they changed and reverse those changes in reverse order
  • Examples of state to restore: service status, kernel modules, installed packages, filesystem changes, network configuration
  • For examples, see Handlers for Setup/Teardown

9. Use Test Coverage Markers

  • See the Test Coverage Markers section below and the linked Test Coverage Documentation for the available markers

Framework Structure

How Tests, Plugins, and Handlers Connect

The framework uses pytest's plugin system to automatically register fixtures:

  1. Plugins (tests/plugins/) - Provide fixtures for system access
  2. Handlers (tests/handlers/) - Provide fixtures for setup/teardown
  3. Tests (tests/integration/test_*.py) - Use fixtures via dependency injection

Registration: All plugins are automatically registered as pytest fixtures via conftest.py
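
Below is a minimal, illustrative sketch of how these pieces fit together; the module and fixture names are examples, not the actual framework code:

python
# tests/conftest.py (illustrative): plugins and handlers are exposed as pytest plugins
pytest_plugins = [
    "plugins.systemd",    # provides the `systemd` fixture
    "handlers.services",  # provides handler fixtures such as `service_ssh`
]

# tests/integration/core/test_services.py (illustrative):
# fixtures arrive in the test via dependency injection by name
def test_ssh_service_running(systemd, service_ssh):
    assert systemd.is_active("ssh"), "SSH service is not running"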

Plugins (tests/plugins/)

Plugins are pytest fixtures that handle infrastructure concerns and system interactions:

  • Purpose: Provide clean APIs for system access (file parsing, service management, etc.)
  • Usage: Provide pytest fixtures that can be injected into test functions
  • Examples: Systemd, Sshd, ShellRunner, KernelModule
  • Guideline: Handle "how to access" not "what to test"
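
As a rough sketch (assuming a hypothetical, simplified implementation), a plugin typically wraps system access in a small class and exposes it as a fixture:

python
import subprocess

import pytest


class Systemd:
    """Thin wrapper around systemctl for use in tests (simplified sketch)."""

    def is_active(self, unit: str) -> bool:
        # systemctl exits with 0 only if the unit is active
        result = subprocess.run(["systemctl", "is-active", "--quiet", unit])
        return result.returncode == 0


@pytest.fixture
def systemd() -> Systemd:
    return Systemd()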

Handlers (tests/handlers/)

Handlers are pytest fixtures that manage test setup and teardown:

  • Purpose: Setup/teardown of test state (connections, services, environments)
  • Pattern: Yield fixtures that prepare resources and explicitly clean up after tests
  • Usage: Used as pytest fixtures with yield for cleanup
  • Examples: service_ssh, service_containerd

Key distinction: Unlike regular fixtures that provide data, handlers manage stateful resources that need explicit cleanup.

Utils (tests/plugins/utils.py)

Utility functions provide reusable functionality:

  • Purpose: Helper functions not used directly in tests
  • Usage: Used by plugins and handlers
  • Examples: equals_ignore_case, parse_etc_file

Test Organization and Naming

Directory Structure

All test files are placed into tests/integration/{category}. The following tree structure shows how tests are organized by functional area:

tests/integration/
├── boot/                    # Boot-related tests (ignition, cloud-init, initrd, secureboot, etc.)
├── core/                    # Core system functionality (services, network, users, logging, etc.)
├── infrastructure/          # Infrastructure and platform tests (cloud platforms, iscsi, nvme, kvm, metal)
├── kernel/                  # Kernel-related tests (cmdline, modules, parameters, etc.)
├── runtime/                 # Runtime environment tests (containers, SAP, gardener, nodejs, etc.)
└── security/                # Security tests (SSH, firewall, PAM, capabilities, etc.)
    └── compliance/          # Compliance tests (CIS, FIPS, STIG, FedRAMP)

The purpose of categorizing tests is to improve maintainability and discoverability. By grouping related tests together, developers can more easily:

  • Locate existing tests for a specific functional area
  • Understand the scope and coverage of the test suite
  • Organize test execution by category when needed
  • Maintain consistency when adding new tests

These categories are subject to change as new tests are added and the test suite evolves.

File Naming Convention

Test files follow the pattern test_*.py and should be named based on the functionality they test:

  • test_ignition.py (in boot/) - Ignition configuration and functionality
  • test_services.py (in core/) - Enabled/disabled and started/stopped services
  • test_network.py (in core/) - Network configuration and connectivity
  • test_ssh.py (in security/) - SSH configuration and security
  • test_fips.py (in security/compliance/) - FIPS compliance tests

INFO

Tests are not strictly tied to features in the features folder anymore. Have a look at @pytest.mark.feature() if you need a test condition related to a feature.

Test Function Naming and Comments

Test functions should clearly describe what they verify by naming them accordingly and providing a useful comment:

python
def test_sshd_has_required_config(sshd_config_item: str, sshd: Sshd):
    """Test that SSH daemon has the required configuration values."""

def test_users_have_no_authorized_keys(expected_users):
    """Test that unmanaged users don't have SSH authorized keys."""

def test_startup_time(systemd: Systemd):
    """Test that system startup time is within acceptable limits."""

Feature-Based Organization

Tests are organized by functionality rather than Garden Linux features. However, feature-specific tests use the @pytest.mark.feature marker to limit execution:

python
@pytest.mark.feature("ssh")
def test_ssh_service_running(systemd: Systemd, service_ssh):
    assert systemd.is_active("ssh"), "SSH service is not running"

Test Writing Best Practices

Assertions Only in Test Code

Test functions should contain only assertions and minimal logic:

python
# Good: Clear assertion with descriptive message
def test_sshd_config(sshd: Sshd):
    actual_value = sshd.get_config_section("PermitRootLogin")
    assert actual_value == "No", f"PermitRootLogin should be 'No', got '{actual_value}'"

# Bad: Complex logic in test
def test_sshd_config(sshd: Sshd):
    config = sshd.get_all_config()
    for key, value in config.items():
        if key == "PermitRootLogin":
            if value != "No":
                raise AssertionError(f"Expected No, got {value}")
            break

Use Plugins for Infrastructure, Not Test Logic

The key is finding the right balance between abstraction and readability. Plugins should handle infrastructure concerns, not hide test logic:

Good abstractions (infrastructure concerns):

  • File parsing and data access
  • System service management
  • Network connections and sockets
  • Data processing and formatting

Avoid over-abstraction (test logic concerns):

  • Business logic validation
  • Test-specific assertions
  • Domain-specific checks
  • Complex test workflows
python
# Good: Plugin handles system interaction, test logic is clear
def test_service_running(systemd: Systemd):
    assert systemd.is_active("ssh"), "SSH service is not running"

# Good: Clear test logic with infrastructure abstraction
def test_sshd_permit_root_login(sshd: Sshd):
    """Test that SSH root login is disabled."""
    actual_value = sshd.get_config_section("PermitRootLogin")
    assert actual_value == "No", f"Root login should be disabled, got '{actual_value}'"

# Bad: Over-abstraction hiding test logic
def test_ssh_security_compliance(ssh_security: SshSecurity):
    """Test SSH security compliance."""
    assert ssh_security.is_secure(), "SSH configuration is not secure"

# Bad: Direct shell calls when plugin abstraction exists (or could be useful)
def test_service_running(shell: ShellRunner):
    result = shell("systemctl is-active ssh")
    assert result.stdout.strip() == "active", "SSH service is not running"

Parsing Plugins

Use the parsing plugins to keep file/command parsing consistent and readable:

python
# Command output parsing
def test_systemd_failed_units(shell, parse):
    result = shell("systemctl --no-legend --no-pager")
    lines = [line for line in result.stdout.splitlines() if line.strip()]
    states = [line.split()[2] for line in lines]  # columns: 0 UNIT, 1 LOAD, 2 ACTIVE
    assert all(state != "failed" for state in states)
python
# Command output parsing with JSON response
def test_systemd_failed_units(shell, parse):
    result = shell("systemctl --output=json")
    data = parse.from_str(result.stdout).parse(format="json")
    assert isinstance(data, list)
    assert all(unit["active"] != "failed" for unit in data)
python
# Structured File parsing with auto-detected keyval format
def test_dmesg_gardener_sysctl_no_restrictions_on_accessing_dmesg(parse_file):
    file_path = "/etc/sysctl.d/40-allow-nonroot-dmesg.conf"
    config = parse_file.parse(file_path)
    assert config["kernel.dmesg_restrict"] == "0"
python
# Structured File parsing with manually selected YAML format
def test_cloud_cfg_disables_ssh_pw_auth(parse_file):
    cfg = parse_file.parse("/etc/cloud/cloud.cfg", format="yaml")
    assert cfg["disable_root"] is True
    # Check if value is not in a list
    assert "resizefs" not in cfg["cloud_init_modules"]
    # Check if multiple values are in a list, in a certain order
    cfg = parse_file.parse("/etc/cloud/cloud.cfg", format="yaml", ordered=True)
    assert [ "mounts", "set_hostname" ] in cfg["cloud_init_modules"]
python
# Read a file line by line and use a regex to validate
import re

def test_machine_id_is_initialized(parse_file):
    lines = parse_file.lines("/etc/machine-id")
    assert re.compile(r"^[0-9a-f]{32}$") in lines

Handlers for Setup/Teardown

WARNING

Handlers that modify system state must restore the original state in the teardown phase. See Core Principle 8 above for detailed requirements and examples.

Use handlers (yield fixtures) for managing test state and cleanup:

  • Always check initial state before modifying
  • Only restore what you changed (don't stop services that were already running)
  • Clean up in reverse order of setup (especially important for dependencies like kernel modules)
  • Use ignore_exit_code=True for cleanup operations that might fail if already cleaned up
  • If new system modifications are introduced, verify that tests/plugins/sysdiff.py can detect them

Example: service test, including setup and teardown

python
@pytest.fixture
def service_ssh(systemd: Systemd):
    """Fixture for SSH service management with cleanup."""

    # Save initial state
    service_active_initially = systemd.is_active("ssh")

    if not service_active_initially:
        systemd.start_unit("ssh")

    # Yield to test
    yield "ssh"  # This returns "ssh" to the test as the fixture's value. This can be use to parametrize tests.

    # Teardown/Cleanup: restore original state
    if not service_active_initially:
        systemd.stop_unit("ssh")

Example: installing packages and loading kernel modules, including setup and teardown

python
TEST_NAME_MODULES = [ "nvme", "nvme_auth" ]
TEST_NAME_PACKAGES = [
    {"name": 'mount', "status": None},
    {"name": 'wget', "status": None},
]

@pytest.fixture
def test_name(shell: ShellRunner, dpkg: Dpkg, kernel_module: KernelModule):
    # Setup: whatever is needed as configuration
    for mod_name in TEST_NAME_MODULES:
        kernel_module.load_module(mod_name)
    for pkg in TEST_NAME_PACKAGES:
        if not dpkg.package_is_installed(pkg["name"]):
            shell(f"DEBIAN_FRONTEND=noninteractive apt-get install -y {pkg["name"]}")
            # Save Status change for teardown
            pkg["status"] == "Installed"
    # ... (additional setup code) ...

    # Yield to test
    yield

    # Teardown/Cleanup: reverse all changes in reverse order
    # ... (additional teardown code) ...
    # Uninstall needed packages again
    for pkg in TEST_NAME_PACKAGES:
        if pkg["status"] == "Installed":
            shell(f"DEBIAN_FRONTEND=noninteractive apt-get remove -y {pkg["name"]}")
    # Unload all modules loaded by load_module (automatically handles dependencies and order)
    kernel_module.unload_modules()

Utils for Helper Functions

Utility functions should not be used directly in tests but provide reusable functionality:

python
# In plugins/utils.py
def equals_ignore_case(a: str, b: str) -> bool:
    return a.casefold() == b.casefold()

# In test files - use through plugins
def test_config_value(sshd: Sshd):
    actual_value = sshd.get_config_section("LogLevel")
    assert equals_ignore_case(actual_value, "VERBOSE"), f"Expected VERBOSE, got {actual_value}"

Shell Calls vs Filesystem Lookups

Prefer filesystem lookups over shell calls when possible. Use shell calls only when:

  1. Testing shell functionality itself (command execution, shell features)
  2. No appropriate plugin abstraction exists and/or filesystem access is very complex
  3. Testing system-level behavior that requires shell execution
python
# Good: Direct filesystem access
def test_os_release():
    with open("/etc/os-release", "r") as f:
        content = f.read()
        assert "ID=gardenlinux" in content

# Good: Plugin abstraction for system interactions
def test_service_status(systemd: Systemd):
    assert systemd.is_active("ssh"), "SSH service is not running"

# Acceptable: Shell call when testing shell functionality
def test_shell_command_execution(shell: ShellRunner):
    """Test that shell commands execute correctly."""
    from datetime import datetime
    result = shell("date +%Y")
    current_year = str(datetime.now().year)
    assert result.stdout.strip() == current_year, f"Shell command execution failed, expected {current_year}"

# Bad: Shell call when plugin abstraction exists
def test_service_status(shell: ShellRunner):
    result = shell("systemctl is-active ssh")
    assert result.stdout.strip() == "active", "SSH service is not running"

Markers and Test Configuration

Core Markers

@pytest.mark.booted(reason="...")

Indicates the test requires a booted system:

python
@pytest.mark.booted(reason="Calling sshd -T requires a booted system")
def test_sshd_config(sshd: Sshd):
    # Test implementation

@pytest.mark.modify(reason="...")

Indicates the test modifies system state:

python
@pytest.mark.modify(reason="Starting the unit modifies the system state")
def test_service_start(systemd: Systemd):
    # Test implementation

@pytest.mark.root(reason="...")

Indicates the test requires root privileges:

python
@pytest.mark.root(reason="Starting the unit requires root")
def test_service_start(systemd: Systemd):
    # Test implementation

@pytest.mark.feature("condition", reason="...")

Limits test execution based on feature conditions:

python
@pytest.mark.feature("ssh and not (ali or aws or azure or openstack)",
                     reason="We want no authorized_keys for unmanaged users")
def test_users_have_no_authorized_keys(expected_users):
    # Test implementation

@pytest.mark.arch("condition", reason="...")

Limits test execution based on processor architecture:

python
@pytest.mark.arch("amd64", reason="Kernel Module only available on amd64")
def test_kernel_module_amd64(expected_module):
    # Test implementation

@pytest.mark.hypervisor(["name", "..."], reason="...")

Limits test execution to specific real hypervisor environments:

python
@pytest.mark.hypervisor(
    "amazon",
    reason="Only works on real AWS infrastructure due to NTP server access requirements.",
)
def test_correct_ntp_on_aws(timedatectl: TimeDateCtl):
    # Test implementation

Common Filtering Patterns

Environment-specific filtering:

python
# Only run on booted systems (QEMU, cloud)
@pytest.mark.booted(reason="Requires running system")

# Skip in container environments (OCI)
@pytest.mark.feature("not container", reason="Container environment not suitable")

Platform-specific filtering:

python
# Cloud provider specific tests
@pytest.mark.feature("aws", reason="AWS-specific functionality")
@pytest.mark.feature("azure", reason="Azure-specific functionality")
@pytest.mark.feature("gcp", reason="GCP-specific configuration")
@pytest.mark.feature("ali", reason="Alibaba Cloud-specific configuration")

# Hypervisor specific tests
@pytest.mark.hypervisor("amazon", reason="Relies on AWS specific services")
@pytest.mark.hypervisor("microsoft", reason="Relies on Azure specific services")
@pytest.mark.hypervisor("google", reason="Relies with Google Cloud specific metadata")
@pytest.mark.hypervisor("qemu", reason="Test asserts the presence of a service which is usually running on a real hypervisor")

# Other platform-specific tests
@pytest.mark.feature("openstack", reason="OpenStack-specific configuration")
@pytest.mark.feature("vmware", reason="VMware-specific configuration")

Complex environment combinations:

python
# Multiple conditions
@pytest.mark.feature("(gardener or chost) and not _pxe",
                     reason="containerd needed but not working in PXE environment")

# Feature combinations
@pytest.mark.feature("ssh and not (ali or aws or azure or openstack)",
                     reason="We want no authorized_keys for unmanaged users")

@pytest.mark.performance_metric

Marks performance tests that can be skipped under emulation:

python
@pytest.mark.performance_metric
@pytest.mark.booted(reason="We can only measure startup time if we actually boot the system")
def test_startup_time(systemd: Systemd):
    # Test implementation

Parametrization

Use @pytest.mark.parametrize to avoid repetitive, nearly identical tests where the differences can naturally be expressed via parameters. However, don't overuse parametrization or strive for ultra-generic test functions at the expense of clarity. If extracting a parameter makes the test clearer and avoids straightforward duplication, use it. But a small amount of code duplication is perfectly acceptable in tests if it keeps things readable and concrete.

Yes – good use of parametrization:

python
@pytest.mark.parametrize("username", ["alice", "bob", "peter"])
def test_home_directory_exists(username):
    home = Path(f"/home/{username}")
    assert home.exists(), f"Missing home for {username}"

No – avoid excessive abstraction for unclear wins:

python
# This is too abstract—each "case" has different logic, result, or context.
@pytest.mark.parametrize("input_val,expected", [(1, True), ("foo", False), ([], Exception)])
def test_weird_cases(input_val, expected):
    # ...too much unrelated logic for one test

Guideline: Prefer clarity and intent over DRY-ness in tests. Parametrize when it makes tests simpler, not just shorter.

Test Coverage Markers

Please have a look at the Test Coverage Documentation

Missing Markers (TODO)

@pytest.mark.security_id

TODO: Explain what this marker is good for.

Debugging Tests

When developing or maintaining tests, you'll often need to debug failing tests or understand test framework behavior.

Adding Debug Logging

Tests, plugins, and handlers can output additional debugging information using Python's logging framework. This is the primary way to provide visibility into what your code is doing:

python
import logging
logger = logging.getLogger(__name__)

def test_example(systemd: Systemd):
    """Test that demonstrates debug logging."""
    logger.debug("Checking SSH service status")
    is_active = systemd.is_active("ssh")
    logger.debug(f"SSH service active status: {is_active}")
    assert is_active, "SSH service should be running"

When to use debug logging

  • Plugin operations: Log when plugins access system resources, parse files, or interact with services
  • Complex logic: Add debug output for non-obvious operations or when troubleshooting logic errors
  • State transitions: Log when handlers set up or tear down resources
  • Error conditions: Provide context when operations might fail

Best practices

  • Use descriptive messages that explain what's happening, not just variable values
  • Include relevant context (file paths, service names, configuration values)
  • Avoid excessive logging in tight loops or frequently-called functions
  • Use appropriate log levels: logger.debug() for detailed info, logger.info() for important milestones
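
For example, a plugin method might log what it is about to do and what it observed (an illustrative sketch; the real KernelModule plugin may look different):

python
import logging
import subprocess

logger = logging.getLogger(__name__)


class KernelModule:
    def load_module(self, name: str) -> None:
        logger.debug(f"Loading kernel module '{name}' via modprobe")
        subprocess.run(["modprobe", name], check=True)
        logger.debug(f"Kernel module '{name}' loaded successfully")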

INFO

Have a look at the debug tests guide if you want to know how to view those debug logs when running tests.

Debugging Tests in a Booted VM

When you need to debug tests on a fully booted system (QEMU or a cloud VM), you can use the --dev flag to:

  • Quickly sync the local test distribution and test files into a VM.
  • Re-run tests on demand (oneshot) or automatically on file changes.

QEMU VM Workflow

Start VM

Run the tests with --dev (or --ssh --skip-cleanup --skip-tests --watch for the long version) to start a QEMU VM in dev mode.

bash
./test --dev .build/aws-gardener_prod-amd64-today-local.raw

You can also pass additional --test-args.

bash
./test --dev --test-args "test_ssh.py -v" .build/aws-gardener_prod-amd64-today-local.raw

This will:

  • Start a QEMU VM.
  • Sync .build/dist.tar.gz into the VM.
  • Sync the current tests/ tree.
  • Run pytest inside the VM with the provided --test-args once.
  • Wait for file changes in tests/ and features/.
  • Re-sync the changed tests into the VM.
  • Re-run pytest with your given arguments after each change.

You can stop watch mode at any time with Ctrl+C.

Cloud VM Workflow

Start VM

Run the tests with --dev (or --skip-cleanup --skip-tests --watch for the long version) to start a Cloud VM in dev mode.

bash
./test --dev --cloud azure .build/azure-gardener_prod-amd64-today-local.raw
# or use an already uploaded image
./test --dev --cloud azure \
    --cloud-image --image-requirements-file .build/azure-gardener_prod-amd64-today-local.requirements \
    /CommunityGalleries/gardenlinux-13e998fe-534d-4b0a-8a27-f16a73aef620/Images/gardenlinux-nvme-gardener_prod-amd64/Versions/2150.0.0

You can also pass additional --test-args.

bash
./test --dev --cloud azure --test-args "test_ssh.py -v" .build/azure-gardener_prod-amd64-today-local.raw

This will:

  • Deploy a Cloud VM.
  • Sync .build/dist.tar.gz into the VM.
  • Sync the current tests/ tree.
  • Run pytest inside the VM with the provided --test-args.
  • Wait for file changes in tests/ and features/.
  • Re-sync the changed tests into the VM.
  • Re-run pytest with your given arguments after each change.

You can stop watch mode at any time with Ctrl+C.

Debugging Tests in the Chroot Environment

The --dev flag works the same way as in the VM workflows described above.

Python Best Practices

Code Style and CI Enforcement

The project enforces code quality through CI linting (see .github/workflows/lint_tests.yml):

  • Black: Automatic code formatting
  • isort: Import statement sorting
  • Pyright: Static type checking

These tools are available for most IDEs and a wide range of text editors.

To improve your development experience, and to allow us to process your PRs faster, we suggest you enable them in your editor or IDE. Please refer to your editor's documentation for how to set them up. We highly recommend configuring your editor to automatically apply formatting and type checking on save.

Guidelines:

  1. All code contributions must follow the PEP 8 – Style Guide for Python Code
  2. The use of type hints is mandatory for all new additions to avoid typing related bugs and increase readability
  3. Use descriptive variable and function names
  4. Use docstrings to summarize functions and classes; the level of detail depends on the size, scope, and complexity of the function or class
  5. Initialize all variables with a sensible default value instead of using None (unless the value and its type hint are explicitly optional)
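
A short, hypothetical helper illustrating these guidelines (type hints, a concise docstring, and a sensible default instead of None):

python
def count_failed_units(unit_states: list[str]) -> int:
    """Count how many systemd units are in the failed state."""
    failed_count: int = 0  # initialized with a sensible default instead of None
    for state in unit_states:
        if state == "failed":
            failed_count += 1
    return failed_count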

TIP

Run make -f tests/dev.makefile format locally before committing

Error Handling

  • Use descriptive assertion messages
  • Handle expected failures gracefully
  • Use appropriate exception types
python
def read_file(path: str) -> str:
    """Read and return the contents of a file, failing with a descriptive message."""
    try:
        with open(path, "r", encoding="utf-8") as file:
            return file.read()
    except FileNotFoundError as exc:
        raise AssertionError(f"File not found: {path}") from exc
    except PermissionError as exc:
        raise AssertionError(f"Permission denied when accessing {path}") from exc
    except OSError as exc:
        raise AssertionError(f"Could not read file {path}: {exc}") from exc

Imports

  • Group imports: standard library, third-party, local
  • Use specific imports when possible
  • Avoid wildcard imports
python
import difflib
import gzip
import json
import os
import re
import shutil
from dataclasses import asdict, dataclass
from datetime import datetime
from pathlib import Path
from typing import Dict, List, Optional

import pytest
from debian import deb822

from .dpkg import Dpkg
from .kernel_module import KernelModule, LoadedKernelModule
from .shell import ShellRunner
from .sysctl import Sysctl, SysctlParam
from .systemd import Systemd, SystemdUnit

External Dependencies

Policy

  1. Prefer Python standard library over third-party packages
  2. Only add PyPI dependencies when there's clear benefit
  3. Document why external dependencies are necessary
  4. Consider maintenance cost of external dependencies

Current Dependencies

See tests/util/requirements.txt for current dependencies. Each dependency should be justified.

Adding New Dependencies

When adding new dependencies:

  1. Justify the need - Why can't the standard library solve this?
  2. Document the benefit - What does this library provide?
  3. Consider alternatives - Are there lighter alternatives?
  4. Update requirements.txt - Pin package versions in tests/util/requirements.txt
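
For example, an entry in tests/util/requirements.txt should pin the version and note the justification (the version shown here is only illustrative):

text
# python-debian: parsing Debian control/dpkg data; no stdlib equivalent
python-debian==0.1.49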

Key decisions that guide test development: