Skip to content

nwilbert/pytest-given

Repository files navigation

pytest-given

A pytest plugin that generates interactive HTML reports from Given/When/Then annotated tests. Inspired by JGiven (Java). The code is the single source of truth — no separate Gherkin DSL.

Live examples:

Quick start

import pytest
from pytest_given import attach, given, scenario, then, when


@pytest.fixture
@given('a coffee machine')
def machine():
    return {'coffees': 10, 'price': 2}


@scenario('Buy coffee', tags=['billing', 'happy-path'])
def test_buy_coffee(machine):
    with when('I insert $2'):
        machine['coffees'] -= 1
    with then('I get a coffee'):
        assert machine['coffees'] == 9
        attach('Machine state', machine)

Run it:

pytest --given-html

This produces given-report/report-data.json and given-report/report.html — a single self-contained HTML file with all assets inlined.

Why pytest-given?

Classical BDD tools (Cucumber, behave, pytest-bdd) center on a natural-language DSL like Gherkin, designed so stakeholders can author tests themselves and engineers maintain the glue that binds each step to a Python function.

pytest-given is for the opposite case: engineers write normal tests, and the plugin turns them into readable documentation. The HTML report is something stakeholders, domain experts, and engineers on adjacent teams can open and follow — without any of them needing to touch the test suite. For the engineers writing the tests, the same narrative gives a high-level, domain-focused view of behavior that's easier to scan than raw test code. Grouping by tag or module, text search across scenario names and tags, and status filters help zero in on what matters.

  • Plain Python — no Gherkin, no .feature files, no parser.
  • Tests stay first-class pytest tests; the report is a by-product.
  • Self-contained HTML: open it locally or attach it to CI artifacts; no server, no external assets.

Public API

@scenario(name, tags=None)

Mark a test for inclusion in the report. Required for any test you want to appear.

@scenario('Buy coffee', tags=['billing'])
def test_buy_coffee(machine):
    ...

given(text), when(text), then(text)

Dual-purpose: use as a context manager inside a test body, or as a decorator on a fixture or helper function.

As context managers:

@scenario('Place order')
def test_order():
    with given('an empty cart'):
        cart = []
    with when('I add an item'):
        cart.append('coffee')
    with then('the cart has one item'):
        assert len(cart) == 1

As a fixture decorator (only @given is allowed — fixtures are setup, so @when/@then on a fixture is rejected at runtime):

@pytest.fixture
@given('a coffee machine')
def machine():
    return {'coffees': 10, 'price': 2}

Generator fixtures work too; teardown is silent (the post-yield block runs but recording steps from it is not allowed):

@pytest.fixture
@given('a database connection')
def db():
    conn = open_conn()
    yield conn
    conn.close()

As a helper-function decorator (any phase). The helper records its own step on each call; for dynamic narration, use pytest_given.Template and reference the helper's parameters:

@when('inserting money')
def insert(amount):
    ...

@when(Template('I insert ${amount}'))
def insert(amount):
    ...

Steps can be nested freely:

with when('I place a large order'):
    with when('I select 3 coffees'):
        order_count = 3
    with when('I apply loyalty discount'):
        ...

Parameterized tests are automatically grouped into a single scenario with a parameter table. Use t-strings (t'...') to interpolate parameter values into step text — the plugin recognizes parameter names in t-string interpolations and color-codes them in the report:

@scenario('Pricing')
@pytest.mark.parametrize('euros,expect', [(1, False), (2, True), (3, True)])
def test_pricing(machine, euros, expect):
    with when(t'I insert ${euros}'):
        can_buy = euros >= machine['price']
    with then(t'can_buy is {expect}'):
        assert can_buy == expect

For a parametrized scenario name, use pytest_given.Template — deferred substitution against the parametrize columns:

from pytest_given import Template, scenario

@scenario(Template('Brew {cup_size} ml'))
@pytest.mark.parametrize('cup_size', [200, 300])
def test_brew(cup_size):
    ...

Step text & placeholders

Form Example How it renders
Plain string (including f-strings) with given('a cup')
with given(f'a {cup_size} cup')
Rendered verbatim. F-string interpolation happens before pytest-given runs, so values aren't highlighted.
T-string with given(t'a {cup_size} cup') pytest-given interpolates at runtime. Values are color-coded when the interpolation expression matches a parametrize column; otherwise highlighted neutrally.
Template in @scenario(...) @scenario(Template('Brew {cup_size} ml')) Deferred substitution against parametrize columns at report time. Unmatched placeholders raise PytestGivenError at collection.
Template on a helper-function decorator @when(Template('I insert {amount}')) Deferred substitution against the function's bound arguments at each call. Placeholders must name a positional-or-keyword parameter of the helper. Unmatched placeholders raise PytestGivenError at decoration time.

Three things worth knowing:

  1. pytest_given.Template only accepts bare identifiers{name}, {name:spec}, {name!conv}. Attribute access ({obj.attr}), indexing ({d[key]}), and arbitrary expressions ({x + 1}) raise PytestGivenError at construction. Workaround: parametrize by the attributes directly, or move the step into a test-body t-string (which supports full expression syntax).

  2. Template is for deferred substitution. It works on @scenario(...) (against parametrize columns) and on helper-function decorators like @when(Template(...)) (against the helper's bound arguments). It is not allowed in a test body — values are in scope there, so use a t-string. with given(Template(...)) / when(Template(...)) / then(Template(...)) raise PytestGivenError at entry. T-strings in @scenario(...) or on a fixture/helper decorator are rejected for the symmetric reason: the values aren't in scope at decoration time.

  3. Parametrized scenarios use the first case's steps as the template. All rows of the parameter table share the step structure recorded by case 1, with values substituted per row. That's the right behaviour when every case runs the same code with different values — and misleading when the steps themselves vary, e.g. a conditional with given(t'...') will only show case 1's branch. If steps diverge per case, split into separate @scenario tests.

Domain Storytelling

Three optional pillars layer Domain-Driven Design on top of the core surface. Adopt any one independently — or all three for a full vocabulary-and-story workflow. The HTML report adds a tabbed view: Scenarios (always present), Stories, and Glossary (each only shown when populated).

1. Ubiquitous-language Glossary — declare the actors, work objects, and verbs your tests speak about:

from pytest_given import Glossary

g = Glossary()
guest = g.actor('Guest', definition='Person booking accommodation.')
room = g.work_object('Room', definition='A bookable hotel room.')
search = g.verb('search', definition='Look up available options.')

Use the captured handles directly in t-strings — t'a {guest} {search("searches for")} a {room}'. Each interpolation becomes a kind-coloured pill in the rendered step, with the term's definition as a tooltip. Glossary terms feed the Glossary tab.

2. Domain Stories — model a flow as a sequence of activity(...) rows tied together by story(...):

from pytest_given import activity, story

book_a_group_trip = story('Book a Group Trip', [
    activity(organizer, search('searches for'), room),
    activity(organizer('Carol'), select('selects'), room('Deluxe Suite')),
])

An activity reads left-to-right: actor → verb → work object (with optional connective words). Any part may be a bare string instead of a glossary handle — but an activity needs at least two distinct glossary terms to be tracked for coverage; under-anchored activities render as "not coverage-tracked". path(...) lets a story branch where alternate activity sequences share a prefix.

3. Scenario ↔ activity binding — link a scenario (and individual steps) to the story it implements:

@scenario('Carol selects a suite', story=book_a_group_trip)
def test_select_suite(carol):
    with when(t'{organizer("Carol")} {search("searches for")} a {room}'):
        ...

Each step's term references are matched against the story's activities to compute coverage. The Stories tab shows the timeline with a coverage chip per activity and the scenarios that touch it. A step can also bind explicitly with given(text, activity=...).

Kindless and undefined terms — use g('foo') to declare a term that the team hasn't classified yet. It registers under the Uncategorized bucket in the Glossary view (no kind pill) and shows an Undefined badge until definition= is supplied. Use g['foo'] to look up an already-declared term by name (raises if unknown). Both forms return a DeferredTermHandle usable in t-strings and story activities.

Glossary-only mode — if you want the Glossary tab without writing stories yet, put g = Glossary() in a conftest.py so the plugin discovers it.

See the domain-storytelling design spec for the full surface and the hotel-booking example for an end-to-end usage.

FileGlossary — load a Markdown glossary file — if your project already keeps a GLOSSARY.md, point FileGlossary at it instead of declaring terms in code:

from pathlib import Path
from pytest_given import FileGlossary

g = FileGlossary(Path(__file__).parent / 'GLOSSARY.md')

The file must contain at least one GFM pipe table. By default the first column is the term and the second is the description; override with term_column, description_column, and kind_column (each accepts a 0-based index or a header name, case-insensitive):

g = FileGlossary('GLOSSARY.md', kind_column='Kind')   # explicit kinds from a "Kind" column
g = FileGlossary('GLOSSARY.md', term_column='Term', description_column='Meaning')

Access terms by name — g['Guest'] (case-insensitive). The returned handle is usable inline everywhere a code-defined handle is:

# In a story activity:
activity(g['Guest'], g['book']('books'), g['Room'])

# In a t-string step:
with when(t'{g["Guest"]} {g["book"]("books")} a {g["Room"]}'):
    ...

When no kind_column is present, term kinds are inferred from story activity-slot positions at session finish: slot 0 → actor, slot 1 → verb, slot ≥ 2 → work object. A term used only in t-string steps (never in any story activity) stays kindless and renders with a neutral, uncoloured pill.

Every term in the glossary file is included in the report, even one referenced by no story and no step. Terms whose kind could not be identified are listed under the Uncategorized section in the Glossary tab (and filterable via its own toggle).

See the file-backed glossary design spec and the file-glossary-booking example for a worked end-to-end usage.

attach(label, content)

Attach data to the current step. Strings are stored verbatim; other types are JSON-serialized.

attach('Receipt', 'Coffee x1     $2.00')             # text
attach('Machine state', {'coffees': 9, 'price': 2})  # JSON

pytest options

The JSON report is always written whenever the plugin is loaded — every pytest run produces it at the path given by --given-json (the default is created if missing). The HTML report is opt-in via --given-html.

Flag Default Description
--given-json=PATH given-report/report-data.json JSON output path (always written)
--given-html off Also generate the HTML report
--given-html-output=PATH given-report/report.html HTML output path (used only with --given-html)
--given-source-link=PRESET none Editor preset (vscode, cursor, zed, pycharm, github) or raw URL template. Renders a clickable file:line anchor on each scenario card, on each story panel, and on expanded glossary term cards. See Source links.

Source links

Add a clickable file:line anchor to each scenario card, story panel, and expanded glossary term card so devs can jump straight to the source.

# pyproject.toml — pytest 9+ canonical form
[tool.pytest]
given_source_link = "vscode"

Or pass it on the CLI: pytest --given-html --given-source-link=vscode.

Preset Opens in Template
none (no link)
vscode VS Code vscode://file/{path}:{line}
cursor Cursor cursor://file/{path}:{line}
zed Zed zed://file/{path}:{line}
pycharm PyCharm pycharm://open?file={path}&line={line}
github GitHub (web) https://github.com/<org>/<repo>/blob/{sha}/{relpath}#L{line}<org>/<repo> auto-detected from GITHUB_REPOSITORY or git remote get-url origin (HTTPS and SSH forms both supported)

For a raw template, use any of these variables:

Variable Source
{path} Absolute POSIX path (resolved at render time against the cwd)
{relpath} POSIX path relative to pytest's rootdir
{line} 1-indexed line of the scenario's def
{project} Basename of pytest's rootdir
{sha} Commit SHA from GITHUB_SHA / CI_COMMIT_SHA / BUILDKITE_COMMIT, falling back to git rev-parse HEAD

Examples:

# CI archives → SHA-pinned GitHub permalinks (preset auto-detects org/repo)
given_source_link = "github"

# Same as a raw template — pin org/repo explicitly. Useful when origin is a
# mirror, fork URL, or non-standard remote that the preset can't parse:
given_source_link = "https://github.com/myorg/myrepo/blob/{sha}/{relpath}#L{line}"

Caveats:

  • Editor presets (vscode / cursor / zed) resolve {path} from the current working directory at render time. Re-rendering a CI-downloaded JSON from a different directory will produce broken links.
  • The GitHub-permalink template is SHA-pinned, so links remain stable after the line moves — what an archived CI report wants.
  • The github preset bakes the detected org/repo into the template at config-resolution time (session start / CLI invocation). If you later re-render the same JSON elsewhere, the org/repo in the link is the one detected on the original run.
  • Pytest 9 uses [tool.pytest]; older pytest used [tool.pytest.ini_options] (still accepted for back-compat).

Standalone CLI

Regenerate the HTML from a saved JSON file at any time:

pytest-given report path/to/report-data.json -o path/to/report.html \
    --source-link=vscode

--source-link accepts the same presets and raw templates as --given-source-link (see Source links). Omit it (or pass --source-link=none) to render plain file:line text without an anchor.

Examples

Three example suites live under examples/, each with pre-rendered JSON + HTML committed:

  • coffeeshop/test_coffeeshop.py — a tour of the core feature surface: when/then blocks, generator fixtures with teardown, plain text and JSON attachments, parameterized tests rendered as tables, t-string interpolation, helper functions that record their own steps, top-level given blocks, deeply nested steps, failure rendering, and skipped scenarios. Output: coffeeshop.html (live preview).
  • hotel-booking/test_hotel_booking.py — Domain Storytelling features: a Glossary of actors / work objects / verbs, a story(...) with activity(...) rows, scenarios bound to a story with per-activity coverage, and kindless + undefined terms (registered with g('foo')) awaiting classification. Output: hotel-booking.html (live preview).
  • file-glossary-booking/test_file_glossary_booking.pyFileGlossary features: loading a Markdown glossary file, name-based term access, inferred kinds from story activity slots, and a deliberately kindless term (neutral pill). Output: file-glossary-booking.html.

Run nox -s examples to regenerate all three.

Working with LLMs

pytest-given may be a good fit for AI-assisted workflows. A human can describe a scenario in plain prose — more flexible than a rigid Gherkin DSL — and an LLM can generate the full test: scaffolding, steps, and assertions. The explicit with given(...) / with when(...) / with then(...) blocks then act as a verifiable backbone: each step records what the implementation claims to do, making it easier for a human or another model to audit whether the generated code actually matches that intent.

As more implementation is generated rather than hand-written, the human's attention can shift away from line-by-line code review toward a domain-level view of behavior — which is exactly the artifact pytest-given produces. The HTML report reads as a behavior specification, useful for confirming that the generated code does what was asked.

Development

See AGENTS.md for setup, quality gates, and conventions.

License

MIT

About

A pytest plugin that generates interactive HTML reports from Given/When/Then annotated tests.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors