Chapter 06 of 18
User Stories & Process Modeling
From translating business requirements into sprint-ready user stories, to extracting and optimising formal BPMN process models — this chapter covers the two core BA deliverables that turn analysis into actionable work. You will build an Agile Story Generator and a Process Discovery Engine.
Part 2: Business Analysis with LLMs
User Stories & Process Modeling
Two of the most important BA deliverables are user stories and process models. User stories turn requirements into sprint-ready work items. Process models capture how the business actually operates and where it needs to improve. In this chapter, you will learn how LLMs can accelerate both: generating, validating, and splitting user stories from requirements, and discovering, modelling, and optimising business processes from documents and data.
Reading time: ~45 min Projects: Agile Story Generator · Process Discovery Engine
What You Will Learn
- How to convert structured requirements into user stories following standard templates
- Techniques for generating comprehensive acceptance criteria with LLMs
- Automated story splitting strategies that produce right-sized work items
- Using LLMs to validate stories against the INVEST criteria
- AI-assisted story mapping for release planning
- Building an intelligent backlog grooming assistant
- Core concepts of business process modeling and the BPMN 2.0 standard
- How to extract process descriptions from unstructured documents using LLMs
- Generating BPMN-compliant process definitions from natural language
- Automated bottleneck detection using LLM analysis of process data
- Techniques for generating process improvement recommendations
- As-Is to To-Be process mapping with AI assistance
- Change impact analysis for process modifications
Part A: User Story Generation and Refinement
6.1 From Requirements to Stories
Requirements and user stories serve different purposes. A requirement describes what the system must do. A user story describes who needs it, what they need, and why. The canonical template, "As a [role], I want [capability], so that [benefit]," forces the author to think from the user's perspective.
The translation from requirements to stories is not one-to-one. A single requirement may generate multiple stories, one per user role or interaction path. Multiple requirements may collapse into a single story when they describe facets of the same user goal.
Why LLMs Excel at This Translation
- Role inference: LLMs can identify implicit user roles from requirement text ("the finance team needs..." implies an "Accountant" or "Finance Manager" role).
- Benefit articulation: Requirements often omit the "so that" clause. LLMs can infer the business benefit from context.
- Decomposition: LLMs can break compound requirements into atomic stories that each deliver a testable increment of value.
The Translation Pipeline
Figure 6-1. Requirements to User Stories Pipeline — from a business requirement through role identification, story generation, validation, and acceptance criteria to a sprint-ready backlog item.
| Input | Process | Output |
|---|---|---|
| Classified requirement | Role extraction, benefit inference | Draft user story |
| Draft user story | Acceptance criteria generation | Story + AC |
| Story + AC | INVEST validation | Validated story or split candidates |
| Split candidates | Story splitting | Right-sized stories |
Terminology Note: Throughout this section, "AC" refers to Acceptance Criteria, and "story" is shorthand for "user story" unless otherwise specified. "Epic" refers to a large body of work that can be decomposed into multiple stories.
6.2 Generating User Stories with LLMs
The core generation step converts a requirement into one or more user stories. The quality of the output depends heavily on the prompt design. A good prompt provides the requirement text, any known user roles, the project domain, and explicit instructions about the output format.
Story Generation Approach
The story generation prompt casts the LLM as a senior Product Owner and provides five explicit rules. First, each story must be atomic, delivering exactly one piece of user-visible functionality. Second, if the requirement implies multiple user roles, a separate story is created for each role. Third, compound requirements (those containing "and" joining distinct capabilities) are split into separate stories. Fourth, the "so that" clause must state a concrete business benefit rather than merely restating the capability. Fifth, every story receives a sequential ID and a link back to its source requirement.
The prompt also accepts contextual inputs: the project domain (for example, "Healthcare Portal" or "Financial Services Platform") and a list of known user roles. These help the LLM generate stories that use consistent role names aligned with your project's terminology.
Batch Generation Across a Requirement Set
For large requirement sets, process each requirement independently and in parallel using asynchronous API calls. Apply a concurrency limit (for example, 10 simultaneous requests) to stay within API rate limits. After all responses return, merge the story lists and assign sequential IDs (STORY-001, STORY-002, and so on) across the full set. This approach lets you generate stories for 50 or more requirements in under a minute.
Tip: Use async generation with rate limiting to process large requirement sets efficiently without hitting API rate limits.
6.3 Acceptance Criteria Automation
Acceptance criteria define when a story is "done." Well-written AC are specific, testable, and complete. LLMs can generate AC in both the Given/When/Then (Gherkin) format and the checklist format, depending on your team's preference.
Generating Gherkin-Style AC
The acceptance criteria prompt instructs the LLM to generate scenarios in Given/When/Then format for each story. Four rules ensure quality: cover the happy path plus at least one edge case and one error scenario; make each scenario independently testable; use concrete values instead of vague terms (for example, "within 3 seconds" rather than "quickly"); and include boundary conditions where applicable. The output is a JSON object containing the story ID and an array of scenario objects, each with a descriptive name, preconditions, actions, expected outcomes, and a type tag (Happy Path, Edge Case, or Error).
Example Output
For the story "As a Finance Manager, I want to export monthly reports as PDF, so that I can share them with auditors," the LLM might generate:
| Scenario | Given | When | Then | Type |
|---|---|---|---|---|
| Successful export | A report for March 2026 exists with 150 line items | The user clicks "Export as PDF" | A PDF file is downloaded within 5 seconds containing all 150 line items | Happy path |
| Empty report | A report for April 2026 exists with zero line items | The user clicks "Export as PDF" | A PDF is generated with a "No data available" message | Edge case |
| Export timeout | A report has 50,000 line items and the export exceeds 30 seconds | The user clicks "Export as PDF" | The user sees a progress indicator and can cancel; the system queues the export for background processing | Error |
Warning: LLM-generated acceptance criteria often miss domain-specific edge cases (e.g., fiscal year boundaries, currency rounding rules, timezone handling). Always review AC with domain experts before marking them as final.
6.4 Story Splitting Strategies
Stories that are too large cannot be completed within a single sprint. Stories that are too small create overhead without delivering meaningful value. The art of splitting is finding the right granularity. LLMs can apply well-known splitting patterns systematically.
The Nine Splitting Patterns
| # | Pattern | When to Use | Example |
|---|---|---|---|
| 1 | Workflow steps | Story covers a multi-step process | Split "checkout" into cart review, payment, confirmation |
| 2 | Business rules | Multiple rules govern the behaviour | Split discount calculation by rule type |
| 3 | Happy/unhappy paths | Error handling is complex | Separate success flow from error handling |
| 4 | Input methods | Multiple input channels exist | Split "upload document" into drag-drop, file picker, API |
| 5 | Data types | Different data entities are involved | Split "manage users" into create, update, deactivate |
| 6 | Operations (CRUD) | Story involves multiple operations | Separate read from write operations |
| 7 | User roles | Different roles have different needs | Admin view vs. end-user view |
| 8 | Platforms | Cross-platform support needed | Web version first, then mobile |
| 9 | Performance levels | Non-functional requirements are separable | Basic functionality first, then optimise for scale |
Automated Splitting
The splitting prompt provides the LLM with the oversized story (including its current point estimate, story text, and acceptance criteria) and asks it to apply the most appropriate pattern or combination of patterns from the nine listed in the table above. For each resulting sub-story, the model produces a new ID (for example, STORY-003.1, STORY-003.2), the full "As a... I want... so that..." text, the pattern used, a revised point estimate (targeting 1, 2, 3, or 5 points), and any dependencies between sub-stories.
Figure 6-2. Story Splitting Decision Tree — a simplified guide for choosing the right splitting pattern based on story characteristics.
Tip: After splitting, re-run acceptance criteria generation on each sub-story. The original AC should be distributed across the sub-stories, with each sub-story getting the specific scenarios it covers.
6.5 INVEST Criteria Validation
The INVEST acronym defines six qualities of a well-formed user story: Independent, Negotiable, Valuable, Estimable, Small, and Testable. LLMs can evaluate each criterion and flag stories that need refinement.
INVEST Validator
The INVEST validation prompt sends the LLM each story along with its acceptance criteria and asks it to evaluate against all six criteria. For each criterion, the model returns a pass/fail flag, a score from 1 (poor) to 5 (excellent), an explanation, and an improvement suggestion for any score below 4. It also computes an overall score (the average of all six) and a recommendation: Ready, Needs Refinement, Needs Split, or Reject. Use a very low temperature (0.1) for this evaluation, as you want consistent, deterministic assessments.
Interpreting Results
| Overall Score | Recommendation | Action |
|---|---|---|
| 4.0 – 5.0 | READY | Story can go directly to sprint planning |
| 3.0 – 3.9 | NEEDS_REFINEMENT | Address the specific failing criteria before planning |
| 2.0 – 2.9 | NEEDS_SPLIT | Story is likely too large or too dependent. Split first |
| < 2.0 | REJECT | Story needs fundamental rework; return to requirements |
Key Insight: The "Testable" criterion is the most reliable LLM evaluation because it is objectively assessable from the acceptance criteria. The "Independent" criterion is the least reliable because it requires knowledge of the full backlog context. Always supply related stories when checking independence.
6.6 Story Mapping with AI
Story mapping, popularised by Jeff Patton, arranges stories along two axes: the user journey (horizontal) and priority (vertical). It provides a bird's-eye view of the product. Teams plan releases by drawing horizontal "release lines" across the map.
LLM-Assisted Story Mapping
An LLM can help with two aspects of story mapping: (1) identifying the user activities and steps that form the horizontal backbone, and (2) placing stories under the correct activity at the right priority level.
The story mapping prompt takes the full set of user stories and organises them into a two-level hierarchy. At the top level are activities, high-level user goals such as "Manage Account" or "Process Orders." Each activity contains steps (specific actions like "Register" or "Log In"), and each step contains the stories assigned to it, tagged with a priority level and a suggested release number. The model also generates a release plan grouping stories into releases: Must-Have stories go into Release 1, Should-Have into Release 2, and Could-Have into Release 3.
Visualising the Story Map
The JSON output can be rendered as an interactive HTML table, exported to tools like Miro or Mural, or printed as a simple text-based grid organised by activity, step, and release tier. The visual format makes it easy for the team to spot gaps: steps where no stories exist but where a user would logically need functionality.
Tip: Feed the story map back to the LLM with the prompt "Identify any gaps in the user journey: steps where no stories exist but a user would logically need functionality." This is a powerful way to discover missing stories.
6.7 Backlog Grooming Assistant
Backlog grooming (or refinement) is a recurring ceremony where the team reviews, re-prioritises, and refines upcoming stories. An LLM-powered grooming assistant can prepare for these sessions by pre-analysing the backlog and surfacing issues before the meeting begins.
Pre-Grooming Analysis
The pre-grooming prompt sends the full backlog to the LLM and asks it to produce a structured report across six categories: stale stories not updated in 30+ days, duplicates that overlap significantly (with merge recommendations), dependency chains including any circular dependencies, estimation gaps where similar stories have inconsistent point values, blocked items referencing external prerequisites, and refinement candidates needing more detail before sprint planning. This analysis, which would take a Scrum Master 30 to 60 minutes to perform manually, can be generated in seconds and used as the agenda for the grooming session.
Interactive Refinement Loop
During the grooming session itself, the assistant operates in a conversational mode. It maintains the full backlog in its context window and responds to team questions: "Which stories depend on STORY-042?", "Suggest a rewrite for this vague story," or "What is the total point estimate for Release 2?" Decisions made during the session (such as splitting a story, changing priority, or deferring an item) are recorded with story ID, decision, and rationale. At the end of the session, the assistant generates meeting minutes summarising all decisions, action items, and responsible owners.
Warning: The grooming assistant should augment, not replace, the Scrum Master. Use it to surface data and suggestions, but let the team make the decisions. Over-reliance on AI in ceremonies can reduce team engagement and ownership.
Project A: Agile Story Generator
Build a complete pipeline that converts a set of requirements into a sprint-ready backlog, including user stories with acceptance criteria, INVEST validation results, and a story map.
Pipeline Architecture
The pipeline follows the same flow shown in Figure 6-1. Requirements from Chapter 5 feed into the Story Generator, which produces draft stories for each requirement. These pass through the AC Generator (acceptance criteria in Gherkin format), then the INVEST Validator. Stories scoring below 3.0 are sent to the Story Splitter, which breaks them into sprint-sized sub-stories. Finally, the Story Mapper organises all stories into a release plan. The pipeline outputs two JSON files: the full story backlog (with acceptance criteria and INVEST scores) and the story map (with activities, steps, and release assignments).
Testing Your Pipeline
Create a sample requirements.json with 10 requirements covering different types (functional, non-functional, constraint). Run the pipeline and verify that:
- Each requirement produces at least one story
- Stories follow the "As a... I want... so that..." template
- Each story has at least three acceptance criteria scenarios
- INVEST scores are reasonable (compare against your own assessment)
- The story map has a logical activity structure
Part B: Process Modeling and Optimization
6.8 Understanding Business Processes
A business process is a sequence of activities that transforms inputs into outputs to deliver value to a customer or stakeholder. Processes range from simple (expense approval) to complex, such as end-to-end order fulfilment spanning multiple departments and systems.
Process Hierarchy
| Level | Name | Description | Example |
|---|---|---|---|
| L0 | Value Chain | End-to-end enterprise capabilities | Order to Cash |
| L1 | Process Group | Major functional processes | Sales, Fulfilment, Billing |
| L2 | Process | Defined sequence of activities | Order Processing |
| L3 | Sub-Process | Detailed steps within a process | Credit Check, Inventory Allocation |
| L4 | Task | Atomic unit of work | Verify customer address |
BPMN 2.0 Essentials
BPMN (Business Process Model and Notation) is the industry standard for visually representing business processes. The core elements are:
- Events: Start (circle), Intermediate (double circle), End (thick circle). These represent triggers and outcomes.
- Activities: Tasks (rounded rectangles) and Sub-Processes (rounded rectangles with a "+" marker).
- Gateways: Exclusive (X), Parallel (+), Inclusive (O). These control flow branching and merging.
- Flows: Sequence flows (solid arrows), Message flows (dashed arrows), Associations (dotted lines).
- Swimlanes: Pools and Lanes that represent participants and departments.
Why BPMN Matters: BPMN is not just documentation. Modern BPM engines (Camunda, jBPM, Flowable) can directly execute BPMN XML definitions. An LLM that generates valid BPMN produces artifacts that are both human-readable and machine-executable.
6.9 Process Discovery from Documents
Process discovery is the act of understanding how work actually gets done. Traditional approaches involve interviews, workshops, and process mining from system logs. LLMs add a new capability: extracting process descriptions from the text-heavy documents that already exist in every organisation. These include SOPs, training manuals, email threads, and audit reports.
Process Extraction Approach
The process discovery prompt instructs the LLM to act as a business process analyst and extract every process described or implied in the source document. For each process, the model produces a rich, structured output: a sequential ID (PROC-001, PROC-002), a descriptive name, the trigger event that initiates the process, the actors (roles and departments) involved, and a detailed step list. Each step includes the step number, the responsible actor, the action performed, required inputs and produced outputs, and a flag indicating whether the step is a decision point (with branching conditions if so). The model also captures the end state, known exception paths, and any mentioned or inferable duration estimates.
Validating Discovered Processes
Extracted processes should be validated against these quality checks:
- Completeness: Does every process have a clear start trigger and end state?
- Consistency: Are actor names consistent across processes? (e.g., "Finance Dept" vs. "Accounting Team")
- Granularity: Are all processes at roughly the same level of detail?
- Coverage: Do the extracted processes cover the entire scope of the source document?
Figure 6-3. Process Discovery Pipeline — from source documents through text extraction and LLM analysis to formal BPMN diagrams with optimisation recommendations.
After extraction, validate the discovered processes against four quality checks. Completeness: does every process have a clear start trigger and end state? Consistency: are actor names consistent across processes (for example, "Finance Dept" and "Accounting Team" should not refer to the same group)? Granularity: are all processes at roughly the same level of detail? Coverage: do the extracted processes cover the entire scope of the source document? A simple automated checker can flag processes missing triggers or end states, those with fewer than two steps, and actor name variants that differ only in capitalisation or spacing.
Tip: Run process discovery on multiple documents from the same department, then ask the LLM to reconcile the results. Different documents often describe the same process at different levels of detail or with different terminology. The reconciliation step produces a more complete picture than any single source.
6.10 BPMN Generation with LLMs
Once a process is described in structured JSON, the next step is generating a formal BPMN 2.0 XML definition that can be imported into modelling tools (Camunda Modeler, Signavio, Bizagi) or executed by a BPM engine.
BPMN XML Generation
The BPMN generation prompt takes the structured process JSON and converts it into valid BPMN 2.0 XML. The prompt enforces seven rules: proper namespace declarations, inclusion of start and end events, exclusive gateways for decision points, parallel gateways for concurrent steps, correct lane assignments within pools, complete sequence flow connections, and compatibility with tools like Camunda Modeler. Use a temperature of 0.0 for maximum consistency in the generated XML structure.
Validating the Generated BPMN
After generation, validate the XML programmatically by parsing it with an XML library and running structural checks: verify at least one start event and one end event exist, count tasks and gateways, and confirm that sequence flows connect all elements. For full schema validation, compare against the official BPMN 2.0 XSD. Always visually inspect the result in a modelling tool before sharing with stakeholders.
Warning: LLM-generated BPMN XML often has minor issues: missing sequence flow connections, incorrect namespace prefixes, or overlapping diagram coordinates. Always validate the XML programmatically and visually inspect it in a modelling tool before sharing with stakeholders.
A Simpler Alternative: Mermaid Diagrams
If full BPMN XML is overkill for your needs, ask the LLM to generate Mermaid flowchart syntax instead. Mermaid diagrams render natively in GitHub, Notion, Confluence, and many other tools. The prompt maps process elements to Mermaid shapes: rounded rectangles for tasks, diamonds for decisions, stadium shapes for start and end events, and labelled arrows for conditional flows. This lightweight approach is ideal for early-stage process documentation or when you need quick visual validation before investing in formal BPMN modelling.
6.11 Bottleneck Identification
A bottleneck is any step in a process that limits the throughput of the entire system. Traditional bottleneck analysis uses process mining on event logs. LLMs complement this by analysing qualitative data: interview transcripts, complaint tickets, and delay reports. This identifies bottlenecks that may not appear in structured logs.
Quantitative Bottleneck Analysis
The quantitative approach works from event log data (structured records with case ID, activity name, start time, end time, and resource). For each activity in the process, calculate three metrics: average duration, duration variability (the ratio of the 95th percentile to the median), and throughput per resource. A composite bottleneck score weights these: 40 percent for duration (longer is worse), 30 percent for variability (more variable is worse), and 30 percent for inverse resource count (fewer resources handling the step means a tighter constraint). Steps with the highest composite scores are the most likely bottlenecks.
Qualitative Bottleneck Analysis with LLMs
The LLM-driven approach complements quantitative analysis by processing qualitative inputs: interview transcripts, complaint tickets, and delay reports. The bottleneck prompt provides the LLM with the process description, any available performance metrics, and the qualitative feedback text. For each identified bottleneck, the model returns the affected activity, the supporting evidence, likely root causes, the downstream impact, and a severity rating (Critical, High, Medium, or Low).
Key Insight: The most valuable bottleneck insights often come from combining quantitative and qualitative data. A step with average metrics might still be a bottleneck if interview data reveals it causes frustration, errors, or workarounds that are not captured in the event log.
6.12 Process Optimization Suggestions
Once bottlenecks are identified, the next step is generating improvement recommendations. LLMs can propose optimisations based on established process improvement methodologies: Lean (eliminate waste), Six Sigma (reduce variation), and automation-first strategies.
Optimization Recommendation Engine
The optimisation prompt gives the LLM the full process description alongside the identified bottlenecks and asks it to propose specific, actionable improvements. Each recommendation includes a sequential ID, the target activity, the optimisation category (from the six listed in the table below), a detailed description, a quantified expected impact, the implementation effort level, associated risks, prerequisites, and a qualitative ROI assessment. Recommendations are prioritised by their impact-to-effort ratio, putting "quick wins" at the top of the list.
Optimization Categories Explained
| Category | Description | When to Apply | Example |
|---|---|---|---|
| Eliminate | Remove the step entirely | Step adds no value (pure waste) | Remove redundant approval for orders under $100 |
| Automate | Replace manual work with software | Step is rule-based and repetitive | Auto-generate invoices from order data |
| Simplify | Reduce complexity of the step | Step has unnecessary sub-steps or forms | Replace 12-field form with 4-field smart form |
| Parallelise | Run steps concurrently | Steps are independent | Run credit check and inventory check simultaneously |
| Outsource | Delegate to a specialist team or service | Step requires expertise not core to the business | Use a third-party identity verification service |
| Standardise | Create a consistent procedure | Step is done differently by different people | Create a standard checklist for quality review |
Tip: Present optimisations on a 2x2 impact-effort matrix. High-impact, low-effort items are "quick wins" that build momentum. Schedule them first to demonstrate value before tackling larger changes.
6.13 As-Is to To-Be Mapping
As-Is/To-Be mapping is the standard BA technique for visualising the current state of a process alongside the proposed future state. LLMs generate the To-Be model by applying the optimisation recommendations to the As-Is process.
Generating the To-Be Process
The To-Be generation prompt takes two inputs: the current (As-Is) process in structured JSON and the list of approved optimisations. The model applies each optimisation and produces three outputs: the redesigned To-Be process (in the same structure as the As-Is, making comparison straightforward), a list of changes applied (each linked to the optimisation that drove it, with change types of Added, Modified, Removed, or Reordered), and a metrics comparison showing estimated before-and-after values for total steps, cycle time, manual step count, and decision points.
Figure 6-4. As-Is to To-Be Mapping — the current process (left) with a bottleneck at the approval step is transformed into an optimised process (right) with automation and step consolidation.
Comparison Report
The comparison report presents the transformation results in a stakeholder-friendly format. A metrics summary table shows before-and-after values for total steps, cycle time, manual steps, and decision points. Below it, a changes list details each modification: which As-Is step was affected, what it became in the To-Be model (or "Removed" if eliminated), the change type, and the optimisation recommendation that drove the change. When presenting to executives, use the side-by-side visual from Figure 6-4 with the metrics table at the bottom to provide the business case.
Presentation Tip: When presenting As-Is/To-Be to executives, use a side-by-side visual with the As-Is on the left and To-Be on the right. Highlight removed steps in red, new steps in green, and modified steps in amber. The metrics comparison table at the bottom provides the business case.
6.14 Change Impact Analysis
Before implementing process changes, you must understand the ripple effects. A change impact analysis identifies which systems, roles, policies, and training materials are affected by a process modification.
Impact Analysis Framework
The impact analysis prompt assesses proposed changes across five dimensions: people (roles affected, training needs, headcount implications), process (upstream and downstream processes impacted), technology (systems, integrations, and data flows), policy (regulations, SOPs, and governance documents needing updates), and risk (new risks introduced by the change). For each impacted item, the model provides the dimension, what is affected, the impact level (High, Medium, or Low), a description of how it is affected, and the action required to address it. The overall output includes a total impact count, a readiness score from 0 to 100, and a go/no-go recommendation.
Impact Heatmap
Visualise the results as a heatmap table with the five dimensions as rows and High, Medium, and Low impact counts as columns. This format lets decision-makers quickly identify which dimensions have the most critical impacts and where additional preparation is needed before the process change can proceed safely.
Warning: Change impact analysis is where LLMs are most likely to miss organisation-specific factors: union agreements, legacy system constraints, and cultural resistance patterns. Always supplement LLM output with input from change management practitioners who know the organisation.
Project B: Process Discovery Engine
Build an end-to-end tool that takes operational documents as input and produces a complete process analysis package: discovered processes, BPMN diagrams, bottleneck analysis, optimisation recommendations, and a To-Be process model with change impact assessment.
Pipeline Architecture
The pipeline follows the flow shown in Figure 6-3. Source documents (SOPs, manuals, transcripts) feed into the Process Discovery module, which extracts structured process descriptions. These pass through validation, BPMN or Mermaid diagram generation, bottleneck identification, optimisation recommendation, As-Is/To-Be mapping, and finally change impact analysis. The main script iterates through all text files in an input directory, discovers processes from each, validates the combined set, generates diagrams, and then runs the full analysis chain for each process with bottlenecks. All results are written to a single comprehensive JSON output file.
Extension Ideas
- Add process mining integration by reading event logs from CSV and correlating them with discovered processes.
- Generate an executive presentation deck (Markdown or HTML slides) summarising the top three bottlenecks and recommended optimisations.
- Build a web interface using Streamlit that allows stakeholders to upload documents and interact with the results.
Summary
User Stories
- LLMs bridge the requirements-to-stories gap by inferring roles, articulating benefits, and decomposing compound requirements into atomic stories.
- Acceptance criteria generation in Gherkin format produces testable scenarios for happy paths, edge cases, and error conditions. Domain-specific edge cases still require human review.
- Nine splitting patterns provide a systematic framework for breaking oversized stories into sprint-sized work items.
- INVEST validation gives each story a quality score, with "Testable" being the most reliable LLM assessment and "Independent" requiring the most human judgment.
- Story mapping with LLM assistance creates a visual product plan and reveals gaps in the user journey.
- Backlog grooming assistants can pre-analyse the backlog, surface issues, and record decisions. They should augment, not replace, team ceremonies.
Process Modeling
- Process discovery from documents allows LLMs to extract structured process descriptions from SOPs, manuals, and transcripts, providing a starting point that would take days to produce manually.
- BPMN generation produces machine-readable process models, though validation and visual inspection are essential before use.
- Bottleneck identification benefits most from combining quantitative metrics with qualitative LLM analysis of interviews and complaints.
- Six optimisation categories (Eliminate, Automate, Simplify, Parallelise, Outsource, Standardise) provide a systematic framework for improvement recommendations.
- As-Is/To-Be mapping with LLMs produces a clear before-and-after comparison with estimated metrics improvements.
- Change impact analysis across people, process, technology, policy, and risk dimensions ensures that process changes are implemented with full awareness of their ripple effects.
Exercises
Conceptual
- Explain why the "so that" clause in a user story is critical for prioritisation. How would a backlog without benefits statements affect sprint planning?
- Compare the strengths and weaknesses of Gherkin-style versus checklist-style acceptance criteria. When would you use each?
- Explain why an LLM might discover a process step that does not appear in any single document but is implied by the combination of multiple documents. Give an example.
- Compare the strengths of process mining (from event logs) versus LLM-based process discovery (from documents). When would you use each, and when would you combine them?
Coding
- Extend the
generate_storiesfunction to detect and reject stories that violate the "Independent" criterion by checking for shared data dependencies. - Write a function that takes two story maps (current and previous sprint) and highlights new stories, removed stories, and stories that moved between releases.
- Write a function that detects circular dependencies in a set of processes (Process A feeds into Process B, which feeds into Process A).
- Extend the
validate_bpmnfunction to check that every task has at least one incoming and one outgoing sequence flow (no orphaned tasks).
Design
- Design a Slack bot that integrates with the Agile Story Generator. Sketch the conversation flow for a user who wants to generate stories from a pasted requirement.
- Propose a feedback loop where developers mark stories as "unclear during implementation" and this feedback is used to improve the story generation prompt over time.
- Design a "Process Health Dashboard" that monitors live processes and uses LLM analysis to flag emerging bottlenecks before they become critical. Sketch the architecture and data flows.