Reimagining E2E Testing at Bizom: From Manual Scripts to Intelligent RI Agents

By Santhosh J S, Senior Tech Lead

30th December, 2025

Testing is the safety net of any software platform — it’s what gives teams confidence to deploy changes without breaking production. But at Bizom, we realized that traditional E2E testing approaches were no longer sufficient for our fast-paced, complex B2B distribution platform.

 

In the world of FMCG and retail distribution, our platform serves multiple user types. Traditional test automation scripts, while helpful, had significant limitations: they were brittle, required constant maintenance, and couldn’t adapt to UI changes without developer intervention.

 

We needed E2E testing that could:

  • Adapt intelligently to UI changes without breaking
  • Understand test intent rather than blindly executing hardcoded steps
  • Generate test coverage automatically for new features
  • Self-heal when minor UI changes occurred
  • Provide meaningful insights beyond pass/fail status

 

To solve these challenges, we adopted a hybrid agent-based approach, combining the power of Playwright’s robust testing framework with AI agents that could understand, plan, and execute tests intelligently.

Let's Understand the Approach

The Foundation: Playwright Test Framework

We built a solid foundation using Playwright— a modern, reliable browser automation framework. Playwright provides auto-waiting, network interception, multi-browser support, and a built-in test runner.

 

However, even with Playwright’s robustness, writing and maintaining tests remained a manual, time-consuming process.

Enter AI Agents: From Script Writers to Test Designers

We created a three-stage intelligent testing system where specialized AI agents collaborate to plan, generate, and heal tests automatically, powered by large language models (LLMs).

 

1. Planner Agent: Understanding Test Requirements

The Planner Agent understands what needs to be tested. It explores the live application, maps user flows, analyzes different user journeys, and designs comprehensive scenarios, generating structured test plans in markdown. This ensures semantic coverage—testing based on intent and user goals.

 

2. Generator Agent: Translating Plans to Executable Tests

The Generator Agent figures out how to test it. This agent:

  • Reads the test plan and navigates the actual application UI.
  • Takes accessibility snapshots and executes test steps live.
  • Generates clean, reusable TypeScript tests using the Page Object Model.

 

It understands the page semantically, looking for elements by their accessible name or role, making tests far more resilient.

 

3. Healer Agent: Self-Healing and Root Cause Analysis

When tests fail, the Healer Agent steps in. It executes failing tests in debug mode, performs root cause analysis (checking for selector changes, timing issues, etc.), edits test files to address issues, and re-runs tests to validate the fix. The Healer automatically fixes 80% of test failures due to minor UI changes.

Handling Real-World Complexity: Context-Aware Testing

We ensure E2E tests operate in stateful environments through:

  • Authentication fixtures
  • Data builders
  • Cleanup strategies

 

AI agents leverage these fixtures intelligently, understanding when to reuse state versus when to start fresh.

Implementation Details: How We Built the Agent System

Model Context Protocol (MCP) for Agent Communication

We implemented the Model Context Protocol (MCP)—an open standard for connecting AI models with development tools. MCP acts as a bridge, allowing AI agents to interact with the browser, inspect test results, understand project structure, and generate code following our architectural conventions.

Agent Orchestration and Workflow

The three agents work in a coordinated workflow:

  1. Planner Agent → Generates Test Plan (Markdown)
  2. Generator Agent → Generates Tests (TypeScript)
  3. Playwright Runner → Executes Tests
  4. Healer Agent (If fails) → Debugs, Fixes Code, Re-runs

Real-World Impact: What We Achieved

Our adaptive, intelligent testing system resulted in significant gains:

  • 10x faster test planning: Planner agent generates comprehensive test plans in minutes.
  • Automated test generation: Generator agent creates complete test suites.
  • Self-healing tests: Healer agent automatically fixes most UI-related failures.
  • 100% database validation coverage: All CRUD operations verify both UI state and database integrity.
  • Reduced maintenance burden by ~70% compared to traditional scripting.

Conclusion

We built a next-generation E2E testing system by combining Playwright’s robustness with three specialized AI agents. We moved away from treating tests as static artifacts and instead embraced adaptive, intelligent testing.


The journey from manual test scripting to AI-powered autonomous testing has fundamentally transformed how we approach quality assurance at Bizom. Engineers can focus on building great features, confident that AI agents are continuously ensuring quality.

Share this:

Like this:

Like Loading...

Discover more from Algorithmic Jugaad @ Bizom

Subscribe now to keep reading and get access to the full archive.

Continue reading