Confluent named a Leader in the Forrester Wave: Streaming Data Platforms | Access the report

From 1 to 1 Million: How Agent Taskflow Scales Agentic AI With AWS and Confluent

Written By

In the explosive new landscape of generative AI (GenAI), the difference between a proof of concept and a production-grade system is scale. For artificial intelligence (AI) infrastructure startup Agent Taskflow Inc. (ATF), this wasn't just a future goal; it was a foundational requirement.

Founded in 2023, ATF provides a platform for rapid AI agent bootstrapping, multi-agent orchestration, and comprehensive observability. Its clients in healthcare, finance, and government intelligence don't need just AI agents; they need a serverless, event-driven system that can elastically scale from one agent to one million agents, all while meeting stringent enterprise security and reliability standards.

To validate that ambition, ATF recently benchmarked its Enterprise Edition running on a single Amazon Web Services (AWS) t3.xlarge instance (4 vCPU, 16 GB RAM) with Confluent Cloud. The platform sustained roughly 1,880 agent invocations per second with a 100% success rate and sub‑30ms median latency under sustained load—proof that the architecture can comfortably handle real-world enterprise traffic, even on minimal infrastructure.

The Challenge: Avoiding the "Apache Kafka® Trap"

For a platform designed to orchestrate millions of AI agents in real time, the critical bottleneck was clear: It was the event streaming infrastructure. ATF's founder knew from painful experience that self-managing Apache Kafka® was a massive operational drain.

"Scaling event streaming was the critical bottleneck," the founder noted. "Having personally felt the pain of partition management, consumer tuning, and cluster optimization, I knew this wasn't a problem to defer."

Without a managed, production-grade solution, ATF faced critical business risks:

ATF needed to ship its multi-agent platform within six months. The company couldn't afford to build infrastructure from scratch; it needed a partner.

The Solution: A Joint Win With Confluent and AWS

ATF built its distributed, serverless orchestration platform on a foundation of Confluent and AWS, creating a system that’s as scalable as it is secure.

The architecture is a blueprint for modern, event-driven GenAI:

  • The Streaming Backbone (Confluent): All agent communications—from large language model (LLM) calls and tool invocations to memory operations and chat events—flow through Confluent Cloud as events. This event-driven model is the key to ATF's serverless agents, which consume compute resources only at the exact moment they’re processing a task, eliminating idle costs.

  • The AI Engine (Amazon Bedrock): All LLM inference for models such as Claude, Llama, and DeepSeek is handled by Amazon Bedrock. This provides two massive advantages:

    1. Low Latency: In-region inference keeps network latency minimal, helping ATF achieve its sub-100ms agent response target.

    2. Data Sovereignty: For enterprise customers in sensitive sectors, Bedrock hosts models (such as DeepSeek) within the customer's own AWS Virtual Private Cloud (VPC). This ensures that sensitive data never leaves its secure environment—a critical compliance win.

  • Enterprise-Grade Security and Ops: The solution uses identity and access management (IAM) role-based authentication (no access keys) for Bedrock, meeting Fortune 500 security standards. Confluent's Schema Registry enforces data quality with 27 Avro schemas, preventing data corruption across services.

This combination of Confluent's managed data streaming and AWS's secure, in-region AI inference proved to be the optimal choice, enabling ATF to focus on features, not infrastructure.

The Results: Scale, Speed, and Observability

By building on Confluent and AWS from day one, ATF transformed how quickly it could scale and how confidently it could operate its agent workloads. The recent performance benchmarks put concrete numbers behind that story.

Summary of Agent Taskflow’s performance benchmark results with Confluent Confluent on AWS for peak and sustained throughput, accept latency, and success rate under load for agentic AI workloads.

A Replicable Blueprint for Enterprise AI

ATF shows what’s possible when you standardize on an event-driven backbone with Confluent and run AI workloads on AWS: elastic scale, enterprise-grade reliability, and real-time observability from day one. This architecture provides flexible deployment models to meet any customer need:

  • Public software-as-a-service (SaaS): Leverages Confluent Cloud and AWS Bedrock for a fully managed, multi-tenant public offering

  • Private Enterprise: Deploys Amazon MSK (Amazon Managed Streaming for Apache Kafka) with AWS Bedrock inside the customer's private VPC for maximum security and HIPAA compliance

The joint success of AWS and Confluent is more than just an infrastructure story; it's a business model enabler. ATF’s consumption-based pricing scales directly with customer usage—a perfectly aligned partnership for a scalable future. By solving the hard problem of scale first, ATF is now free to focus on what it does best: building the future of AI orchestration.

If you’re building your own agentic applications on AWS, start with Confluent Cloud on AWS Marketplace to spin up a fully managed Kafka and Apache Flink® foundation in minutes, with usage billed directly through your AWS account. From there, your Confluent and AWS teams can help you evaluate architectures, align on security requirements, and chart the fastest path from pilot to production.


Apache®, Apache Kafka®, Kafka®, Apache Flink®, Flink®, are registered trademarks of the Apache Software Foundation. No endorsement by the Apache Software Foundation is implied by the use of these marks.

  • Saul Sparber is the Founder & CEO of Agent Taskflow, a platform for building real-time, multi-agent AI systems with zero infrastructure overhead. A longtime data engineer and systems thinker, Saul now focuses on making agentic AI usable, observable, and accessible to everyone—from startups to enterprises. linkedin.com/in/sparbz | linkedin.com/company/agent-taskflow | https://atf.ai/

  • Braeden Quirante began his career as a software consultant where he worked on a wide array of technical solutions including web development, cloud architecture, microservices, automation, and data warehousing. Following these experiences, he joined Amazon Web Services as a partner solutions architect working with AWS partners in scaled motions such as go-to-market activities and partner differentiation programs. Braeden currently serves as a partner solutions engineer for Confluent and an AWS evangelist.

  • Michael is a Product Marketing Manager focused on collaborating with partners on joint go-to-market efforts.

Did you like this blog post? Share it now