Reasonable

A high-performance, reasoning-centered AI agent service built on the modern LlamaIndex Workflows architecture.

#React, #LlamaIndex, #FastAPI, #Python, #Ollama, #Workflows, #SSE

AI & Machine Learning

April 2026

Goal

Build a transparent, reasoning-first AI agent that guides users through complex problems using a Socratic approach rather than just providing flat answers.

My Role

Lead AI Architect & Full Stack Developer

Timeline & Tools

April 2024 – Present

LlamaIndex Workflows, FastAPI, Ollama, SSE

01. Overview

Reasonable is a high-performance AI agent service designed to prioritize transparent reasoning over simple task completion. Built on the modern LlamaIndex Workflows architecture, it moves away from traditional "black-box" LLM wrappers to provide a fully observable, event-driven agentic experience.

Instead of a standard chatbot, Reasonable acts as a Socratic Tutor, using diagnostic tools to analyze problem structures and guiding users through identification, strategy, and execution.

02. The "Reasoning-First" Philosophy

Unlike traditional agents that "get an answer" in one jump, Reasonable is engineered to show its work.

Total Transparency: Every step of the agent's internal logic is streamed to the user.
Scaffolded Interaction: The agent uses tools as "Diagnostic Engines" to gather facts, which it then uses to challenge and guide the user.
Educational Intent: It focuses on how to arrive at a conclusion, making it a powerful tool for learning and complex problem-solving.

03. Core Architecture: Workflows-Native

Reasonable leverages the latest LlamaIndex event-driven architecture (v0.14+). This allows for robust, multi-step agentic behavior (like the ReAct loop) that is far more flexible and resilient than legacy sequential agents.

Event-Driven Logic

The system uses a specialized Event Schema to communicate between the backend and frontend:

Pulse: Real-time heartbeat of the engine.
Reasoning: Dedicated channel for "Thinking" tokens.
Call/Result: Transparent tool interaction logs.
Message: The final synthesized response.

04. Chain-of-Thought (CoT) Extraction

One of Reasonable's standout features is its ability to handle and visualize internal reasoning.

It is specifically engineered to handle "Thinking" tokens from models like Qwen 3.5. It identifies and streams internal reasoning processes separately from final answers, providing total transparency into how the AI arrived at a conclusion.

05. The "Socratic" Philosophy

Following its latest architectural pivot, Reasonable has transitioned from a simple "solver" into a Socratic Tutor.

The Heuristic Ladder: Instead of blurting out answers, the agent uses diagnostic tools to analyze the "shape" of a problem.
Fact-Based Challenging: It treats tools (like the Math Diagnostic Engine) as sources of objective truth used to guide the user's own reasoning process.

06. Technical Highlights

Local-First Intelligence

Reasonable is locally-optimized, currently leveraging models like Qwen 3.5 (9B) via Ollama. This allows for heavy reasoning tasks without relying on external cloud APIs for the "brain" of the operation.

Real-Time Streaming (SSE)

The backend is built with FastAPI, providing full CORS support and Server-Sent Events (SSE). This protocol allows frontends to render complex agentic thoughts and tool interactions in real-time as they happen.

07. Tech Stack

Frontend: React, Vite, Tailwind CSS, Framer Motion, Radix UI
Backend: Python, FastAPI, LlamaIndex Workflows
AI Engine: Ollama (Qwen 3.5), ChromaDB
Protocols: SSE (Server-Sent Events)