Portfolio Projects#
Six end-to-end agent MVPs. Each is a genuinely useful business agent you can fork, extend, and ship this weekend. All run in SWARM_MOCK=true mode with no API key. All have passing tests.
Total: ~78 tests, 6 agents, 0 toy demos.
-
Multi-turn support agent with order lookup, customer lookup, shipping status, FAQ retrieval, and HITL escalation rules (repeat errors, frustration keywords, irreversible actions over threshold).
~$0.01 per conversation· 18 tests · Ch03b + Ch04 + Ch07 -
PR triage: parses the diff, runs 8 security regex checks, detects missing tests, flags high complexity, produces a structured APPROVE / REQUEST_CHANGES / NEEDS_DISCUSSION review.
~$0.05 per PR· 18 tests · Ch03b + Ch05 + Ch07 -
Natural-language-to-SQL-to-chart pipeline. Ask "top customers by spend last quarter"; get a matplotlib chart, a 2-sentence summary, and the SQL that produced it.
~$0.02 per query· 10 tests · Ch03a + Ch03b + Ch05 -
Document research with page-level citations. Searches a 10-doc corpus on software engineering history, takes structured notes, synthesizes with inline
[1]citations, and runs a grounding checker before returning.~$0.10 per research query· 18 tests · Ch03b + Ch04 + Ch05 -
Hybrid keyword + semantic search over 15+ markdown runbook docs. Answer grounding checker verifies every claim maps to a retrieved chunk. Ships with a realistic SaaS-company knowledge base.
~$0.005 per query· 10 tests · Ch04 (+ Vector Memory appendix) -
Decision support via N debater agents + moderator. Six-persona role library (database expert, performance skeptic, cost analyst, dev experience, risk manager, pragmatist). Consensus detection via Jaccard overlap. Evidence table flags which claims have numeric backing.
~$0.15 per decision· 15 tests · Ch04 + Ch06
Running a project#
Clone the repo, install in dev mode, then run any project with a question:
git clone https://github.com/TheAiSingularity/crafting-agentic-swarms.git
cd crafting-agentic-swarms
pip install -e ".[dev]"
# Pick one and run it:
SWARM_MOCK=true python -m projects.customer_support_agent.agent "I want to return order #12345"
SWARM_MOCK=true python -m projects.code_review_bot.agent projects/code_review_bot/sample_diffs/security_issue.diff
SWARM_MOCK=true python -m projects.data_analyst_agent.agent "What are the top 5 customers by total spend?"
SWARM_MOCK=true python -m projects.research_assistant.agent "When did Unix get the pipe operator?"
SWARM_MOCK=true python -m projects.knowledge_base_rag.agent "How do I deploy to staging?"
SWARM_MOCK=true python -m projects.multi_agent_debate.agent "Monolith vs microservices?"
Each project's README.md has the architecture diagram, cost estimate, failure modes, and extension guide for going from MVP to production.
Pick one for your portfolio#
If you finish the course and pick one project to ship, you'll stand out in interviews. Concrete suggestion: pick the project closest to your domain, extend it with two real integrations (real API + real DB), add authentication, put it behind a FastAPI endpoint, deploy to Fly.io or Railway, write a short post-mortem.
That's a portfolio piece. Hiring managers will have opinions about LangChain. They'll respect someone who built the underlying loop.