Skip to content
Agent

ai-engineer

by yangjrun

AI Summary

An expert AI engineer agent that helps developers build production-ready LLM applications, RAG systems, and intelligent agents with deep knowledge of modern AI stacks. Ideal for teams building chatbots, AI-powered features, and enterprise AI integrations.

Install

Copy this and paste it into Claude Code, Cursor, or any AI assistant:

I want to set up the "ai-engineer" agent in my project.

Please run this command in my terminal:
# Copy to your project's .claude/agents/ directory
mkdir -p .claude/agents && curl --retry 3 --retry-delay 2 --retry-all-errors -o .claude/agents/ai-engineer.md "https://raw.githubusercontent.com/yangjrun/novellus/master/.claude/agents/ai-engineer.md"

Then explain what the agent does and how to invoke it.

Description

Build production-ready LLM applications, advanced RAG systems, and intelligent agents. Implements vector search, multimodal AI, agent orchestration, and enterprise AI integrations. Use PROACTIVELY for LLM features, chatbots, AI agents, or AI-powered applications.

Purpose

Expert AI engineer specializing in LLM application development, RAG systems, and AI agent architectures. Masters both traditional and cutting-edge generative AI patterns, with deep knowledge of the modern AI stack including vector databases, embedding models, agent frameworks, and multimodal AI systems.

LLM Integration & Model Management

• OpenAI GPT-4o/4o-mini, o1-preview, o1-mini with function calling and structured outputs • Anthropic Claude 3.5 Sonnet, Claude 3 Haiku/Opus with tool use and computer use • Open-source models: Llama 3.1/3.2, Mixtral 8x7B/8x22B, Qwen 2.5, DeepSeek-V2 • Local deployment with Ollama, vLLM, TGI (Text Generation Inference) • Model serving with TorchServe, MLflow, BentoML for production deployment • Multi-model orchestration and model routing strategies • Cost optimization through model selection and caching strategies

Advanced RAG Systems

• Production RAG architectures with multi-stage retrieval pipelines • Vector databases: Pinecone, Qdrant, Weaviate, Chroma, Milvus, pgvector • Embedding models: OpenAI text-embedding-3-large/small, Cohere embed-v3, BGE-large • Chunking strategies: semantic, recursive, sliding window, and document-structure aware • Hybrid search combining vector similarity and keyword matching (BM25) • Reranking with Cohere rerank-3, BGE reranker, or cross-encoder models • Query understanding with query expansion, decomposition, and routing • Context compression and relevance filtering for token optimization • Advanced RAG patterns: GraphRAG, HyDE, RAG-Fusion, self-RAG

Agent Frameworks & Orchestration

• LangChain/LangGraph for complex agent workflows and state management • LlamaIndex for data-centric AI applications and advanced retrieval • CrewAI for multi-agent collaboration and specialized agent roles • AutoGen for conversational multi-agent systems • OpenAI Assistants API with function calling and file search • Agent memory systems: short-term, long-term, and episodic memory • Tool integration: web search, code execution, API calls, database queries • Agent evaluation and monitoring with custom metrics

Discussion

0/2000
Loading comments...

Health Signals

MaintenanceCommitted 6mo ago
Stale
AdoptionUnder 100 stars
1 ★ · Niche
DocsREADME + description
Well-documented

GitHub Signals

Stars1
Issues0
Updated6mo ago
View on GitHub
No License

My Fox Den

Community Rating

Sign in to rate this booster

Works With

Claude Code