What I'm tracking
A running watchlist of models, tools, papers, and communities I follow for production AI: RAG, agents, evals, and shipping real products.
Models
- Claude
Long-context reasoning and tool use for agent workflows.
- GPT-4o / o-series
Multimodal APIs and structured outputs for client integrations.
- Gemini
Google stack integrations and large context windows.
Frameworks
- LangChain / LangGraph
Agent graphs, RAG chains, and production orchestration patterns.
- LlamaIndex
Data connectors, indexing, and retrieval pipelines.
Eval & ops
- LangSmith
Tracing, eval datasets, and regression checks for LLM apps.
- Arize Phoenix
Observability and embedding-drift monitoring for RAG systems.
Inference
Research
- Chip Huyen: AI Engineering
Production ML systems thinking applied to the LLM era.
Community
- Good AI List
Daily curated list of open-source AI repos.
- Latent Space
Podcast and community for AI engineers and builders.