A curated list of autonomous improvement loops, research agents, and autoresearch-style systems inspired by Karpathy's autoresearch.
Updated Jun 18, 2026
Group Evolving Agents: Open-Ended Self-Improvement via Experience Sharing
SutroYaro — Sutro Group research workspace for energy-efficient AI training. Point any coding agent at the repo and it becomes a research agent. 34 experiments, eval environment, weekly catch-ups, multi-researcher workflow.
LitReview Skill is an installable agent skill for end-to-end literature review generation. It helps agents conduct literature reviews with a well-designed and widely used review framework so the search process is broad, iterative, and less likely to miss relevant articles.
🤖 CodeForge AI: An autonomous multi-agent coding system powered by LangGraph for agentic software development and automated workflows. SOTA custom agentic GraphRag, shared-state memory, auto-model routing for cost optimization, and a range of custom tooling.
Faraday: An Autonomous Web Research Agent (LangGraph/Streamlit). 🕵️‍♀️ Investigates queries using dynamic tools (Tavily, Google, NewsAPI, etc.), gathers multi-source info, and synthesizes structured reports in a Streamlit UI. Features agentic workflow & source tracking.
Foundation for an open strong-agent platform: controllers, operators, skills, A2A, runtime, and graph execution.
Lightweight Python CLI for the Exa API (Search, Contents, Find Similar, Answer, Research, Context) with JSON-first output, SSE streaming, and model-aware polling. LLM‑agnostic: integrate with OpenAI Agents SDK/Codex CLI or Claude tool use by invoking CLI commands, no MCP server required.
Curated paper-related AI skills and GitHub repositories for idea discovery, literature search, experiments, writing, citations, LaTeX/DOCX, review, and submission.
Six MCP servers that automate the full academic research pipeline — from refining a vague research question to generating a publication-ready report. Each server handles a distinct stage of the workflow: question development, data processing, code generation, script execut
A curated collection of research agents, skill libraries, autonomous research loops, paper-writing pipelines, MCP servers, and benchmarks built around Claude Code, OpenAI Codex CLI, and adjacent coding-agent CLIs for AI/ML research.
Benchmark whether agent skills actually improve research and engineering tasks.
An advanced agentic workflow implementation using LangGraph and LangChain, featuring iterative research, autonomous planning, and persistent state management for high-quality content generation.
Agent skills and workflows for reproducible ML research.
A Karpathy-inspired Python autoresearch agent on FastAPI that autonomously drafts, evaluates, and iterates on markdown research reports through durable Inngest workflows, powered by OpenRouter LLMs.
Evidence ledger and publication gate for AI research agents
A tool that uses AI agents to help with job applications.
Portable AI agent skills and specialist subagents for prompt enhancement, workspace resume, source-grounded research, and release readiness.
Brand system for Crafter Research, a lab for public-interest research systems, corpora, agents, and evidence trails.