Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep

TL;DR

Semble is a code search tool designed for AI agents that significantly reduces token consumption—by approximately 98%—while maintaining high retrieval accuracy. It offers fast, CPU-based indexing and querying, enabling agents to access code repositories instantly.

Semble, a new code search library tailored for AI agents, claims to reduce token consumption by approximately 98% compared to traditional grep-based methods, while delivering instant, accurate code retrieval on CPU without external dependencies.

Developed to serve the needs of AI agents requiring efficient codebase access, Semble indexes repositories in around 250 milliseconds and answers queries in approximately 1.5 milliseconds, all on CPU. It achieves this by returning only relevant code snippets instead of full files, significantly reducing token usage.

According to its creators, Semble’s retrieval quality is comparable to specialized transformer models, with a NDCG@10 score of 0.854 on benchmarks. It can be integrated as an MCP server or called directly via command line, supporting local repositories or remote git URLs. The tool requires no API keys, GPU, or external services, making it lightweight and easy to deploy.

Why It Matters

This development matters because it addresses a key challenge in AI code understanding: balancing accuracy with token efficiency. By drastically reducing token usage, Semble enables faster, more cost-effective code searches within AI workflows. Its ability to run on CPU and without external dependencies broadens accessibility for developers and organizations seeking efficient code search solutions for AI agents.

FOXWELL NT301 OBD2 Scanner Live Data Professional Mechanic OBDII Diagnostic Code Reader Tool for Check Engine Light

【Vehicle CEL Doctor】The NT301 obd2 scanner enables you to read DTCs, access to e-missions readiness status, turn off…

As an affiliate, we earn on qualifying purchases.

Background

Traditional code search methods like grep are simple but can be inefficient for large codebases and consume many tokens when integrated into AI workflows. Recent advances have focused on transformer-based models, which offer high accuracy but require significant computational resources and tokens. Semble enters this landscape as a lightweight alternative, emphasizing speed and token efficiency while maintaining comparable accuracy.

Its introduction aligns with ongoing efforts to optimize AI tooling for developer productivity, especially in environments where resource constraints or latency are critical factors.

“Semble reduces token usage by approximately 98% compared to grep+read while providing instant, accurate code snippets.”

— Semble Development Team

“It’s impressive how Semble achieves near transformer-level accuracy with such minimal token consumption, all on CPU.”

— Hacker News User

Amazon

AI code repository search software

As an affiliate, we earn on qualifying purchases.

What Remains Unclear

Details about the long-term scalability, integration with various AI agents beyond initial setup, and real-world benchmarking across diverse codebases remain to be seen. Additionally, the exact impact on large-scale enterprise environments is still unconfirmed.

Amazon

command line code search tools

As an affiliate, we earn on qualifying purchases.

What’s Next

Further testing and adoption by AI developer communities are expected. Future updates may include broader integration options, enhanced benchmarks, and performance improvements based on user feedback. Watching for official documentation updates and case studies will be key to assessing its practical impact.

Amazon

lightweight code search library

As an affiliate, we earn on qualifying purchases.

Key Questions

How does Semble compare to traditional grep?

Semble offers similar or better accuracy in code retrieval but with approximately 98% fewer tokens and faster response times, making it more suitable for AI workflows.

Can Semble be integrated with existing AI agents?

Yes, Semble supports integration as an MCP server or via command line, compatible with agents like Claude Code, Codex, Cursor, and OpenCode, among others.

Is Semble suitable for large enterprise codebases?

While promising, its performance in large-scale, complex environments requires further validation. Its fast indexing and retrieval suggest it could scale well, but real-world testing is ongoing.

Does Semble require external services or GPUs?

No. Semble runs entirely on CPU without the need for API keys, GPUs, or external services, simplifying deployment and reducing costs.

What is the main advantage of Semble over transformer-based models?

Its primary advantage is token efficiency and speed, enabling instant code snippets with minimal resource consumption, while maintaining high retrieval accuracy.

Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep

Up next

The Questions That Unlock Better Self-Discovery Fast

Author

CipherDot Team

Share article

Why It Matters

FOXWELL NT301 OBD2 Scanner Live Data Professional Mechanic OBDII Diagnostic Code Reader Tool for Check Engine Light