The First LAS App: nemo3super
A zero-config private RAG chatbot powered by NVIDIA Nemotron 3 Super 120B. Drop your docs into a folder. Chat with them in your browser. Your data stays local.
Download the latest release
What It Is
nemo3super is the reference implementation for AIoOS's first pillar (Memory). It proves the LAS thesis in the smallest possible form:
- One binary. One double-click. One private knowledge base.
- Drop `.pdf`/`.docx`/`.xlsx`/`.pptx`/`.csv`/`.md`/`.txt`/`.html`/`.py`/`.json` into `docs/` and the watcher embeds them in ~2 seconds.
- Ask anything. Get answers with source citations.
- Your files never leave your machine. Only embedding vectors round-trip to NVIDIA's free-tier API (or, with DGX Spark, stay 100% local).
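As a rough illustration of the drop-and-index step above, here is a minimal sketch of how a watcher callback might decide which files to index and split their text into chunks before embedding. The names (`SUPPORTED_EXTENSIONS`, `is_indexable`, `chunk_text`) and the chunk sizes are assumptions made for this example, not taken from the nemo3super source.

```python
from pathlib import Path

# Extensions from the feature list above; the constant name is hypothetical.
SUPPORTED_EXTENSIONS = {
    ".pdf", ".docx", ".xlsx", ".pptx", ".csv",
    ".md", ".txt", ".html", ".py", ".json",
}


def is_indexable(path: str) -> bool:
    """Return True if the file's extension is one the watcher should index."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS


def chunk_text(text: str, size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into fixed-size character chunks that overlap slightly.

    The size and overlap defaults are illustrative assumptions.
    """
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("need size > 0 and 0 <= overlap < size")
    step = size - overlap
    return [text[i:i + size] for i in range(0, len(text), step)]
```

The overlap keeps a sentence that straddles a chunk boundary retrievable from at least one chunk; a real indexer would likely split on tokens or paragraphs rather than raw characters.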
What It Proves
| Claim from AIoOS thesis | How nemo3super demonstrates it |
|---|---|
| "LAS means AI you own" | MIT license. Fork it. Ship your own. |
| "Data stays local" | Raw docs never uploaded. Only query-time embeddings. |
| "Zero config" | Double-click Start.bat. First-run wizard handles venv + API key. |
| "Production quality at day 1" | 10 public releases in 24 hours, fully tested. |
| "Built with AI-native dev" | Every commit shipped via Claude Code CLI; zero lines hand-written. |
Shipping Stats
- 10 releases in the first 24 hours (v0.1.0 → v0.2.1)
- ~700 lines of Python (all generated by Claude Code)
- ~20 KB zip download (dependencies install on first run)
- Cross-platform: `Start.bat` (Windows) / `Start.command` (macOS/Linux)
- Privacy-first: your docs are your docs
Architecture
User drops file into docs/
           │
           ▼
┌──────────────────────┐
│  watchdog Observer   │  ← daemon thread inside app.py
│  (auto-index in 2s)  │
└──────────┬───────────┘
           │
           ▼
┌──────────────────────┐
│  NVIDIA Embeddings   │  ← nv-embedqa-e5-v5
│  (chunks → vectors)  │
└──────────┬───────────┘
           │
           ▼
┌──────────────────────┐
│  ChromaDB (local)    │  ← persists between runs
└──────────┬───────────┘
           │
           ▼ (on query)
┌──────────────────────┐
│  Nemotron 3 Super    │  ← 120B, via build.nvidia.com
│  (w/ retrieved ctx)  │     (or local NIM on DGX Spark)
└──────────┬───────────┘
           │
           ▼
┌──────────────────────┐
│  Gradio Web UI       │  ← WeChat-style bubbles
└──────────────────────┘
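The flow in the diagram (embed chunks, store them, retrieve on query, pass context to the model) can be sketched with standard-library stand-ins. This is not the project's code: `toy_embed` is a hashed bag-of-words substitute for the remote embedding model, `ToyStore` is an in-memory substitute for ChromaDB, and `build_prompt` is a hypothetical helper showing how retrieved chunks might be tagged with their source file so the answer can cite them.

```python
import math
import re
import zlib


def toy_embed(text: str, dim: int = 256) -> list[float]:
    """Stand-in for the remote embedding call: a hashed bag-of-words vector."""
    vec = [0.0] * dim
    for word in re.findall(r"[a-z0-9]+", text.lower()):
        vec[zlib.crc32(word.encode("utf-8")) % dim] += 1.0
    return vec


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class ToyStore:
    """In-memory stand-in for the local vector database."""

    def __init__(self) -> None:
        self.rows: list[tuple[str, str, list[float]]] = []

    def add(self, source: str, chunk: str) -> None:
        """Embed a chunk and keep it alongside the file it came from."""
        self.rows.append((source, chunk, toy_embed(chunk)))

    def query(self, question: str, k: int = 2) -> list[tuple[str, str]]:
        """Return the k (source, chunk) pairs most similar to the question."""
        q = toy_embed(question)
        ranked = sorted(self.rows, key=lambda r: cosine(q, r[2]), reverse=True)
        return [(source, chunk) for source, chunk, _ in ranked[:k]]


def build_prompt(question: str, hits: list[tuple[str, str]]) -> str:
    """Place retrieved chunks, tagged by source file, ahead of the question."""
    context = "\n".join(f"[{source}] {chunk}" for source, chunk in hits)
    return (
        "Answer using only the context below and cite sources.\n"
        f"{context}\n\nQuestion: {question}"
    )
```

In the real app the prompt would be sent to the chat model; here it is only assembled, which is enough to see where the source citations come from.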
Try It in 3 Minutes
- Install Python 3.10+ from python.org
- Get a free NVIDIA API key at build.nvidia.com
- Download Nemotron-RAG.zip · extract · double-click `Start.bat`
First-run setup takes ~3 minutes. After that it's instant.
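The two prerequisites above (Python 3.10+ and an API key) are the kind of thing a first-run wizard could verify before doing anything else. The sketch below shows one way such a check might look; the function name and the `NVIDIA_API_KEY` environment variable are assumptions for illustration, and the actual wizard may store the key differently.

```python
import os
import sys


def missing_prerequisites(version_info: tuple, env: dict) -> list[str]:
    """Return human-readable problems; an empty list means ready to start."""
    problems = []
    if tuple(version_info[:2]) < (3, 10):
        problems.append("Python 3.10 or newer is required")
    # NVIDIA_API_KEY is an assumed variable name, used only for illustration.
    if not env.get("NVIDIA_API_KEY", "").strip():
        problems.append("NVIDIA API key is not set")
    return problems


if __name__ == "__main__":
    for problem in missing_prerequisites(sys.version_info, dict(os.environ)):
        print(f"Setup incomplete: {problem}")
```

Passing the version and environment in as arguments keeps the check easy to test without touching the real interpreter or shell.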
Fork It
nemo3super is MIT-licensed. If you're building something LAS-native, use it as a starting point:
git clone https://github.com/AIoOS-67/nemo3super
cd nemo3super
Swap the model. Swap the UI. Swap the vector DB. The architecture is yours.
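One way to keep components swappable in a fork is to code against a small interface rather than a specific backend. The sketch below is an assumption about how that could be arranged, not a description of the current codebase: `VectorStore`, `KeywordStore`, and `index_and_search` are hypothetical names, and the keyword matcher is only a placeholder for a real vector database.

```python
from typing import Protocol


class VectorStore(Protocol):
    """Hypothetical interface: any class with these two methods can plug in."""

    def add(self, doc_id: str, text: str) -> None: ...

    def search(self, query: str, k: int) -> list[str]: ...


class KeywordStore:
    """Trivial stand-in that ranks documents by words shared with the query."""

    def __init__(self) -> None:
        self.docs: dict[str, str] = {}

    def add(self, doc_id: str, text: str) -> None:
        self.docs[doc_id] = text

    def search(self, query: str, k: int) -> list[str]:
        words = set(query.lower().split())
        ranked = sorted(
            self.docs,
            key=lambda d: len(words & set(self.docs[d].lower().split())),
            reverse=True,
        )
        return ranked[:k]


def index_and_search(store: VectorStore, docs: dict[str, str], query: str) -> list[str]:
    """Depends only on the interface, so the backend can be replaced freely."""
    for doc_id, text in docs.items():
        store.add(doc_id, text)
    return store.search(query, k=1)
```

A different backend only needs to provide `add` and `search` with the same signatures; `index_and_search` does not change.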
What's Next
More apps are coming, and we want yours. If you ship a LAS app, let us know and we'll feature it here.