Skip to main content

The First LAS App โ€” nemo3super

A zero-config private RAG chatbot powered by NVIDIA Nemotron 3 Super 120B. Drop your docs into a folder. Chat with them in your browser. Your data stays local.

๐Ÿ“ฅ Download the latest releaseโ€‹


What It Isโ€‹

nemo3super is the reference implementation for AIoOS's first pillar (Memory). It proves the LAS thesis in the smallest possible form:

  • One binary. One double-click. One private knowledge base.
  • Drop .pdf / .docx / .xlsx / .pptx / .csv / .md / .txt / .html / .py / .json into docs/ โ€” the watcher embeds them in ~2 seconds.
  • Ask anything. Get answers with source citations.
  • Your files never leave your machine. Only embedding vectors round-trip to NVIDIA's free-tier API (or, with DGX Spark, stay 100% local).

What It Provesโ€‹

Claim from AIoOS thesisHow nemo3super demonstrates it
"LAS means AI you own"MIT license. Fork it. Ship your own.
"Data stays local"Raw docs never uploaded. Only query-time embeddings.
"Zero config"Double-click Start.bat. First-run wizard handles venv + API key.
"Production quality at day 1"10 public releases in 24 hours, fully tested.
"Built with AI-native dev"Every commit shipped via Claude Code CLI โ€” zero lines hand-written.

Shipping Statsโ€‹

  • ๐Ÿš€ 10 releases in the first 24 hours (v0.1.0 โ†’ v0.2.1)
  • ๐Ÿ’ป ~700 lines of Python (all generated by Claude Code)
  • ๐Ÿ“ฆ ~20 KB zip download (dependencies install on first run)
  • ๐ŸŒ Cross-platform โ€” Start.bat (Windows) / Start.command (macOS/Linux)
  • ๐Ÿ” Privacy-first โ€” your docs are your docs

Architectureโ€‹

User drops file into docs/
โ”‚
โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ watchdog Observer โ”‚ โ† daemon thread inside app.py
โ”‚ (auto-index in 2s) โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
โ”‚
โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ NVIDIA Embeddings โ”‚ โ† nv-embedqa-e5-v5
โ”‚ (chunks โ†’ vectors) โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
โ”‚
โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ ChromaDB (local) โ”‚ โ† persists between runs
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
โ”‚
โ–ผ (on query)
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ Nemotron 3 Super โ”‚ โ† 120B, via build.nvidia.com
โ”‚ (w/ retrieved ctx) โ”‚ (or local NIM on DGX Spark)
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
โ”‚
โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ Gradio Web UI โ”‚ โ† WeChat-style bubbles
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

Try It in 3 Minutesโ€‹

  1. Install Python 3.10+ from python.org
  2. Get a free NVIDIA API key at build.nvidia.com
  3. Download Nemotron-RAG.zip ยท extract ยท double-click Start.bat

First-run setup takes ~3 minutes. After that it's instant.


Fork Itโ€‹

nemo3super is MIT. If you're building something LAS-native, use it as a starting point:

git clone https://github.com/AIoOS-67/nemo3super
cd nemo3super

Swap the model. Swap the UI. Swap the vector DB. The architecture is yours.


What's Nextโ€‹

More apps are coming โ€” and we want yours. If you ship a LAS app, let us know: we'll feature it here.

Contact us โ†’