Research — Aquin Labs

NewEarly access is open. Apply for sessions, watch, simulate, and inspect, synced with the web.Early access is open. Apply now.

AquinLabs

Research Docs Community Get early access

Research

Experiments & Studies

Jul 2026

Interpreting Sarvam30B

experiment study

What 62 CLI probes against a 30B-parameter, HF-native, Mixture-of-Experts model tell us about how Aquin works with Sarvam30B end to end.

Jun 2026

Activation Steering for Bias Reduction

experiment study

SAE-localized activation steering on Llama-3.2-1B-Instruct: BBQ evaluation across Nationality, Religion, and Race/Ethnicity, multi-agent conversational stability, and boundary conditions for mechanistic debiasing.

May 2026

vivly.in×

Aquin

Structuring Social Data for AI

How Vivly used Reddit, X, and Hacker News discussions around Meta Ray-Ban glasses to build a structured JSONL training dataset, processed through a multi-stage pipeline and ingested into Aquin end-to-end.

Apr 2026

Experimental Weight Editor

experiment study

Agentic ROME on Pythia 2.8B: causal trace layer location, rank-one MLP updates, and a three-check validation loop that rolls back and retries on failure. Includes case studies on factuality, bias correction, and censor auditing.

Applied Research

Jun 2026

Training Sparse Autoencoders

applied

When inspect and steer need a dictionary on the model you have loaded: capture-activations, sae train, load sae --user, sae diff, and sae align as one orchestrator-backed toolchain.

Jun 2026

Simulating Training

applied

Analytical training forecast before GPU spend: dataset quality, LiSSA influence, NTK weight delta, and SAE gradient decomposition. Distinct from aquin watch (live metrics ingest). Includes embedding contrastive simulation via pairs-generate.

May 2026

Embedding Models

applied

Geometry inspection (layer-drift, isotropy, matrix, space), retrieval evals, SAE tools (sae-browser, sae-faithfulness, space-decomp), contrastive simulation, and aquin watch on encoder fine-tunes.

May 2026

Transformers & LLMs

applied

How Aquin supports dense transformer LLMs, Mixture-of-Experts models, and hybrid architectures, from Llama and Mistral to Mixtral, DeepSeek, and Grok. Covers architecture-aware inspection, attribution, training monitoring, and evaluation across the full transformer family.

Apr 2026

Security

applied

Red-team, audit, find-feature deception probes, jailbreak taxonomy, weight trojan detection, and robustness tracking across training runs via aquin watch.

Apr 2026

Training

applied

Live training monitor via aquin watch ingest: signal detection on loss/grad streams, post-run weight-diff, residual-drift, and SAE feature diff on real checkpoints. Not the same as aquin simulate.

Apr 2026

Attribution

applied

Causal mediation, SAE features, circuit graph, logit lens, steering, sae-stats, feature-logits, find-feature deception ranking, and UMAP — one pipeline on the loaded model.

Apr 2026

Evals

applied

Consistency, suppression, boundary, confidence-analysis, audit, and red-team evals — behavioral probes without a trained SAE. Custom evals on LLMs and embedding models.

Apr 2026

Benchmarks

applied

InterpScore, FeaturePurityScore, MUI, and sae-stats for dictionary health, plus the in-session Benchmark Builder (aquin benchmark). Dense LLM, MoE, and embedding models.

Work with us

Interpretability tooling, custom SAE databases, mechanistic audits, circuit reports, and hands-on research, experiments, and studies for teams of all sizes. Reach us at aquin@aquin.app

Book a call

Not sure if Aquin is right for you?

Join the Aquin Research Community

LLM researchers & ML engineers — open research, fellowships, hackathons, and early beta access.

Join Discord

Follow our research and open source work

GitHub

HuggingFace Research

Terms Privacy License Acceptable Use Security Research Community aquin@aquin.app

Aquin