NewPublic SAE feature database is live.
Research

Public SAE Database

A public database of sparse autoencoder features across layers. Inspect activation patterns, feature geometry, and learned representations as they emerge through the network.

In app settings

Experiment Study

May 2026
vivly.in×
AquinAquin

Structuring Social Data for AI

How Vivly used Reddit, X, and Hacker News discussions around Meta Ray-Ban glasses to build a structured JSONL training dataset, processed through a multi-stage pipeline and ingested into Aquin end-to-end.

Apr 2026

Experimental Weight Editor

experiment study

Agentic ROME on Pythia 2.8B: causal trace layer location, rank-one MLP updates, and a three-check validation loop that rolls back and retries on failure. Includes case studies on factuality, bias correction, and censor auditing.

Applied Research

Jun 2026

Simulating Training

applied

End-to-end simulation of the training loop: forward and backward pass, optimizer step, gradient dynamics, SAE feature evolution across checkpoints, behavioral diffs, confidence calibration, and regression tracking across runs. Everything the loss curve does not show.

May 2026

Embedding Models

applied

Geometry inspection, retrieval evaluation, fine-tuning monitoring, and embedding diff for any sentence-transformers compatible encoder. BGE, E5, GTE, Nomic, Jina, Instructor, MiniLM, and SBERT all get anisotropy scoring, UMAP exploration, layer-wise similarity, OOD detection, and hard-negative gap analysis.

May 2026

Transformers & LLMs

applied

How Aquin supports dense transformer LLMs, Mixture-of-Experts models, and hybrid architectures, from Llama and Mistral to Mixtral, DeepSeek, and Grok. Covers architecture-aware inspection, attribution, training monitoring, and evaluation across the full transformer family.

Apr 2026

Security

applied

Adversarial risk detection across the full ML pipeline: prompt injection and poisoned samples in training data, red teaming and jailbreak taxonomy in model inspection, model robustness scoring, weight trojan detection, and attack surface comparison across model versions in the training monitor.

Apr 2026

Training

applied

Live signal detection across five failure modes, gradient and loss monitoring per step, SAE feature diffs and behavioral model diffs post-training, and an agentic chat that reads from live training state at send time.

Apr 2026

Attribution

applied

Causal mediation analysis, SAE feature extraction, circuit attribution graph, logit lens, feature steering, UMAP exploration, fact verification, bias detection, and censor auditing, all in one pipeline on Llama 3.2 1B Instruct.

Apr 2026

Evals

applied

Consistency, suppression detection, and knowledge boundary probing. Behavioral evals that surface failure modes without requiring a trained SAE, and works on any TransformerLens-compatible model.

Apr 2026

Benchmarks

applied

InterpScore, FeaturePurityScore, and MUI for SAE feature evaluation, plus a conversational Benchmark Builder that works across all supported architectures, dense LLMs, MoE, hybrid, and embedding models. Describe what to measure, get a scored inline card exportable as CSV, JSON, image, or PDF.

Work with us

Interpretability tooling, custom SAE databases, mechanistic audits, circuit reports, and hands-on research, experiments, and studies for teams of all sizes. Reach us at aquin@aquin.app

Book a call

Not sure if Aquin is right for you?

SubstackMedium
© 2026 Aquin. All rights reserved.

Aquin