FRONTIER LLM  //  MODEL CARD

EchoNest

v0.1-alpha Training in progress Early access

A compact instruction-following language model built around a novel hierarchical architecture — designed for coding assistance and general-purpose chat, with an emphasis on efficient inference on commodity hardware.

AT A GLANCE
Parameters102.8M
Context window512 tok
ArchitectureHierarchical
LSTM-attention
Progressepoch 1 / 20
LicenseEarly access
01

Overview

EchoNest is designed for coding assistance and general-purpose chat, with an emphasis on efficient inference on commodity hardware.

Unlike standard transformer stacks, EchoNest organizes computation into a three-tier nested structure that separates local sequential processing from global context integration. This makes the model lightweight and practical to run without dedicated accelerators.

THREE-TIER NESTED ARCHITECTURE
MacroNest · global context integration
MicroNest · mid-range composition
EchoUnit · local sequential processing
02

Model details

Parameters102.8M
ArchitectureHierarchical LSTM-attention (EchoNest)
Context window512 tokens
Vocabulary50,257 tokens (GPT-2 tokenizer)
Embedding768-dim, weight-tied output projection
Training objectiveInstruction tuning — next-token prediction on response tokens only
03

Training data

EchoNest v0.1 is instruction-tuned on a curated mix of general and code-focused datasets.

DATASETEXAMPLESCOVERAGE
Stanford Alpaca51,974General instruction following
Python Code Instructions18,612Python code generation
CodeSearchNet~30,000Python, JS, Go, Java, Ruby, PHP
Total~100,000prompt-response pairs

The model is trained directly on instruction-response pairs from initialization — there is no separate pre-training stage on raw text.

04

Training

OptimizerAdamW (weight decay 0.01)
Learning rate1e-4 peak, linear warmup + cosine decay
Effective batch size16 (batch 2, grad accumulation 8)
Label smoothing0.1
EMA decay0.999
HardwareCPU
05

Evaluation

Training in progress — results will be updated on completion.

Validation loss
Validation perplexity
Training loss
06

Intended use

Python code generation from natural-language descriptions
Instruction-following tasks in English
Lightweight deployment where a small, fast model is preferred over a large general-purpose one
07

Limitations

Context length
512 tokens. The model cannot reason over very long documents or multi-turn conversations exceeding this window.
Language
English only.
Code languages
Primarily Python in v0.1. Multilingual code support is planned for a future release.
Scale
At 102.8M parameters, EchoNest is designed for practical deployment rather than frontier capability. It is not intended to compete with large general-purpose models.
Early stage
v0.1-alpha is a research release. Outputs should be reviewed before use in production.
08

Roadmap

v0.1-alpha102.8M params · 512-token context · Alpaca + Python + multilingual codeTraining
v0.2Evaluation suite, inference optimisation, public weightsPlanned
v1.0Safety review, Agentic Rails integration, API releasePlanned

Build on EchoNest
while it's still early.

Join the early-access program to test the alpha, shape the roadmap, and get first access to the v1.0 API.

Get API access Talk to sales