EchoNest — Frontier LLM Models

Overview

EchoNest is designed for coding assistance and general-purpose chat, with an emphasis on efficient inference on commodity hardware.

Unlike standard transformer stacks, EchoNest organizes computation into a three-tier nested structure that separates local sequential processing from global context integration. This makes the model lightweight and practical to run without dedicated accelerators.

THREE-TIER NESTED ARCHITECTURE

MacroNest · global context integration

MicroNest · mid-range composition

EchoUnit · local sequential processing

Model details

Parameters102.8M

ArchitectureHierarchical LSTM-attention (EchoNest)

Context window512 tokens

Vocabulary50,257 tokens (GPT-2 tokenizer)

Embedding768-dim, weight-tied output projection

Training objectiveInstruction tuning — next-token prediction on response tokens only

Training data

EchoNest v0.1 is instruction-tuned on a curated mix of general and code-focused datasets.

DATASETEXAMPLESCOVERAGE

Stanford Alpaca51,974General instruction following

Python Code Instructions18,612Python code generation

CodeSearchNet~30,000Python, JS, Go, Java, Ruby, PHP

Total~100,000prompt-response pairs

The model is trained directly on instruction-response pairs from initialization — there is no separate pre-training stage on raw text.

Training

OptimizerAdamW (weight decay 0.01)

Learning rate1e-4 peak, linear warmup + cosine decay

Effective batch size16 (batch 2, grad accumulation 8)

Label smoothing0.1

EMA decay0.999

HardwareCPU

Evaluation

Training in progress — results will be updated on completion.

—

Validation loss

—

Validation perplexity

—

Training loss

Intended use

→Python code generation from natural-language descriptions

→Instruction-following tasks in English

→Lightweight deployment where a small, fast model is preferred over a large general-purpose one

Limitations

Context length

512 tokens. The model cannot reason over very long documents or multi-turn conversations exceeding this window.

Language

English only.

Code languages

Primarily Python in v0.1. Multilingual code support is planned for a future release.

Scale

At 102.8M parameters, EchoNest is designed for practical deployment rather than frontier capability. It is not intended to compete with large general-purpose models.

Early stage

v0.1-alpha is a research release. Outputs should be reviewed before use in production.

Roadmap

v0.1-alpha102.8M params · 512-token context · Alpaca + Python + multilingual codeTraining

v0.2Evaluation suite, inference optimisation, public weightsPlanned

v1.0Safety review, Agentic Rails integration, API releasePlanned