RxT-Alpha-Micro-Plus-Decoder-SFT huggingface.co api & ReactiveAI RxT-Alpha-Micro-Plus-Decoder-SFT github AI Model

Introduction of RxT-Alpha-Micro-Plus-Decoder-SFT

Model Details of RxT-Alpha-Micro-Plus-Decoder-SFT

RxT-Alpha Micro Plus Decoder (SFT)

Reactive Transformer Architecture

Experimental research model made to test our Reactive Transformer architecture and Attention-based Memory System.

RxT-Alpha Micro Plus

Extended architecture to improve results in Memory Reinforcement Learning. Changes vs. base RxT-Alpha Micro:

10 layers instead of 6

20 experts instead of 12

4 active experts instead of 2

symmetric SQA for memory cross-attention (4 key/value heads instead of 2)

7.5k vocab instead of 5k

better tokenizer training

Model has ~22.4M vs ~8.6M in base version.

Reactive Transformer has additional Short-Term Memory layers, connected to model with Memory Cross-Attention, and updated by Memory Encoder and Memory Attention. Short-Term Memory state is kept between interactions/event (single message), not between tokens in sequence - that's key difference between RxNNs and RNNs.

The goal of the architecture is to process only single messages and keep conversation history in Short-Term Memory - we believe, that this is the key requirement for awareness and AGI. Processing all the chat history on every interaction is not natural and that's not how human awareness is working. Then, Reactive Transformer architecture is a first step in transition from language models to awareness models.

This model (decoder) is a generator decoder for Reactive Transformer system and is made for first stage of training - base model pre-training.

During first stage, Memory Cross-Attention layers are frozen and STM is in default initial random state (normal distribution with 0 mean and almost 0 variance), to not disturb basic language modelling training. We are training decoder and encoder separately with shared embeddings. Then, in second stage - Memory Reinforcement Learning, they will be connected into bigger ensemble with additional Memory Norm and Memory Attention layers, and will learn how to keep and update memory.

RxT-Alpha models intentionally use very short sequence length and STM size (256 tokens for Micro), but that isn't their "full" context size - it's only for single message. "Full" context is theoretically infinite, restricted by STM size and memory abilites. That sizes are good for research, final models will handle SOTA contexts.

Decoder is based on Mixture-of-Experts architecture with 20 experts and 4 active ones.

RxT-Alpha Micro Plus Training

Micro models from RxT-Alpha series are first PoC for Reactive Transformer, Attention-Based Memory System and Memory Reinforcement Learning, used mainly to test library and architecture basics, before training bigger models (that are still relatively small, as it's PoC).

Decoder was trained on Autoregressive Language Modelling task with embedding from encoder pre-training , with roneneldan/TinyStories dataset, using 4B total tokens and reached ~75% accuracy .

Supervised Fine-Tuning

RxT-Alpha-Micro Plus models were fine-tuned to generate real-time interactions (sequences) on our improved synthetic dataset, inspired by TinyStories - ReactiveAI/TinyStories-Plus-Interaction-SFT .

Decoder reached the best validation loss and train/validation loss ratio after 7 epochs (~173M processed tokens)

Details

GPU: 1x L4
epochs: 7/20 (early stoppage)
lr: 2e-4 peak, cosine annealing schedule
batch size: 256
processed tokens: ~173M
loss: 0.5626 (validation) / 0.5580 (train)
accuracy: 86.6%

Next Stage: Memory Reinforcement Learning

The model is able to generate meaningful interactions, using grammatically correct sentences, and is ready for the memory training in the next stage. More info soon.

Decoder architecture details:

dim: 128
layers: 10
heads: 8
self-attention: symmetric Sparse Query Attention
- query/key/value groups: 4
memory cross-attention: symmetric Sparse Query Attention
- query/key/value groups: 4
Mixture-of-Experts Feed Forward
- experts: 20
- active experts: 4
- SwiGLU feed forward with 256 dim
RoPE
RMS Norm
vocab: 7.5k (english only)
message length: 256
STM size: 256 * 10 layers
size: ~22.4M (~6.5M Activated)
Library: RxNN
Docs: draft/in progress

Usage

Model requires RxNN framework for training/inference. It's integrated with HuggingFace Hub and libraries.

Inference:

Install RxNN, PyTorch and dependencies: pip install rxnn torch transformers tokenizers
Install Flash Attention (optional, but recommended) - details in RxNN framework docs

import torch
from rxnn.rxt.models import RxTAlphaDecoder
from rxnn.transformers.sampler import Sampler, SampleDecoder
from rxnn.training.tokenizer import load_tokenizer_from_hf_hub

model = RxTAlphaDecoder.from_pretrained('ReactiveAI/RxT-Alpha-Micro-Plus-Decoder-SFT')
tokenizer = load_tokenizer_from_hf_hub('ReactiveAI/RxT-Alpha-Micro-Plus-Decoder')
sampler = Sampler(model, torch.device('cuda' if torch.cuda.is_available() else 'cpu'), end_token_id=3)
sample = SampleDecoder(sampler, tokenizer)

# 0.1 and 0.9 are default values for temperature and top_p
generated = sample('[Q] Tell me a story about a little black dog [A]', temperature=0.1, top_p=0.9, max_seq_len=256)
sample('[Q] Tell me a story about a little black dog [A]', temperature=0.1, top_p=0.9, max_seq_len=256, print_stream=True)

Runs of ReactiveAI RxT-Alpha-Micro-Plus-Decoder-SFT on huggingface.co

Total runs

24-hour runs

3-day runs

7-day runs

-9

30-day runs

More Information About RxT-Alpha-Micro-Plus-Decoder-SFT huggingface.co Model

More RxT-Alpha-Micro-Plus-Decoder-SFT license Visit here:

https://choosealicense.com/licenses/apache-2.0

RxT-Alpha-Micro-Plus-Decoder-SFT huggingface.co

RxT-Alpha-Micro-Plus-Decoder-SFT huggingface.co is an AI model on huggingface.co that provides RxT-Alpha-Micro-Plus-Decoder-SFT's model effect (), which can be used instantly with this ReactiveAI RxT-Alpha-Micro-Plus-Decoder-SFT model. huggingface.co supports a free trial of the RxT-Alpha-Micro-Plus-Decoder-SFT model, and also provides paid use of the RxT-Alpha-Micro-Plus-Decoder-SFT. Support call RxT-Alpha-Micro-Plus-Decoder-SFT model through api, including Node.js, Python, http.

RxT-Alpha-Micro-Plus-Decoder-SFT huggingface.co Url

https://huggingface.co/ReactiveAI/RxT-Alpha-Micro-Plus-Decoder-SFT

ReactiveAI RxT-Alpha-Micro-Plus-Decoder-SFT online free

RxT-Alpha-Micro-Plus-Decoder-SFT huggingface.co is an online trial and call api platform, which integrates RxT-Alpha-Micro-Plus-Decoder-SFT's modeling effects, including api services, and provides a free online trial of RxT-Alpha-Micro-Plus-Decoder-SFT, you can try RxT-Alpha-Micro-Plus-Decoder-SFT online for free by clicking the link below.

ReactiveAI RxT-Alpha-Micro-Plus-Decoder-SFT online free url in huggingface.co:

https://huggingface.co/ReactiveAI/RxT-Alpha-Micro-Plus-Decoder-SFT

RxT-Alpha-Micro-Plus-Decoder-SFT install

RxT-Alpha-Micro-Plus-Decoder-SFT is an open source model from GitHub that offers a free installation service, and any user can find RxT-Alpha-Micro-Plus-Decoder-SFT on GitHub to install. At the same time, huggingface.co provides the effect of RxT-Alpha-Micro-Plus-Decoder-SFT install, users can directly use RxT-Alpha-Micro-Plus-Decoder-SFT installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

RxT-Alpha-Micro-Plus-Decoder-SFT install url in huggingface.co:

https://huggingface.co/ReactiveAI/RxT-Alpha-Micro-Plus-Decoder-SFT

huggingface.co

ReactiveAI/RxT-Beta-Mini-Decoder-Base

Total runs: 423

Run Growth: 0

Growth Rate: 0.00%

Updated:February 20 2026

huggingface.co

ReactiveAI/RxT-Beta-Micro-Encoder-Base

Total runs: 379

Run Growth: 0

Growth Rate: 0.00%

Updated:February 16 2026

huggingface.co

ReactiveAI/RxT-Beta-Micro-Decoder-Base

Total runs: 377

Run Growth: 0

Growth Rate: 0.00%

Updated:February 16 2026

huggingface.co

ReactiveAI/RxT-Beta-Micro-MLM-Base

Total runs: 349

Run Growth: 0

Growth Rate: 0.00%

Updated:February 16 2026

huggingface.co

ReactiveAI/RxT-Beta-Mini-Encoder-Base

Total runs: 252

Run Growth: 0

Growth Rate: 0.00%

Updated:February 20 2026

huggingface.co

ReactiveAI/RxT-Beta-Mini-MLM-Base

Total runs: 248

Run Growth: 0

Growth Rate: 0.00%

Updated:February 20 2026

huggingface.co

ReactiveAI/RxT-Beta-Decoder-Base

Total runs: 116

Run Growth: 37

Growth Rate: 31.90%

Updated:February 19 2026

huggingface.co

ReactiveAI/RxT-Beta-Encoder-Base

Total runs: 115

Run Growth: 73

Growth Rate: 63.48%

Updated:February 19 2026

huggingface.co

ReactiveAI/RxT-Beta-MLM-Base

Total runs: 106

Run Growth: 42

Growth Rate: 39.62%

Updated:February 19 2026

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Encoder-SMAT

Total runs: 99

Run Growth: 99

Growth Rate: 100.00%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-MemAttn-MRL

Total runs: 65

Run Growth: 54

Growth Rate: 83.08%

Updated:September 24 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Decoder-SMAT

Total runs: 61

Run Growth: 61

Growth Rate: 100.00%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-MemAttn-SMAT

Total runs: 60

Run Growth: 60

Growth Rate: 100.00%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Encoder-MRL

Total runs: 56

Run Growth: 42

Growth Rate: 75.00%

Updated:September 24 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Decoder-MRL

Total runs: 55

Run Growth: 46

Growth Rate: 83.64%

Updated:September 24 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Critic-MRL

Total runs: 53

Run Growth: 44

Growth Rate: 83.02%

Updated:September 24 2025

huggingface.co

ReactiveAI/RxT-Beta-Micro-Supervised

Total runs: 37

Run Growth: 37

Growth Rate: 100.00%

Updated:November 19 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Encoder-SFT

Total runs: 36

Run Growth: 29

Growth Rate: 80.56%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Encoder

Total runs: 33

Run Growth: 15

Growth Rate: 45.45%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Decoder-SFT

Total runs: 30

Run Growth: 23

Growth Rate: 76.67%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-MLM-SFT

Total runs: 30

Run Growth: 26

Growth Rate: 86.67%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Beta-Decoder-iSFT

Total runs: 29

Run Growth: -349

Growth Rate: -1203.45%

Updated:March 04 2026

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-MLM-SFT

Total runs: 28

Run Growth: 0

Growth Rate: 0.00%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Decoder

Total runs: 22

Run Growth: 13

Growth Rate: 59.09%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-MLM

Total runs: 21

Run Growth: 6

Growth Rate: 28.57%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Synthetic-Encoder-MRL

Total runs: 20

Run Growth: 12

Growth Rate: 60.00%

Updated:September 23 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-MemAttn-Self-Interlayer

Total runs: 17

Run Growth: 17

Growth Rate: 100.00%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Synthetic-Decoder-MRL

Total runs: 13

Run Growth: 6

Growth Rate: 46.15%

Updated:September 23 2025

huggingface.co

ReactiveAI/RxT-Alpha-Synthetic-Critic-MRL

Total runs: 12

Run Growth: 5

Growth Rate: 41.67%

Updated:September 23 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Supervised

Total runs: 12

Run Growth: 7

Growth Rate: 58.33%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-Encoder-I-SMAT

Total runs: 9

Run Growth: -1

Growth Rate: -11.11%

Updated:August 08 2025

huggingface.co

ReactiveAI/RxT-Alpha-Synthetic-Memory-Attention-MRL

Total runs: 9

Run Growth: 2

Growth Rate: 22.22%

Updated:September 23 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Encoder-SMAT

Total runs: 8

Run Growth: -48

Growth Rate: -600.00%

Updated:July 30 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-MemAttn-Self-Interlayer

Total runs: 8

Run Growth: -1

Growth Rate: -12.50%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-Decoder-I-SMAT

Total runs: 8

Run Growth: -2

Growth Rate: -25.00%

Updated:August 08 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-Encoder-SI-SMAT

Total runs: 7

Run Growth: -25

Growth Rate: -357.14%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-MemAttn-MRL

Total runs: 5

Run Growth: 0

Growth Rate: 0.00%

Updated:September 25 2025

huggingface.co

ReactiveAI/RxT-Beta-Encoder-iSFT

Total runs: 5

Run Growth: -439

Growth Rate: -8780.00%

Updated:March 04 2026

huggingface.co

ReactiveAI/RxT-Beta-MLM-iSFT

Total runs: 5

Run Growth: -367

Growth Rate: -7340.00%

Updated:March 04 2026

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-MemAttn-I-SMAT

Total runs: 4

Run Growth: -5

Growth Rate: -125.00%

Updated:August 08 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-MemAttn-Interlayer

Total runs: 4

Run Growth: -3

Growth Rate: -75.00%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-Decoder-SI-SMAT

Total runs: 4

Run Growth: -12

Growth Rate: -300.00%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-MemAttn-SI-SMAT

Total runs: 4

Run Growth: -11

Growth Rate: -275.00%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Decoder-SMAT

Total runs: 3

Run Growth: -36

Growth Rate: -1200.00%

Updated:July 30 2025

huggingface.co

ReactiveAI/MQA-Ref-Micro

Total runs: 3

Run Growth: 3

Growth Rate: 100.00%

Updated:October 04 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-MemAttn-SMAT

Total runs: 3

Run Growth: -36

Growth Rate: -1200.00%

Updated:July 30 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-MemAttn-Self

Total runs: 3

Run Growth: -17

Growth Rate: -566.67%

Updated:July 30 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-MemAttn-Simple

Total runs: 3

Run Growth: -7

Growth Rate: -233.33%

Updated:July 30 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-Decoder

Total runs: 2

Run Growth: -4

Growth Rate: -200.00%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-Decoder

Total runs: 2

Run Growth: 0

Growth Rate: 0.00%

Updated:June 05 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-MemAttn-Self-Interlayer

Total runs: 2

Run Growth: -3

Growth Rate: -150.00%

Updated:July 30 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Encoder

Total runs: 2

Run Growth: 0

Growth Rate: 0.00%

Updated:July 19 2025

huggingface.co

ReactiveAI/GQA-Ref-Micro

Total runs: 2

Run Growth: 2

Growth Rate: 100.00%

Updated:October 04 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-Encoder

Total runs: 2

Run Growth: -5

Growth Rate: -250.00%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-MLM

Total runs: 2

Run Growth: -3

Growth Rate: -150.00%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-SI-Supervised

Total runs: 2

Run Growth: -2

Growth Rate: -100.00%

Updated:August 09 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus

Total runs: 2

Run Growth: 0

Growth Rate: 0.00%

Updated:August 10 2025

huggingface.co

ReactiveAI/RxT-Alpha-Nano

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:October 07 2025

huggingface.co

ReactiveAI/RxT-Beta

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:February 16 2026

huggingface.co

ReactiveAI/sSQAT-m

Total runs: 0

Run Growth: -1

Growth Rate: 0.00%

Updated:October 04 2025

huggingface.co

ReactiveAI/SQAT-m

Total runs: 0

Run Growth: -1

Growth Rate: 0.00%

Updated:October 04 2025

huggingface.co

ReactiveAI/SQAT-mm

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:October 04 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-Supervised

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:October 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Supervised

Total runs: 0

Run Growth: -2

Growth Rate: 0.00%

Updated:October 08 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-MLM-SFT

Total runs: 0

Run Growth: -6

Growth Rate: 0.00%

Updated:July 30 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-Encoder

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:June 05 2025

huggingface.co

ReactiveAI/sSQAT-mm

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:October 04 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-MLM

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:June 05 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Decoder-Plus

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:May 29 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-I-Supervised

Total runs: 0

Run Growth: -3

Growth Rate: 0.00%

Updated:August 09 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-MemAttn-Interlayer

Total runs: 0

Run Growth: -43

Growth Rate: 0.00%

Updated:July 30 2025

huggingface.co

ReactiveAI/xSQAT-mm

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:May 08 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Beta-Micro-Supervised-AI

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:November 04 2025

huggingface.co

ReactiveAI/xSMQAT-m

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:May 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Decoder-SFT

Total runs: 0

Run Growth: -20

Growth Rate: 0.00%

Updated:July 30 2025

huggingface.co

ReactiveAI/xSQAT-m

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:May 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Decoder

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:July 19 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Encoder-SFT

Total runs: 0

Run Growth: -42

Growth Rate: 0.00%

Updated:July 30 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Supervised

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:October 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-MLM

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:July 19 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-Encoder-SFT

Total runs: 0

Run Growth: -10

Growth Rate: 0.00%

Updated:August 07 2025

ReactiveAI / RxT-Alpha-Micro-Plus-Decoder-SFT

Introduction of RxT-Alpha-Micro-Plus-Decoder-SFT

Model Details of RxT-Alpha-Micro-Plus-Decoder-SFT

RxT-Alpha Micro Plus Decoder (SFT)

Reactive Transformer Architecture

RxT-Alpha Micro Plus

RxT-Alpha Micro Plus Training

Supervised Fine-Tuning

Details

Next Stage: Memory Reinforcement Learning

Decoder architecture details:

Usage

Inference:

Runs of ReactiveAI RxT-Alpha-Micro-Plus-Decoder-SFT on huggingface.co

More Information About RxT-Alpha-Micro-Plus-Decoder-SFT huggingface.co Model

More RxT-Alpha-Micro-Plus-Decoder-SFT license Visit here:

RxT-Alpha-Micro-Plus-Decoder-SFT huggingface.co

RxT-Alpha-Micro-Plus-Decoder-SFT huggingface.co Url

ReactiveAI RxT-Alpha-Micro-Plus-Decoder-SFT online free

ReactiveAI RxT-Alpha-Micro-Plus-Decoder-SFT online free url in huggingface.co:

RxT-Alpha-Micro-Plus-Decoder-SFT install

RxT-Alpha-Micro-Plus-Decoder-SFT install url in huggingface.co:

Url of RxT-Alpha-Micro-Plus-Decoder-SFT

RxT-Alpha-Micro-Plus-Decoder-SFT huggingface.co Url

Provider of RxT-Alpha-Micro-Plus-Decoder-SFT huggingface.co

Other API from ReactiveAI