RxT-Beta-Micro-Supervised huggingface.co api & ReactiveAI RxT-Beta-Micro-Supervised github AI Model

Introduction of RxT-Beta-Micro-Supervised

Model Details of RxT-Beta-Micro-Supervised

RxT-Beta-Micro-Supervised 270M

World's first experimental real-time Reactive Language Model (RxLM) trained on limited real-world data (after synthetic RxT-Alpha generation). It's based on revolutionary Reactive Transformer architecture - processing only single interactions/messages, with all the context moved to Short-Term Memory , managed by Attention-Based Memory System .

Docs in progress

Model Details

Model Description

First Reactive Language Model (RxLM) trained on limited real-world datasets, based on Reactive Transformer (RxT) architecture

RxLMs have linear computational/inference cost scaling ( O(NT) ) compared to LLMs quadratic growth ( O(N²T) ), where N is the number of messages in conversation and T is the number of tokens in single interaction. Thanks to that scaling, they are just N times faster and cheaper than LLMs .

That's not all from the advantages - event-driven real-time processing with memory is a lot more natural and human-like, than LLMs data-driven approach (processing full conversation history everytime). It's a crucial milestone in development of AGI and awareness models.

This is Supervised version of the model with "weak" memory system - result of Supervised Memory System Training (SMST). It's able to remember information between interactions (without passing it explicitly in prompt/chat template), but it has to be refined in next Memory Reinforcement Learning (MRL) stage for full functionality.

After successful experiments with simple synthetic datasets, we moved to real-world data, but this model still had limited amount of english-only data for pre-training - only 10B tokens from Wikipedia and FineWeb-Edu (+2B tokens in later stages). Then it could have limited general knowledge and should be fine-tuned for some specialization - for example, we trained RxT-Beta-Micro-Supervised-AI on AI/Data Science knowledge based chats.

Reactive Transformer Architecture

Experimental research model made to test our Reactive Transformer architecture and Attention-based Memory System.

Reactive Transformer has additional Short-Term Memory layers, connected to model with Memory Cross-Attention, and updated by Memory Encoder and Memory Attention. Short-Term Memory state is kept between interactions/event (single message), not between tokens in sequence - that's key difference between RxNNs and RNNs.

The goal of the architecture is to process only single messages and keep conversation history in Short-Term Memory - we believe, that this is the key requirement for awareness and AGI. Processing all the chat history on every interaction is not natural and that's not how human awareness is working. Then, Reactive Transformer architecture is a first step in transition from language models to awareness models.

To balance number of the parameters, decoder is based on Mixture-of-Experts architecture, while the encoder is using regular dense feed forward layers. This model is using gated self/interlayer version of memory attention network with sigmoid residual gates.

Architecture details:

dim: 256
layers: 14
heads (for split): 16
Decoder:
- self-attention: Sparse Query Attention
  - query heads: 8/16
  - key/value heads: 4/16
- memory cross-attention: Sparse Query Attention
  - query heads: 8/16
  - key/value heads: 4/16
- Mixture-of-Experts Feed Forward
  - experts: 42
  - active experts: 4
  - SwiGLU feed forward with 512 dim
- size: ~251M (~41M Activated)
Encoder:
- self-attention: symmetric Sparse Query Attention
  - query/key/value heads: 8/16
- SwiGLU feed forward with 768 dim
- size: ~18.3M
Memory Attention:
- variant: Gated Self/Interlayer Memory Attention
- attention layers: symmetric Sparse Query Attention
  - query/key/value heads: 8/16
- residual gate: elementwise with sigmoid activation (per STM slot)
- size: ~3.73M
RoPE for self-attention, memory cross-attention (query only) and memory attention (key only)
RMS Norm for all normalization layers
vocab: 32k (english only)
interaction (query + answer) length: 1024 tokens
STM size: 14 layers * 1024 slots (* 256 dim)
context/messages: Infinite
size: ~270M
Library: RxLM

Developed by: Adam Filipek & Reactive AI
Funded by: Reactive AI
Model type: Reactive Language Model (RxLM)
Language(s) (NLP): English
License: Reactive AI Model & Architecture License (RAML) v1.0

Model Sources

Repository: RxLM Framework
Paper: Reactive Transformer (RxT) - Stateful Real-Time Processing for Event-Driven Reactive Language Models
Demo: In progress

Uses

This model is still experimental and it was pre-trained on limited corpus with only 10B tokens, so it's general knowledge is also limited. It's recommended to further fine-tune the model for some specialization, like our RxT-Beta-Micro-Supervised-AI , that's trained on AI/Data Science based conversations.

Supervised RxT models are partially functional intermediate stage models - it's recommended to refine them in Memory Reinforcement Learning (MRL) and Reactive Reinforcement Learning from Human Feedback (RxRLHF) to reach final stage.

Direct Use

It's not recommended to use this model directly without additional specialization training or reinforcement learning stages.

Reactive Transformer models are made for conversational tasks, especially chatbots or as a stateful base for agentic systems.

Downstream Use

It's recommended to further fine-tune the model for some specialization, because of limited pre-training data. For the example, we trained RxT-Beta-Micro-Supervised-AI

Out-of-Scope Use

Reactive Transformer models are natively conversational and made for multi-step tasks. They aren't typical Gen AI and aren't made for single-step generative tasks (like summarization, dataset generation, etc.) - they will work in those scenarios, but it will be waste of computational resources (initializing/processing memory, when it's not needed). For that case it's better to use stateless LLM.

Bias, Risks, and Limitations

The model is still experimental, made to test Reactive Transformer architecture on real-world data, after succesful experiments with simple synthetic data. It was pre-trained on 10B tokens only (and additional 2B in next stages), so it's general knowledge is limited and responses could be inaccurate.

Conversation context is theoretically infinite (1024 tokens limit is only for single interaction), but after some number of messages model will slowly forget outdated information - that's why it's called Short-Term Memory . It will be extended in upcoming generations with Long-Term Memory for true infinite context.

Recommendations

As mentioned before, supervised models are in intermediate stage and it's recommended to continue the training in reinforcement learning stages. It's also recommended to fine-tune this base model for some specialization.

How to Get Started with the Model

Model could be loaded and used with our RxLM framework ( https://github.com/RxAI-dev/RxLM ):

import torch
from rxlm.rxt.models import RxTBeta
from rxlm.training.tokenizer import load_tokenizer_from_hf_hub

tokenizer = load_tokenizer_from_hf_hub('ReactiveAI/RxT-Beta-Micro')

model = RxTBeta.from_pretrained('RxT-Beta-Micro-Supervised', tokenizer=tokenizer)
model.share_components() # currently required to connect embeddings/STM

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
model.to(device)

seq_len = 1024

# Memory init - could be used as "system prompt" in LLMs (not recommended in this model, as it wasn't trained with system prompts)
stm_init_state = model.tokenize_full_interaction('System prompt like', 'Initial memory for the model', max_seq_len=seq_len, device=device)
model.init_stm_state(**stm_init_state)

# Helper function
def interaction(query: str):
  tokenized_query = model.tokenize_query(query, max_seq_len=seq_len, device=device)
  for token_id in model.interact(**tokenized_query, max_seq_len=seq_len, temperature=1.0):
    if token_id == -1: print('\n', '[Start memory update...]')
    elif token_id == -2: print('[Memory updated]')
    else:
      txt_token = model.stringify_token(token_id)
      print(txt_token, end='')

# Process first interaction
interaction('Hello! Who are you?')    
# Process follow-up interaction      
interaction('Follow-up question?')

Training Details

Stateful & real-time nature of Reactive Transformer architecture, especially asynchronous memory update, requires advanced training pipeline with multiple supervised and reinforcement learning stages:

Supervised:
- Joint Language Models Pre-Training | raw large text corpora
- Interaction Supervised Fine-Tuning | single, not connected interactions (query + answer)
- Self-Supervised Memory Attention Pre-Training | multi-step conversations (SMAT datasets)
- Supervised Memory-Aware Training (SMAT) | multi-step conversations
Reinforcement:
- Memory Reinforcement Learning (MRL) | multi-step conversations
- Reactive Reinforcement Learning from Human Feedback (RxRLHF) | multi-step conversations

Training Data

We used public open-source datasets for pre-training and our custom datasets (converted from public datasets) for other stages:

Joint Language Models Pre-Training
- 'sample-10BT' subset from HuggingFaceFW/fineweb-edu
- '20231101.en' subset from wikimedia/wikipedia
Interaction SFT
- ReactiveAI/smol-smoltalk-Interaction-SFT
- ReactiveAI/cosmopedia-100k-Interaction-SFT
Self-Supervised Memory Attention Pre-Training
- 30% of ReactiveAI/Real-Chat-SMAT
Supervised Memory-Aware Training (SMAT)
- ReactiveAI/Real-Chat-SMAT
- ReactiveAI/Real-Chat-No-System-SMAT

Training Procedure

Supervised Memory System Training includes 4 steps, before proceeding to Reinforcement Learning stages.

Joint Language Models Pre-Training

Decoder was trained with Encoder and additional MLM head model, using Joint LM Training (with MLM and Autoregressive loss), using HuggingFaceFW/fineweb-edu and wikimedia/wikipedia datasets. Both encoder and decoder are using shared embedding layer

Supervised Fine-Tuning

RxT-Beta Micro model was fine-tuned to real-time interactions (sequences) format on our datasets, derived from HuggingFace ones:

ReactiveAI/smol-smoltalk-Interaction-SFT
ReactiveAI/cosmopedia-100k-Interaction-SFT .

Models were fine-tuned using Joint LM Training mode (for memory cross-attention pre-training):

encode data with encoder and calculate MLM loss for it
save encoder layer's results as Short-Term Memory (available for decoder by memory cross-attention)
process data with decoder and calculate autoregressive loss

That training results in decoder with ~95% accuracy, because it has access to all next tokens information with memory cross-attention. In next training stages it will access previous interactions data with those layers.

Self-Supervised Memory Attention Pre-Training

Memory Attention was pre-trained to combine accumulated Short-Term Memory states with next interaction data processed by the encoder, using weighted mean (with randomized arbitrary weights) as labels and negative cosine similarity as loss. Label weights depending on inner step:

first step, when STM is in initial random normal state, using 90% of new encoded data
follow-up steps are using 50% - step * 5% of new encoded data
each step could have 0-15% random differences in weights

Additionally, random noise is added to both inputs and labels.

This model was trained on six arbitrary selected steps using single epoch on 30% from ReactiveAI/Real-Chat-SMAT dataset.

Supervised Memory-Aware Training

Finally, with pre-trained/fine-tuned components, in last supervised stage, model is trained to use previous/accumulated STM states as memory cross-attention input, instead of the same sequences as decoder's input:

previous (or first) interaction is processed by encoder and used to update memory
next interaction is processed by decoder, using related information from STM
loss is calculated from decoder's logits and gradients propagate through memory attention to encoder

We used staged memory-aware training with different datasets:

starting from 2 epochs on raw 80k examples (with 7 interactions) - ReactiveAI/Real-Chat-SMAT
then 5 epochs on filtered 27k better quality examples - ReactiveAI/Real-Chat-No-System-SMAT

Preprocessing

Pre-training is done on raw text corpora and it require only tokenization. In next stages, model is processing sequences in simple Interaction format , that's used instead complex chat templates - [Q] User's query... [A] Model's answer . For upcoming reasoning models, it will be extended to [Q] User's query... [T] Reasoning... [A] Model's answer

Training Hyperparameters

Training regime: bf16 mixed precision (AMP autocast)
Optimizer : AdamW
Scheduler : Cosine annealing

Evaluation

Evaluation is in progress - more details soon!

Testing Data, Factors & Metrics

Testing Data

[More Information Needed]

Factors

[More Information Needed]

Metrics

[More Information Needed]

Results

[More Information Needed]

Summary

Environmental Impact

Hardware Type: 4x NVIDIA A100 40GB
Hours used: 150

Model Card Contact

Adam Filipek - [email protected]

Licences - [email protected]

Runs of ReactiveAI RxT-Beta-Micro-Supervised on huggingface.co

Total runs

24-hour runs

3-day runs

7-day runs

30-day runs

More Information About RxT-Beta-Micro-Supervised huggingface.co Model

More RxT-Beta-Micro-Supervised license Visit here:

https://choosealicense.com/licenses/raml-v1.0

RxT-Beta-Micro-Supervised huggingface.co

RxT-Beta-Micro-Supervised huggingface.co is an AI model on huggingface.co that provides RxT-Beta-Micro-Supervised's model effect (), which can be used instantly with this ReactiveAI RxT-Beta-Micro-Supervised model. huggingface.co supports a free trial of the RxT-Beta-Micro-Supervised model, and also provides paid use of the RxT-Beta-Micro-Supervised. Support call RxT-Beta-Micro-Supervised model through api, including Node.js, Python, http.

RxT-Beta-Micro-Supervised huggingface.co Url

https://huggingface.co/ReactiveAI/RxT-Beta-Micro-Supervised

ReactiveAI RxT-Beta-Micro-Supervised online free

RxT-Beta-Micro-Supervised huggingface.co is an online trial and call api platform, which integrates RxT-Beta-Micro-Supervised's modeling effects, including api services, and provides a free online trial of RxT-Beta-Micro-Supervised, you can try RxT-Beta-Micro-Supervised online for free by clicking the link below.

ReactiveAI RxT-Beta-Micro-Supervised online free url in huggingface.co:

https://huggingface.co/ReactiveAI/RxT-Beta-Micro-Supervised

RxT-Beta-Micro-Supervised install

RxT-Beta-Micro-Supervised is an open source model from GitHub that offers a free installation service, and any user can find RxT-Beta-Micro-Supervised on GitHub to install. At the same time, huggingface.co provides the effect of RxT-Beta-Micro-Supervised install, users can directly use RxT-Beta-Micro-Supervised installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

RxT-Beta-Micro-Supervised install url in huggingface.co:

https://huggingface.co/ReactiveAI/RxT-Beta-Micro-Supervised

huggingface.co

ReactiveAI/RxT-Beta-Mini-Decoder-Base

Total runs: 423

Run Growth: 0

Growth Rate: 0.00%

Updated:February 20 2026

huggingface.co

ReactiveAI/RxT-Beta-Micro-Encoder-Base

Total runs: 379

Run Growth: 0

Growth Rate: 0.00%

Updated:February 16 2026

huggingface.co

ReactiveAI/RxT-Beta-Micro-Decoder-Base

Total runs: 377

Run Growth: 0

Growth Rate: 0.00%

Updated:February 16 2026

huggingface.co

ReactiveAI/RxT-Beta-Micro-MLM-Base

Total runs: 349

Run Growth: 0

Growth Rate: 0.00%

Updated:February 16 2026

huggingface.co

ReactiveAI/RxT-Beta-Mini-Encoder-Base

Total runs: 252

Run Growth: 0

Growth Rate: 0.00%

Updated:February 20 2026

huggingface.co

ReactiveAI/RxT-Beta-Mini-MLM-Base

Total runs: 248

Run Growth: 0

Growth Rate: 0.00%

Updated:February 20 2026

huggingface.co

ReactiveAI/RxT-Beta-Decoder-Base

Total runs: 116

Run Growth: 37

Growth Rate: 31.90%

Updated:February 19 2026

huggingface.co

ReactiveAI/RxT-Beta-Encoder-Base

Total runs: 115

Run Growth: 73

Growth Rate: 63.48%

Updated:February 19 2026

huggingface.co

ReactiveAI/RxT-Beta-MLM-Base

Total runs: 106

Run Growth: 42

Growth Rate: 39.62%

Updated:February 19 2026

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Encoder-SMAT

Total runs: 99

Run Growth: 99

Growth Rate: 100.00%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-MemAttn-MRL

Total runs: 65

Run Growth: 54

Growth Rate: 83.08%

Updated:September 24 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Decoder-SMAT

Total runs: 61

Run Growth: 61

Growth Rate: 100.00%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-MemAttn-SMAT

Total runs: 60

Run Growth: 60

Growth Rate: 100.00%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Encoder-MRL

Total runs: 56

Run Growth: 42

Growth Rate: 75.00%

Updated:September 24 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Decoder-MRL

Total runs: 55

Run Growth: 46

Growth Rate: 83.64%

Updated:September 24 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Critic-MRL

Total runs: 53

Run Growth: 44

Growth Rate: 83.02%

Updated:September 24 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Encoder-SFT

Total runs: 36

Run Growth: 29

Growth Rate: 80.56%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Encoder

Total runs: 33

Run Growth: 15

Growth Rate: 45.45%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Decoder-SFT

Total runs: 30

Run Growth: 23

Growth Rate: 76.67%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-MLM-SFT

Total runs: 30

Run Growth: 26

Growth Rate: 86.67%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Beta-Decoder-iSFT

Total runs: 29

Run Growth: -349

Growth Rate: -1203.45%

Updated:March 04 2026

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-MLM-SFT

Total runs: 28

Run Growth: 0

Growth Rate: 0.00%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Decoder

Total runs: 22

Run Growth: 13

Growth Rate: 59.09%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-MLM

Total runs: 21

Run Growth: 6

Growth Rate: 28.57%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Synthetic-Encoder-MRL

Total runs: 20

Run Growth: 12

Growth Rate: 60.00%

Updated:September 23 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-MemAttn-Self-Interlayer

Total runs: 17

Run Growth: 17

Growth Rate: 100.00%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Synthetic-Decoder-MRL

Total runs: 13

Run Growth: 6

Growth Rate: 46.15%

Updated:September 23 2025

huggingface.co

ReactiveAI/RxT-Alpha-Synthetic-Critic-MRL

Total runs: 12

Run Growth: 5

Growth Rate: 41.67%

Updated:September 23 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S-Supervised

Total runs: 12

Run Growth: 7

Growth Rate: 58.33%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-Encoder-I-SMAT

Total runs: 9

Run Growth: -1

Growth Rate: -11.11%

Updated:August 08 2025

huggingface.co

ReactiveAI/RxT-Alpha-Synthetic-Memory-Attention-MRL

Total runs: 9

Run Growth: 2

Growth Rate: 22.22%

Updated:September 23 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Encoder-SMAT

Total runs: 8

Run Growth: -48

Growth Rate: -600.00%

Updated:July 30 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-MemAttn-Self-Interlayer

Total runs: 8

Run Growth: -1

Growth Rate: -12.50%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-Decoder-I-SMAT

Total runs: 8

Run Growth: -2

Growth Rate: -25.00%

Updated:August 08 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-Encoder-SI-SMAT

Total runs: 7

Run Growth: -25

Growth Rate: -357.14%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-MemAttn-MRL

Total runs: 5

Run Growth: 0

Growth Rate: 0.00%

Updated:September 25 2025

huggingface.co

ReactiveAI/RxT-Beta-MLM-iSFT

Total runs: 5

Run Growth: -367

Growth Rate: -7340.00%

Updated:March 04 2026

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-MemAttn-I-SMAT

Total runs: 4

Run Growth: -5

Growth Rate: -125.00%

Updated:August 08 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-MemAttn-Interlayer

Total runs: 4

Run Growth: -3

Growth Rate: -75.00%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-Decoder-SI-SMAT

Total runs: 4

Run Growth: -12

Growth Rate: -300.00%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Beta-Encoder-iSFT

Total runs: 4

Run Growth: -441

Growth Rate: -11025.00%

Updated:March 04 2026

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-MemAttn-SI-SMAT

Total runs: 4

Run Growth: -11

Growth Rate: -275.00%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Decoder-SMAT

Total runs: 3

Run Growth: -36

Growth Rate: -1200.00%

Updated:July 30 2025

huggingface.co

ReactiveAI/MQA-Ref-Micro

Total runs: 3

Run Growth: 3

Growth Rate: 100.00%

Updated:October 04 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-MemAttn-SMAT

Total runs: 3

Run Growth: -36

Growth Rate: -1200.00%

Updated:July 30 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-MemAttn-Self

Total runs: 3

Run Growth: -17

Growth Rate: -566.67%

Updated:July 30 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-MemAttn-Simple

Total runs: 3

Run Growth: -7

Growth Rate: -233.33%

Updated:July 30 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-Decoder

Total runs: 2

Run Growth: -4

Growth Rate: -200.00%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-Decoder

Total runs: 2

Run Growth: 0

Growth Rate: 0.00%

Updated:June 05 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-MemAttn-Self-Interlayer

Total runs: 2

Run Growth: -3

Growth Rate: -150.00%

Updated:July 30 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Encoder

Total runs: 2

Run Growth: 0

Growth Rate: 0.00%

Updated:July 19 2025

huggingface.co

ReactiveAI/GQA-Ref-Micro

Total runs: 2

Run Growth: 2

Growth Rate: 100.00%

Updated:October 04 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-Encoder

Total runs: 2

Run Growth: -5

Growth Rate: -250.00%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-MLM

Total runs: 2

Run Growth: -3

Growth Rate: -150.00%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-SI-Supervised

Total runs: 2

Run Growth: -2

Growth Rate: -100.00%

Updated:August 09 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus

Total runs: 2

Run Growth: 0

Growth Rate: 0.00%

Updated:August 10 2025

huggingface.co

ReactiveAI/RxT-Alpha-Nano

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:October 07 2025

huggingface.co

ReactiveAI/RxT-Beta

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:February 16 2026

huggingface.co

ReactiveAI/sSQAT-m

Total runs: 0

Run Growth: -1

Growth Rate: 0.00%

Updated:October 04 2025

huggingface.co

ReactiveAI/SQAT-m

Total runs: 0

Run Growth: -1

Growth Rate: 0.00%

Updated:October 04 2025

huggingface.co

ReactiveAI/SQAT-mm

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:October 04 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-Supervised

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:October 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Supervised

Total runs: 0

Run Growth: -2

Growth Rate: 0.00%

Updated:October 08 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-MLM-SFT

Total runs: 0

Run Growth: -6

Growth Rate: 0.00%

Updated:July 30 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-Encoder

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:June 05 2025

huggingface.co

ReactiveAI/sSQAT-mm

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:October 04 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-MLM

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:June 05 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Decoder-Plus

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:May 29 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-I-Supervised

Total runs: 0

Run Growth: -3

Growth Rate: 0.00%

Updated:August 09 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-MemAttn-Interlayer

Total runs: 0

Run Growth: -43

Growth Rate: 0.00%

Updated:July 30 2025

huggingface.co

ReactiveAI/xSQAT-mm

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:May 08 2025

huggingface.co

ReactiveAI/RxT-Alpha-Mini-S

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:September 02 2025

huggingface.co

ReactiveAI/RxT-Beta-Micro-Supervised-AI

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:November 04 2025

huggingface.co

ReactiveAI/xSMQAT-m

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:May 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Decoder-SFT

Total runs: 0

Run Growth: -20

Growth Rate: 0.00%

Updated:July 30 2025

huggingface.co

ReactiveAI/xSQAT-m

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:May 02 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Decoder

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:July 19 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Encoder-SFT

Total runs: 0

Run Growth: -42

Growth Rate: 0.00%

Updated:July 30 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Supervised

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:October 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-MLM

Total runs: 0

Run Growth: 0

Growth Rate: 0.00%

Updated:July 19 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-Encoder-SFT

Total runs: 0

Run Growth: -10

Growth Rate: 0.00%

Updated:August 07 2025

huggingface.co

ReactiveAI/RxT-Alpha-Micro-Plus-Decoder-SFT

Total runs: 0

Run Growth: -9

Growth Rate: 0.00%

Updated:August 07 2025

ReactiveAI / RxT-Beta-Micro-Supervised

Introduction of RxT-Beta-Micro-Supervised

Model Details of RxT-Beta-Micro-Supervised

RxT-Beta-Micro-Supervised 270M

Model Details

Model Description

Reactive Transformer Architecture

Architecture details:

Model Sources

Uses

Direct Use

Downstream Use

Out-of-Scope Use

Bias, Risks, and Limitations

Recommendations

How to Get Started with the Model

Training Details

Training Data

Training Procedure

Joint Language Models Pre-Training

Supervised Fine-Tuning

Self-Supervised Memory Attention Pre-Training

Supervised Memory-Aware Training

Preprocessing

Training Hyperparameters

Evaluation

Testing Data, Factors & Metrics

Testing Data

Factors

Metrics

Results

Summary

Environmental Impact

Model Card Contact

Runs of ReactiveAI RxT-Beta-Micro-Supervised on huggingface.co

More Information About RxT-Beta-Micro-Supervised huggingface.co Model

More RxT-Beta-Micro-Supervised license Visit here:

RxT-Beta-Micro-Supervised huggingface.co

RxT-Beta-Micro-Supervised huggingface.co Url

ReactiveAI RxT-Beta-Micro-Supervised online free

ReactiveAI RxT-Beta-Micro-Supervised online free url in huggingface.co:

RxT-Beta-Micro-Supervised install

RxT-Beta-Micro-Supervised install url in huggingface.co:

Url of RxT-Beta-Micro-Supervised

RxT-Beta-Micro-Supervised huggingface.co Url

Provider of RxT-Beta-Micro-Supervised huggingface.co

Other API from ReactiveAI