huggingface.co
Total runs: 0
24-hour runs: 0
7-day runs: 0
30-day runs: 0
Model's Last Updated: July 10 2025

Introduction of MODA

Model Details of MODA

MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding

[📃 Paper ] [📦 Code ] [⚒️ Project ] [📅 Slide ]

TL;DR

We i) identify attention deficit disorder as a critical barrier hindering fine-grained content understanding in MLLMs; ii) introduce a modular duplex attention mechanism to mitigate modality bias and enhance attention score justification; and iii) develop MODA-based MLLMs that enable fine-grained multimodal understanding across perception, cognition, and emotion tasks.

Abstract

Multimodal large language models (MLLMs) recently showed strong capacity in integrating data among multiple modalities, empowered by a generalizable attention architecture. Advanced methods predominantly focus on language-centric tuning while less exploring multimodal tokens mixed through attention, posing challenges in high-level tasks that require fine-grained cognition and emotion understanding. In this work, we identify the attention deficit disorder problem in multimodal learning, caused by inconsistent cross-modal attention and layer-by-layer decayed attention activation. To address this, we propose a novel attention mechanism, termed MOdular Duplex Attention (MODA), simultaneously conducting the inner-modal refinement and inter-modal interaction. MODA employs a correct-after-align strategy to effectively decouple modality alignment from cross-layer token mixing. In the alignment phase, tokens are mapped to duplex modality spaces based on the basis vectors, enabling the interaction between visual and language modality. Further, the correctness of attention scores is ensured through adaptive masked attention, which enhances the model's flexibility by allowing customizable masking patterns for different modalities. Extensive experiments on 21 benchmark datasets verify the effectiveness of MODA in perception, cognition, and emotion tasks.

License

This project is released under the Apache-2.0 License . Parts of this project contain code and models from other sources, which are subject to their respective licenses.

Runs of KwaiVGI MODA on huggingface.co

0
Total runs
0
24-hour runs
0
3-day runs
0
7-day runs
0
30-day runs

More Information About MODA huggingface.co Model

MODA huggingface.co

MODA huggingface.co is an AI model on huggingface.co that provides MODA's model effect (), which can be used instantly with this KwaiVGI MODA model. huggingface.co supports a free trial of the MODA model, and also provides paid use of the MODA. Support call MODA model through api, including Node.js, Python, http.

KwaiVGI MODA online free

MODA huggingface.co is an online trial and call api platform, which integrates MODA's modeling effects, including api services, and provides a free online trial of MODA, you can try MODA online for free by clicking the link below.

KwaiVGI MODA online free url in huggingface.co:

https://huggingface.co/KwaiVGI/MODA

MODA install

MODA is an open source model from GitHub that offers a free installation service, and any user can find MODA on GitHub to install. At the same time, huggingface.co provides the effect of MODA install, users can directly use MODA installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

MODA install url in huggingface.co:

https://huggingface.co/KwaiVGI/MODA

Url of MODA

Provider of MODA huggingface.co

KwaiVGI
ORGANIZATIONS

Other API from KwaiVGI

huggingface.co

Total runs: 1.6K
Run Growth: -9.7K
Growth Rate: -601.73%
Updated:March 04 2025
huggingface.co

Total runs: 0
Run Growth: 0
Growth Rate: 0.00%
Updated:February 11 2025
huggingface.co

Total runs: 0
Run Growth: 0
Growth Rate: 0.00%
Updated:June 02 2025
huggingface.co

Total runs: 0
Run Growth: 0
Growth Rate: 0.00%
Updated:October 01 2025