impactframes / Janus-1.3B

huggingface.co
Total runs: 4
24-hour runs: 0
7-day runs: -2
30-day runs: -17
Model's Last Updated: October 19 2024
any-to-any

Introduction of Janus-1.3B

Model Details of Janus-1.3B

1. Introduction

Janus is a novel autoregressive framework that unifies multimodal understanding and generation. It addresses the limitations of previous approaches by decoupling visual encoding into separate pathways, while still utilizing a single, unified transformer architecture for processing. The decoupling not only alleviates the conflict between the visual encoder’s roles in understanding and generation, but also enhances the framework’s flexibility. Janus surpasses previous unified model and matches or exceeds the performance of task-specific models. The simplicity, high flexibility, and effectiveness of Janus make it a strong candidate for next-generation unified multimodal models.

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Github Repository

image
2. Model Summary

Janus is a unified understanding and generation MLLM, which decouples visual encoding for multimodal understanding and generation. Janus is constructed based on the DeepSeek-LLM-1.3b-base which is trained on an approximate corpus of 500B text tokens. For multimodal understanding, it uses the SigLIP-L as the vision encoder, which supports 384 x 384 image input. For image generation, Janus uses the tokenizer from here with a downsample rate of 16.

image
3. Quick Start

Please refer to Github Repository

4. License

This code repository is licensed under the MIT License . The use of Janus models is subject to DeepSeek Model License .

5. Citation
@misc{wu2024janus,
      title={Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation}, 
      author={Chengyue Wu and Xiaokang Chen and Zhiyu Wu and Yiyang Ma and Xingchao Liu and Zizheng Pan and Wen Liu and Zhenda Xie and Xingkai Yu and Chong Ruan and Ping Luo},
      year={2024},
      eprint={2410.13848},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2410.13848}, 
}
6. Contact

If you have any questions, please raise an issue or contact us at [email protected] .

Runs of impactframes Janus-1.3B on huggingface.co

4
Total runs
0
24-hour runs
-1
3-day runs
-2
7-day runs
-17
30-day runs

More Information About Janus-1.3B huggingface.co Model

More Janus-1.3B license Visit here:

https://choosealicense.com/licenses/mit

Janus-1.3B huggingface.co

Janus-1.3B huggingface.co is an AI model on huggingface.co that provides Janus-1.3B's model effect (), which can be used instantly with this impactframes Janus-1.3B model. huggingface.co supports a free trial of the Janus-1.3B model, and also provides paid use of the Janus-1.3B. Support call Janus-1.3B model through api, including Node.js, Python, http.

impactframes Janus-1.3B online free

Janus-1.3B huggingface.co is an online trial and call api platform, which integrates Janus-1.3B's modeling effects, including api services, and provides a free online trial of Janus-1.3B, you can try Janus-1.3B online for free by clicking the link below.

impactframes Janus-1.3B online free url in huggingface.co:

https://huggingface.co/impactframes/Janus-1.3B

Janus-1.3B install

Janus-1.3B is an open source model from GitHub that offers a free installation service, and any user can find Janus-1.3B on GitHub to install. At the same time, huggingface.co provides the effect of Janus-1.3B install, users can directly use Janus-1.3B installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

Janus-1.3B install url in huggingface.co:

https://huggingface.co/impactframes/Janus-1.3B

Url of Janus-1.3B

Provider of Janus-1.3B huggingface.co

impactframes
ORGANIZATIONS

Other API from impactframes

huggingface.co

Total runs: 0
Run Growth: 0
Growth Rate: 0.00%
Updated:December 20 2024