Original Model Authors:
Paul Teiletche, Quentin Macé, Max Conti, Antonio Loison, Gautier Viaud, Pierre Colombo, Manuel Faysse
ONNX Conversion:
Kacper Łukawski (Qdrant)
About This Repository
This is an ONNX-converted version of ColModernVBERT, optimized for deployment with FastEmbed. The original model was developed by the authors listed above and is described in detail in their
paper
.
What's Different:
Model format: PyTorch → ONNX
Optimization: Configured for efficient CPU/GPU inference
Integration: Ready for use with FastEmbed's multimodal embedding APIs
What's Preserved:
Model architecture and weights
Tokenizer and processor configurations
Image preprocessing pipeline
Embedding dimensions and behavior
Model Description
ColModernVBERT is a vision-language model based on the ModernVBERT architecture, designed for document retrieval tasks. The
ModernVBERT paper
demonstrates that this 250M-parameter model achieves state-of-the-art performance in its size class, matching models up to 10x larger on visual document retrieval benchmarks.
It combines:
ModernBERT text encoder (jhu-clsp/ettin-encoder-150m)
ColModernVBERT achieves competitive performance with models up to 10x larger on visual document retrieval tasks. For detailed benchmarks and evaluation results, please refer to the
original paper
and
model card
.
Note:
ONNX inference performance may differ slightly from PyTorch due to optimizations and runtime differences. The model architecture and weights are preserved to maintain embedding quality.
License
This model is released under the MIT License, consistent with the original ColModernVBERT model.
Original Model License:
Copyright (c) 2025 Paul Teiletche, Quentin Macé, Max Conti, Antonio Loison, Gautier Viaud, Pierre Colombo, Manuel Faysse
If you use this model in your work, please cite the original ModernVBERT paper:
@misc{teiletche2025modernvbertsmallervisualdocument,
title={ModernVBERT: Towards Smaller Visual Document Retrievers},
author={Paul Teiletche and Quentin Macé and Max Conti and Antonio Loison and Gautier Viaud and Pierre Colombo and Manuel Faysse},
year={2025},
eprint={2510.01149},
archivePrefix={arXiv},
primaryClass={cs.IR},
url={https://arxiv.org/abs/2510.01149},
}
If you use this ONNX conversion specifically, you may also acknowledge:
This work builds upon the excellent ColModernVBERT model developed by Paul Teiletche, Quentin Macé, Max Conti, Antonio Loison, Gautier Viaud, Pierre Colombo, and Manuel Faysse. We thank them for releasing their work under the MIT License, enabling derivative works like this ONNX conversion.
The ONNX conversion was performed by the Qdrant team to enable efficient deployment through the FastEmbed library.
Runs of Qdrant colmodernvbert on huggingface.co
128
Total runs
0
24-hour runs
-2
3-day runs
21
7-day runs
77
30-day runs
More Information About colmodernvbert huggingface.co Model
colmodernvbert huggingface.co is an AI model on huggingface.co that provides colmodernvbert's model effect (), which can be used instantly with this Qdrant colmodernvbert model. huggingface.co supports a free trial of the colmodernvbert model, and also provides paid use of the colmodernvbert. Support call colmodernvbert model through api, including Node.js, Python, http.
colmodernvbert huggingface.co is an online trial and call api platform, which integrates colmodernvbert's modeling effects, including api services, and provides a free online trial of colmodernvbert, you can try colmodernvbert online for free by clicking the link below.
Qdrant colmodernvbert online free url in huggingface.co:
colmodernvbert is an open source model from GitHub that offers a free installation service, and any user can find colmodernvbert on GitHub to install. At the same time, huggingface.co provides the effect of colmodernvbert install, users can directly use colmodernvbert installed effect in huggingface.co for debugging and trial. It also supports api for free installation.