andreasjansson / blip-2

Answers questions about images

replicate.com
Total runs: 29.6M
24-hour runs: 0
7-day runs: 0
30-day runs: 0
Github
Model's Last Updated: November 21 2023

Introduction of blip-2

Model Details of blip-2

Readme

Unofficial BLIP-2 demo and API

Note that this is an unofficial implementation of BLIP-2 that is not associated with Salesforce.

Usage

Blip-2 is a model that answers questions about images. To use it, provide an image, and then ask a question about that image. For example, you can provide the following image:

Image

and then pose the following question:

What is this a picture of?

and get the output:

marina bay sands, singapore .

Blip-2 is also capable of captioning images. This works by sending the model a blank prompt, though we have an explicit toggle for image captioning in the UI & API.

You can also provide Blip-2 with more context when asking a question. For example, given the following image:

img

you can provide the output of a previous Q&A as context in question: ... answer: ... format like so:

question: what animal is this? answer: panda

and then pose an additional question:

what country is this animal from?

and get the output:

china

Model description

BLIP-2 is a generic and efficient pre-training strategy that easily harvests development of pretrained vision models and large language models (LLMs) for vision-language pretraining. BLIP-2 beats Flamingo on zero-shot VQAv2 (65.0 vs 56.3), establishing new state-of-the-art on zero-shot captioning (on NoCaps 121.6 CIDEr score vs previous best 113.2). Equipped with powerful LLMs (e.g. OPT, FlanT5), BLIP-2 also unlocks the new zero-shot instructed vision-to-language generation capabilities for various interesting applications! Learn more at the official repo

Citation
@misc{https://doi.org/10.48550/arxiv.2301.12597,
  doi = {10.48550/ARXIV.2301.12597},
  url = {https://arxiv.org/abs/2301.12597},
  author = {Li, Junnan and Li, Dongxu and Savarese, Silvio and Hoi, Steven},
  keywords = {Computer Vision and Pattern Recognition (cs.CV), FOS: Computer and information sciences, FOS: Computer and information sciences},
  title = {BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models},
  publisher = {arXiv},
  year = {2023},
  copyright = {Creative Commons Attribution 4.0 International}
}

Pricing of blip-2 replicate.com

Run time and cost

This model costs approximately $0.0015 to run on Replicate, or 666 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker .

This model runs on Nvidia A100 (80GB) GPU hardware . Predictions typically complete within 2 seconds.

Runs of andreasjansson blip-2 on replicate.com

29.6M
Total runs
0
24-hour runs
0
3-day runs
0
7-day runs
0
30-day runs

More Information About blip-2 replicate.com Model

blip-2 replicate.com

blip-2 replicate.com is an AI model on replicate.com that provides blip-2's model effect (Answers questions about images), which can be used instantly with this andreasjansson blip-2 model. replicate.com supports a free trial of the blip-2 model, and also provides paid use of the blip-2. Support call blip-2 model through api, including Node.js, Python, http.

andreasjansson blip-2 online free

blip-2 replicate.com is an online trial and call api platform, which integrates blip-2's modeling effects, including api services, and provides a free online trial of blip-2, you can try blip-2 online for free by clicking the link below.

andreasjansson blip-2 online free url in replicate.com:

https://replicate.com/andreasjansson/blip-2

blip-2 install

blip-2 is an open source model from GitHub that offers a free installation service, and any user can find blip-2 on GitHub to install. At the same time, replicate.com provides the effect of blip-2 install, users can directly use blip-2 installed effect in replicate.com for debugging and trial. It also supports api for free installation.

blip-2 install url in replicate.com:

https://replicate.com/andreasjansson/blip-2

blip-2 install url in github:

https://github.com/daanelson/cog-blip-2

Url of blip-2

Provider of blip-2 replicate.com

Other API from andreasjansson

replicate

Return CLIP features for the clip-vit-large-patch14 model

Total runs: 83.4M
Run Growth: 0
Growth Rate: 0.00%
Updated:March 06 2023
replicate

The DeepFloyd IF model has been initially released as a non-commercial research-only model. Please make sure you read and abide to the license before using it.

Total runs: 2.0M
Run Growth: 0
Growth Rate: 0.00%
Updated:April 29 2023
replicate

Create tileable animations with seamless transitions

Total runs: 529.1K
Run Growth: 0
Growth Rate: 0.00%
Updated:February 13 2023
replicate

Monster Labs' control_v1p_sd15_qrcode_monster ControlNet on top of SD 1.5

Total runs: 402.0K
Run Growth: 0
Growth Rate: 0.00%
Updated:September 18 2023
replicate

Stable Diffusion fined-tuned on frames from Monkey Island 1 and 2

Total runs: 2.4K
Run Growth: 0
Growth Rate: 0.00%
Updated:November 02 2022
replicate

Suite of models to evaluate the image quality of text-to-image models with respect to their input prompts.

Total runs: 1.9K
Run Growth: 0
Growth Rate: 0.00%
Updated:October 15 2024
replicate

FLUX.1 [dev] trained on the Replicate goo

Total runs: 1.3K
Run Growth: 0
Growth Rate: 0.00%
Updated:August 23 2024
replicate

An open-source model for program synthesis. Competitive with OpenAI Codex.

Total runs: 1.1K
Run Growth: 0
Growth Rate: 0.00%
Updated:May 25 2022
replicate

Flux LoRA trained on generated images of random shapes

Total runs: 897
Run Growth: 0
Growth Rate: 0.00%
Updated:August 22 2024
replicate

Llama-2 13B with support for grammars and jsonschema

Total runs: 748
Run Growth: 0
Growth Rate: 0.00%
Updated:September 20 2023
replicate

Monkey Island database for Retrieval-augmented Diffusion model

Total runs: 622
Run Growth: 0
Growth Rate: 0.00%
Updated:August 20 2022
replicate

Generate plasma shader equations

Total runs: 466
Run Growth: 0
Growth Rate: 0.00%
Updated:December 06 2023
replicate

Implicit neural differentiable FM synthesizer

Total runs: 390
Run Growth: 0
Growth Rate: 0.00%
Updated:October 28 2022
replicate

Generate music based on the Monkey Island theme using the FIGARO model

Total runs: 305
Run Growth: 0
Growth Rate: 0.00%
Updated:August 20 2022
replicate

Image similarity metric that that compares a reference image to a set of images.

Total runs: 299
Run Growth: 0
Growth Rate: 0.00%
Updated:October 15 2024
replicate

Generate "algorithmic symphony" one-liners with grammar-constrained CodeLlama

Total runs: 205
Run Growth: 0
Growth Rate: 0.00%
Updated:September 18 2023
replicate

Inpainting and video2video experiments with Wan 2.1

Total runs: 171
Run Growth: 0
Growth Rate: 0.00%
Updated:March 11 2025
replicate

Random rectangles, ellipses, lines, in random colors

Total runs: 87
Run Growth: 0
Growth Rate: 0.00%
Updated:August 22 2024