Unet-Segmentation: Optimized for Mobile Deployment
Real-time segmentation optimized for mobile and edge
UNet is a machine learning model that produces a segmentation mask for an image. The most basic use case will label each pixel in the image as being in the foreground or the background. More advanced usage will assign a class label to each pixel. This version of the model was trained on the data from Kaggle's Carvana Image Masking Challenge (see
https://www.kaggle.com/c/carvana-image-masking-challenge
) and is used for vehicle segmentation.
This model is an implementation of Unet-Segmentation found
here
.
This repository provides scripts to run Unet-Segmentation on Qualcomm® devices.
More details on model performance across various devices, can be found
here
.
Model Details
Model Type:
Semantic segmentation
Model Stats:
Model checkpoint: unet_carvana_scale1.0_epoch2
Input resolution: 224x224
Number of parameters: 31.0M
Model size: 118 MB
Number of output classes: 2 (foreground / background)
Profile Job summary of Unet-Segmentation
--------------------------------------------------
Device: SA8255 (Proxy) (13)
Estimated Inference Time: 157.83 ms
Estimated Peak Memory Range: 9.41-27.04 MB
Compute Units: NPU (51) | Total (51)
How does this work?
This
export script
leverages
Qualcomm® AI Hub
to optimize, validate, and deploy this model
on-device. Lets go through each step below in detail:
Step 1:
Compile model for on-device deployment
To compile a PyTorch model for on-device deployment, we first trace the model
in memory using the
jit.trace
and then call the
submit_compile_job
API.
import torch
import qai_hub as hub
from qai_hub_models.models.unet_segmentation import Model
# Load the model
torch_model = Model.from_pretrained()
# Device
device = hub.Device("Samsung Galaxy S23")
# Trace model
input_shape = torch_model.get_input_spec()
sample_inputs = torch_model.sample_inputs()
pt_model = torch.jit.trace(torch_model, [torch.tensor(data[0]) for _, data in sample_inputs.items()])
# Compile model on a specific device
compile_job = hub.submit_compile_job(
model=pt_model,
device=device,
input_specs=torch_model.get_input_spec(),
)
# Get target model to run on-device
target_model = compile_job.get_target_model()
Step 2:
Performance profiling on cloud-hosted device
After compiling models from step 1. Models can be profiled model on-device using the
target_model
. Note that this scripts runs the model on a device automatically
provisioned in the cloud. Once the job is submitted, you can navigate to a
provided job URL to view a variety of on-device performance metrics.
Unet-Segmentation huggingface.co is an AI model on huggingface.co that provides Unet-Segmentation's model effect (), which can be used instantly with this qualcomm Unet-Segmentation model. huggingface.co supports a free trial of the Unet-Segmentation model, and also provides paid use of the Unet-Segmentation. Support call Unet-Segmentation model through api, including Node.js, Python, http.
Unet-Segmentation huggingface.co is an online trial and call api platform, which integrates Unet-Segmentation's modeling effects, including api services, and provides a free online trial of Unet-Segmentation, you can try Unet-Segmentation online for free by clicking the link below.
qualcomm Unet-Segmentation online free url in huggingface.co:
Unet-Segmentation is an open source model from GitHub that offers a free installation service, and any user can find Unet-Segmentation on GitHub to install. At the same time, huggingface.co provides the effect of Unet-Segmentation install, users can directly use Unet-Segmentation installed effect in huggingface.co for debugging and trial. It also supports api for free installation.