nvidia / instruction-data-guard

huggingface.co
Total runs: 1.3K
24-hour runs: 0
7-day runs: 108
30-day runs: 718
Model's Last Updated: September 23 2025

Model Details of instruction-data-guard

Model Overview

Description:

Instruction-Data-Guard is a deep-learning classification model that helps identify LLM poisoning attacks in datasets. It was trained on an instruction:response dataset and on LLM poisoning attacks against such data, so it works best on instruction:response datasets.

License/Terms of Use:

NVIDIA Open Model License Agreement

Reference:

The Internal State of an LLM Knows When It's Lying: https://arxiv.org/pdf/2304.13734

Model Architecture:

Architecture Type: FeedForward MLP
Network Architecture: 4 Layer MLP

Input:

Input Type(s): Text Embeddings
Input Format(s): Numerical Vectors
Input Parameters: 1D Vectors
Other Properties Related to Input: The text embeddings are generated by the Aegis Defensive Model. Each vector has length 4096.

Output:

Output Type(s): Classification Scores
Output Format: Array of shape 1
Output Parameters: 1D
Other Properties Related to Output: The classification score represents the model's confidence that the input data is poisoned.
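
To make the input and output shapes above concrete, here is a minimal sketch of a four-layer feed-forward network that maps one 4096-dimensional embedding to an array of shape 1. It is not the released model: the hidden widths (`HIDDEN_SIZES`), the ReLU activations, the sigmoid on the output, and the random weights are all assumptions chosen for illustration, since the card does not specify them.

```python
import math
import random

EMBEDDING_DIM = 4096         # vector length stated in the model card
HIDDEN_SIZES = (32, 16, 8)   # hypothetical widths; the card does not list them


def init_layer(n_in, n_out, rng):
    """Return (weights, biases) for one dense layer with small random weights."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.gauss(0.0, scale) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def init_mlp(rng):
    """Build four dense layers: 4096 -> 32 -> 16 -> 8 -> 1."""
    sizes = (EMBEDDING_DIM, *HIDDEN_SIZES, 1)
    return [init_layer(a, b, rng) for a, b in zip(sizes, sizes[1:])]


def dense(x, weights, biases):
    """Compute W @ x + b for a single input vector."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]


def forward(layers, embedding):
    """Map one embedding (1D, length 4096) to an array of shape 1 holding a score."""
    if len(embedding) != EMBEDDING_DIM:
        raise ValueError(f"expected {EMBEDDING_DIM} values, got {len(embedding)}")
    x = list(embedding)
    for weights, biases in layers[:-1]:
        x = [max(0.0, v) for v in dense(x, weights, biases)]  # ReLU (assumed)
    logit = dense(x, *layers[-1])[0]
    return [1.0 / (1.0 + math.exp(-logit))]  # sigmoid (assumed)


rng = random.Random(0)
layers = init_mlp(rng)
embedding = [rng.uniform(-1.0, 1.0) for _ in range(EMBEDDING_DIM)]  # stand-in for a real embedding
score = forward(layers, embedding)
print(len(score), score[0])
```

In practice the embedding would come from the Aegis Defensive Model rather than a random generator, and the weights would be loaded from the published checkpoint.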

Software Integration:

Runtime Engine(s):

Supported Hardware Microarchitecture Compatibility:

Preferred Operating System(s): Ubuntu 22.04/20.04

Model Version(s):

v1.0

Training, Testing, and Evaluation Datasets:

The data used to train this model contained synthetically-generated LLM poisoning attacks.

Evaluation Benchmarks:

Instruction-Data-Guard is evaluated based on two overarching criteria:

  • Success at identifying LLM poisoning attacks after the model was trained on examples of those attacks.
  • Success at identifying LLM poisoning attacks without any training on examples of those attacks.

Success is defined as an acceptable catch rate (the recall score for each attack) at a high specificity (e.g., 95%). An acceptable catch rate is one high enough to identify at least several poisoned records in the attack.
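
The success criterion above can be illustrated with a small sketch: pick a score threshold that keeps specificity on clean records at or above the target, then report recall separately for each attack at that threshold. The helper names (`threshold_at_specificity`, `recall_per_attack`), the attack labels, and every score below are made up for illustration; this is not the evaluation code used for the model.

```python
import math


def threshold_at_specificity(clean_scores, target_specificity=0.95):
    """Return a threshold so that at least the target share of clean records
    score at or below it. A record is flagged when its score is above it."""
    if not clean_scores:
        raise ValueError("need at least one clean score")
    if not 0.0 < target_specificity <= 1.0:
        raise ValueError("target_specificity must be in (0, 1]")
    ordered = sorted(clean_scores)
    # Number of clean records that must stay unflagged.
    needed = math.ceil(target_specificity * len(ordered))
    return ordered[needed - 1]


def specificity(clean_scores, threshold):
    """Share of clean records that are not flagged."""
    return sum(s <= threshold for s in clean_scores) / len(clean_scores)


def recall_per_attack(poisoned_scores, threshold):
    """Share of poisoned records flagged, computed separately per attack."""
    return {
        name: sum(s > threshold for s in scores) / len(scores)
        for name, scores in poisoned_scores.items()
    }


# Hypothetical scores: 20 clean records and two made-up attacks.
clean = [i / 100 for i in range(20)]
poisoned = {
    "attack_a": [0.9, 0.8, 0.15, 0.7],
    "attack_b": [0.05, 0.3, 0.1, 0.12],
}

threshold = threshold_at_specificity(clean, 0.95)
achieved = specificity(clean, threshold)
recalls = recall_per_attack(poisoned, threshold)
print(threshold, achieved, recalls)
```

Reporting recall per attack rather than one pooled number matches the wording above: an attack with a low but non-zero catch rate can still surface several poisoned records for manual review.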

Inference:

Engine: NeMo Curator and Aegis
Test Hardware:

  • A100 80GB GPU

How to Use in NeMo Curator:

The inference code is available on NeMo Curator's GitHub repository.
Check out this example notebook to get started.

Ethical Considerations:

NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.

Please report security vulnerabilities or NVIDIA AI Concerns here.

Model URL:

https://huggingface.co/nvidia/instruction-data-guard
