Inseq CLI

The Inseq CLI is a command-line interface for the Inseq library. It enables repeated attribution of individual examples, and even entire 🤗 datasets, directly from the console. See the available options by typing inseq -h in the terminal after installing the package.

Three commands are supported:

  • inseq attribute: A wrapper enabling the use of model.attribute from the console.

  • inseq attribute-dataset: Extends attribute to entire datasets using the Hugging Face datasets.load_dataset API.

  • inseq attribute-context: Detects and attributes context dependence in generation tasks using the approach of Sarti et al. (2023).

attribute

The attribute command enables attribution of individual examples directly from the console. The command takes the following arguments:

class inseq.commands.attribute.attribute_args.AttributeWithInputsArgs(model_name_or_path: str | None = None, attribution_method: str | None = 'saliency', device: str = 'cpu', attributed_fn: str | None = None, attribution_selectors: list[int] | None = None, attribution_aggregators: list[str] | None = None, normalize_attributions: bool = False, model_kwargs: dict = <factory>, tokenizer_kwargs: dict = <factory>, generation_kwargs: dict = <factory>, attribution_kwargs: dict = <factory>, attribute_target: bool = False, generate_from_target_prefix: bool = False, step_scores: list[str] = <factory>, output_step_attributions: bool = False, include_eos_baseline: bool = False, batch_size: int = 8, aggregate_output: bool = False, hide_attributions: bool = False, save_path: str | None = None, viz_path: str | None = None, start_pos: int | None = None, end_pos: int | None = None, verbose: bool = False, very_verbose: bool = False, input_texts: list[str] | None = None, generated_texts: list[str] | None = None)

Attributes:

model_name_or_path (str): The name or path of the model on which attribution is performed.

attribution_method (Optional[str]): The attribution method used to perform feature attribution.

device (str): The device used for inference with PyTorch. Multi-GPU is not supported.

attributed_fn (Optional[str]): The attribution target used for the attribution method. Default: probability. If a step function requiring additional arguments is used (e.g. contrast_prob_diff), they should be specified using the attribution_kwargs argument.

attribution_selectors (Optional[list[int]]): The indices of the attribution scores to be used for the attribution aggregation. If specified, the aggregation function is applied only to the selected scores, and the other scores are discarded. If not specified, the aggregation function is applied to all scores.

attribution_aggregators (list[str]): The aggregators used to aggregate the attribution scores for each context. The outcome should produce one score per input token.

normalize_attributions (bool): Whether to normalize the attribution scores for each context. If True, the attribution scores for each context are normalized to sum up to 1, providing a relative notion of input salience.

model_kwargs (dict): Additional keyword arguments passed to the model constructor, in JSON format.

tokenizer_kwargs (dict): Additional keyword arguments passed to the tokenizer constructor, in JSON format.

generation_kwargs (dict): Additional keyword arguments passed to the generation method, in JSON format.

attribution_kwargs (dict): Additional keyword arguments passed to the attribution method, in JSON format.

attribute_target (bool): Performs the attribution procedure including the generated target prefix at every step.

generate_from_target_prefix (bool): Whether the generated_texts should be used as target prefixes for the generation process. If False, the generated_texts are used as full targets. This option is only available for encoder-decoder models, since for decoder-only models it is sufficient to prepend the prefix to the input string. Default: False.

step_scores (list[str]): Adds the specified step scores to the attribution output.

output_step_attributions (bool): Adds step-level feature attributions to the output.

include_eos_baseline (bool): Whether the EOS token should be included in the baseline, used by some attribution methods.

batch_size (int): The batch size used for the attribution computation. Default: 8.

aggregate_output (bool): If specified, the attribution output is aggregated using its default aggregator before saving.

hide_attributions (bool): If specified, the attribution visualization is not shown in the output.

save_path (Optional[str]): Path where the attribution output should be saved in JSON format.

viz_path (Optional[str]): Path where the attribution visualization should be saved in HTML format.

start_pos (Optional[int]): Start position for the attribution. Default: first token.

end_pos (Optional[int]): End position for the attribution. Default: last token.

verbose (bool): If specified, use INFO as the logging level for the attribution.

very_verbose (bool): If specified, use DEBUG as the logging level for the attribution.

input_texts (list[str]): One or more input texts used for generation.

generated_texts (Optional[list[str]]): If specified, constrains the decoding procedure to the specified outputs.
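
For illustration, a single-example run saving both the raw scores and an HTML visualization could look like the following. This is a minimal sketch: the model, input text, and JSON kwargs are placeholders, and the flag spellings are assumed to mirror the argument names listed above.

  inseq attribute \
    --model_name_or_path gpt2 \
    --attribution_method saliency \
    --input_texts "Hello ladies and" \
    --model_kwargs '{"low_cpu_mem_usage": true}' \
    --save_path attribution.json \
    --viz_path attribution.html

Since no generated_texts are provided, the model first generates a continuation of the input, and the attribution is computed over that generation.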

attribute-dataset

The attribute-dataset command extends the attribute command to entire datasets using the Hugging Face datasets.load_dataset API. The command takes the following arguments:

class inseq.commands.attribute_dataset.attribute_dataset_args.LoadDatasetArgs(dataset_name: str, input_text_field: str | None, generated_text_field: str | None = None, dataset_config: str | None = None, dataset_dir: str | None = None, dataset_files: list[str] | None = None, dataset_split: str | None = 'train', dataset_revision: str | None = None, dataset_auth_token: str | None = None, dataset_kwargs: dict | None = <factory>)

Attributes:

dataset_name (str): The type of dataset to be loaded for attribution.

input_text_field (Optional[str]): Name of the field containing the input texts used for attribution.

generated_text_field (Optional[str]): Name of the field containing the generated texts used for constrained decoding.

dataset_config (Optional[str]): The name of the Hugging Face dataset configuration.

dataset_dir (Optional[str]): Path to the directory containing the data files.

dataset_files (Optional[list[str]]): Paths to the dataset files.

dataset_split (Optional[str]): The dataset split to use. Default: train.

dataset_revision (Optional[str]): The Hugging Face dataset revision.

dataset_auth_token (Optional[str]): The authentication token used to access the Hugging Face dataset.

dataset_kwargs (Optional[dict]): Additional keyword arguments passed to the dataset constructor, in JSON format.

class inseq.commands.attribute.attribute_args.AttributeExtendedArgs(model_name_or_path: str | None = None, attribution_method: str | None = 'saliency', device: str = 'cpu', attributed_fn: str | None = None, attribution_selectors: list[int] | None = None, attribution_aggregators: list[str] | None = None, normalize_attributions: bool = False, model_kwargs: dict = <factory>, tokenizer_kwargs: dict = <factory>, generation_kwargs: dict = <factory>, attribution_kwargs: dict = <factory>, attribute_target: bool = False, generate_from_target_prefix: bool = False, step_scores: list[str] = <factory>, output_step_attributions: bool = False, include_eos_baseline: bool = False, batch_size: int = 8, aggregate_output: bool = False, hide_attributions: bool = False, save_path: str | None = None, viz_path: str | None = None, start_pos: int | None = None, end_pos: int | None = None, verbose: bool = False, very_verbose: bool = False)

Attributes:

model_name_or_path (str): The name or path of the model on which attribution is performed.

attribution_method (Optional[str]): The attribution method used to perform feature attribution.

device (str): The device used for inference with PyTorch. Multi-GPU is not supported.

attributed_fn (Optional[str]): The attribution target used for the attribution method. Default: probability. If a step function requiring additional arguments is used (e.g. contrast_prob_diff), they should be specified using the attribution_kwargs argument.

attribution_selectors (Optional[list[int]]): The indices of the attribution scores to be used for the attribution aggregation. If specified, the aggregation function is applied only to the selected scores, and the other scores are discarded. If not specified, the aggregation function is applied to all scores.

attribution_aggregators (list[str]): The aggregators used to aggregate the attribution scores for each context. The outcome should produce one score per input token.

normalize_attributions (bool): Whether to normalize the attribution scores for each context. If True, the attribution scores for each context are normalized to sum up to 1, providing a relative notion of input salience.

model_kwargs (dict): Additional keyword arguments passed to the model constructor, in JSON format.

tokenizer_kwargs (dict): Additional keyword arguments passed to the tokenizer constructor, in JSON format.

generation_kwargs (dict): Additional keyword arguments passed to the generation method, in JSON format.

attribution_kwargs (dict): Additional keyword arguments passed to the attribution method, in JSON format.

attribute_target (bool): Performs the attribution procedure including the generated target prefix at every step.

generate_from_target_prefix (bool): Whether the generated_texts should be used as target prefixes for the generation process. If False, the generated_texts are used as full targets. This option is only available for encoder-decoder models, since for decoder-only models it is sufficient to prepend the prefix to the input string. Default: False.

step_scores (list[str]): Adds the specified step scores to the attribution output.

output_step_attributions (bool): Adds step-level feature attributions to the output.

include_eos_baseline (bool): Whether the EOS token should be included in the baseline, used by some attribution methods.

batch_size (int): The batch size used for the attribution computation. Default: 8.

aggregate_output (bool): If specified, the attribution output is aggregated using its default aggregator before saving.

hide_attributions (bool): If specified, the attribution visualization is not shown in the output.

save_path (Optional[str]): Path where the attribution output should be saved in JSON format.

viz_path (Optional[str]): Path where the attribution visualization should be saved in HTML format.

start_pos (Optional[int]): Start position for the attribution. Default: first token.

end_pos (Optional[int]): End position for the attribution. Default: last token.

verbose (bool): If specified, use INFO as the logging level for the attribution.

very_verbose (bool): If specified, use DEBUG as the logging level for the attribution.
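
Combining the dataset-loading arguments with the attribution arguments above, a run over a slice of a 🤗 dataset could look like the following sketch, in which the dataset, text field, split slice, and model are illustrative placeholders:

  inseq attribute-dataset \
    --model_name_or_path gpt2 \
    --attribution_method saliency \
    --dataset_name stanfordnlp/sst2 \
    --input_text_field sentence \
    --dataset_split "validation[:10]" \
    --batch_size 4 \
    --save_path sst2_attributions.json

Here the sentence field of each example is used as the input text, and generation plus attribution proceed as in the single-example attribute case.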

attribute-context

The attribute-context command detects and attributes context dependence for generation tasks using the approach of Sarti et al. (2023). The command takes the following arguments:

class inseq.commands.attribute_context.attribute_context_args.AttributeContextArgs(show_intermediate_outputs: bool = False, save_path: str | None = None, add_output_info: bool = True, viz_path: str | None = None, show_viz: bool = True, model_name_or_path: str | None = None, attribution_method: str | None = 'saliency', device: str = 'cpu', attributed_fn: str | None = None, attribution_selectors: list[int] | None = None, attribution_aggregators: list[str] | None = None, normalize_attributions: bool = False, model_kwargs: dict = <factory>, tokenizer_kwargs: dict = <factory>, generation_kwargs: dict = <factory>, attribution_kwargs: dict = <factory>, context_sensitivity_metric: str = 'kl_divergence', handle_output_context_strategy: str = 'manual', contextless_output_next_tokens: list[str] = <factory>, prompt_user_for_contextless_output_next_tokens: bool = False, special_tokens_to_keep: list[str] = <factory>, decoder_input_output_separator: str = ' ', context_sensitivity_std_threshold: float = 1.0, context_sensitivity_topk: int | None = None, attribution_std_threshold: float = 1.0, attribution_topk: int | None = None, input_current_text: str = '', input_context_text: str | None = None, input_template: str | None = None, output_context_text: str | None = None, output_current_text: str | None = None, output_template: str | None = None, contextless_input_current_text: str | None = None, contextless_output_current_text: str | None = None)

Attributes:

show_intermediate_outputs (bool): If specified, the intermediate outputs produced by the Inseq library for context-sensitive target identification (CTI) and contextual cues imputation (CCI) are shown during the process.

save_path (Optional[str]): If present, the output of the two-step process is saved in JSON format at the specified path.

add_output_info (bool): If specified, additional information about the attribution process is added to the saved output.

viz_path (Optional[str]): If specified, the visualization produced from the output is saved in HTML format at the specified path.

show_viz (bool): If specified, the visualization produced from the output is shown in the terminal.

model_name_or_path (str): The name or path of the model on which attribution is performed.

attribution_method (Optional[str]): The attribution method used to perform feature attribution.

device (str): The device used for inference with PyTorch. Multi-GPU is not supported.

attributed_fn (Optional[str]): The attribution target used for the attribution method. Default: probability. If a step function requiring additional arguments is used (e.g. contrast_prob_diff), they should be specified using the attribution_kwargs argument.

attribution_selectors (Optional[list[int]]): The indices of the attribution scores to be used for the attribution aggregation. If specified, the aggregation function is applied only to the selected scores, and the other scores are discarded. If not specified, the aggregation function is applied to all scores.

attribution_aggregators (list[str]): The aggregators used to aggregate the attribution scores for each context. The outcome should produce one score per input token.

normalize_attributions (bool): Whether to normalize the attribution scores for each context. If True, the attribution scores for each context are normalized to sum up to 1, providing a relative notion of input salience.

model_kwargs (dict): Additional keyword arguments passed to the model constructor, in JSON format.

tokenizer_kwargs (dict): Additional keyword arguments passed to the tokenizer constructor, in JSON format.

generation_kwargs (dict): Additional keyword arguments passed to the generation method, in JSON format.

attribution_kwargs (dict): Additional keyword arguments passed to the attribution method, in JSON format.

context_sensitivity_metric (str): The contrastive metric used to detect context-sensitive tokens in output_current_text.

handle_output_context_strategy (str): Specifies how output context should be handled when it is produced together with the output current text and the two need to be separated for context sensitivity detection. Options:

  • manual: The user is prompted to verify an automatic context detection attempt, and optionally to provide the correct context separation manually.

  • auto: Attempts an automatic detection of the context using an alignment with the source context (assuming an MT-like task).

  • pre: If context is required but not pre-defined by the user via the output_context_text argument, execution fails instead of prompting the user for the output context.

contextless_output_next_tokens (list[str]): If specified, provides a list with one token per CCI output, indicating the next token that should be force-decoded as contextless output instead of the natural output produced by get_contextless_output. This is ignored if the attributed_fn used is not contrastive.

prompt_user_for_contextless_output_next_tokens (bool): If specified, the user is prompted to provide the next token that should be force-decoded as contextless output instead of the natural output produced by get_contextless_output. This is ignored if the attributed_fn used is not contrastive.

special_tokens_to_keep (list[str]): Special tokens to preserve in the generated string, e.g. a <brk> separator between context and current text.

decoder_input_output_separator (str): The separator used to split the input and output of the decoder. If not specified, the separator is a whitespace character.

context_sensitivity_std_threshold (float): Parameter controlling the selection of output_current_text tokens considered context-sensitive for moving onwards with attribution. Corresponds to the number of standard deviations above or below the mean context_sensitivity_metric score required for tokens to be considered context-sensitive.

context_sensitivity_topk (Optional[int]): If set, after selecting the salient context-sensitive tokens with context_sensitivity_std_threshold, only the top-K remaining tokens are used. By default, no top-k selection is performed.

attribution_std_threshold (float): Parameter controlling the selection of input_context_text and output_context_text tokens considered salient as a result of the attribution process. Corresponds to the number of standard deviations above or below the mean attribution_method score required for tokens to be considered salient. CCI scores for all context tokens are saved in the output, but this parameter controls which tokens are used in the visualization of context reliance.

attribution_topk (Optional[int]): If set, after selecting the most salient tokens with attribution_std_threshold, only the top-K remaining tokens are used. By default, no top-k selection is performed.

input_current_text (str): The input text used for generation. If the model is a decoder-only model, the input text is a prompt used for language modeling. If the model is an encoder-decoder model, the input text is the source text provided as input to the encoder. It will be formatted as {current} in the input_template.

input_context_text (Optional[str]): Additional input context influencing the generation of output_current_text. If the model is a decoder-only model, the input context is a prefix to the input_current_text prompt. If the model is an encoder-decoder model, the input context is part of the source text provided as input to the encoder. It will be formatted as {context} in the input_template.

input_template (Optional[str]): The template used to format model inputs. The template must contain at least the {current} placeholder, which will be replaced by input_current_text. If {context} is also specified, input-side context will be used. Can be modified for models requiring special tokens or formatting in the input text (e.g. <brk> tags to separate context and current inputs). Defaults to '{context} {current}' if input_context_text is provided, and to '{current}' otherwise.

output_context_text (Optional[str]): An output context for which context sensitivity should be detected. For encoder-decoder models, this is a target-side prefix to the output_current_text used as input to the decoder. For decoder-only models, this is a portion of the model generation that should be considered as additional context (e.g. a chain-of-thought sequence). It will be formatted as {context} in the output_template. If not provided but specified in the output_template, the output context will be generated along with the output current text, and user validation might be required to separate the two.

output_current_text (Optional[str]): The output text generated by the model when all available contexts are provided. Tokens in output_current_text will be tested for context sensitivity, and their generation will be attributed to input/target contexts (if present) in case they are found to be context-sensitive. If specified, this output is force-decoded. Otherwise, it is generated by the model using the infilled input_template and output_template. It will be formatted as {current} in the output_template.

output_template (Optional[str]): The template used to format model outputs. The template must contain at least the {current} placeholder, which will be replaced by output_current_text. If {context} is also specified, output-side context will be used. Can be modified for models requiring special tokens or formatting in the output text (e.g. <brk> tags to separate context and current outputs). Defaults to '{context} {current}' if output_context_text is provided, and to '{current}' otherwise.

contextless_input_current_text (Optional[str]): The input current text or template to use in the contrastive comparison with the contextual input. By default it is the same as input_current_text, but it can be useful in cases where the context is nested inside the current text (e.g. for an input_template like <user> {context} {current} <assistant>, this parameter can be used to format the contextless version as <user> {current} <assistant>). If it contains the tag {current}, it will be infilled with input_current_text. Otherwise, it will be used as-is for the contrastive comparison, enabling comparison with different inputs.

contextless_output_current_text (Optional[str]): The output current text or template to use in the contrastive comparison with the contextual output. By default it is the same as output_current_text, but it can be useful in cases where the context is nested inside the current text (e.g. for an output_template like <user> {context} {current} <assistant>, this parameter can be used to format the contextless version as <user> {current} <assistant>). If it contains the tag {current}, it will be infilled with output_current_text. Otherwise, it will be used as-is for the contrastive comparison, enabling comparison with different outputs.
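
Putting these arguments together, a minimal run with a decoder-only model could look like the following sketch (model and texts are illustrative, and flag spellings are assumed to mirror the argument names listed above). Using contrast_prob_diff as the attributed function makes the CCI step contrast the contextual and contextless predictions directly:

  inseq attribute-context \
    --model_name_or_path gpt2 \
    --attributed_fn contrast_prob_diff \
    --input_context_text "George was sick yesterday." \
    --input_current_text "His colleagues asked him if"

With these inputs, the command first identifies tokens in the generated continuation whose prediction changes when the input context is removed (CTI), and then attributes those tokens back to the context tokens driving the change (CCI).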