CLI Reference

This page lists all available command-line parameters for the OmniGenBench (ogb) toolkit.
For usage examples, see the Command-Line Usage page.

Top-Level Command: `ogb`

The main CLI entry point.
Available subcommands:

To view global options:

ogb --help

Run inference with fine-tuned genomic foundation models.

Parameter	Type	Required	Default	Description
`--model`	str	✓	—	Model name or path (e.g., `yangheng/ogb_tfb_finetuned`).
`--sequence`	str	No	—	Single sequence or comma-separated sequences.
`--input-file`	str	No	—	JSON/CSV/TXT file containing sequences.
`--output-file`	str	No	`inference_results.json`	Output JSON file.
`--batch-size`	int	No	`32`	Inference batch size.
`--device`	str	No	auto	Device to run inference (`cuda:0`, `cpu`).

Note

At least one of --sequence or --input-file must be provided.

ogb autoinfer \
  --model yangheng/ogb_tfb_finetuned \
  --sequence "ATCGATCGATCGATCGATCGATCGATCGATCG" \
  --output-file tfb_predictions.json

Automatically fine-tune genomic foundation models.

Parameter	Type	Required	Default	Description
`--dataset`, `-d`	str	✓	—	Dataset name or path.
`--model`, `-m`	str	✓	—	Base model name or path.
`--tokenizer`	str	No	model default	Tokenizer to use.
`--output-dir`	str	No	—	Directory to save fine-tuned model.
`--num-epochs`	int	No	—	Training epochs.
`--batch-size`	int	No	—	Training batch size.
`--learning-rate`	float	No	—	Learning rate.
`--overwrite`	flag	No	—	Overwrite output directory.
`--trainer`	str	No	`accelerate`	Trainer backend.

ogb autotrain \
  --dataset yangheng/tfb_promoters \
  --model zhihan1996/DNABERT-2-117M \
  --num-epochs 10

Benchmark genomic models on standard datasets.

Parameter	Type	Required	Default	Description
`--model`, `-m`	str	✓	—	Model to benchmark.
`--benchmark`, `-b`	str	✓	—	Benchmark dataset (RGB, GUE, PGB, BEACON).
`--tokenizer`, `-t`	str	No	model default	Tokenizer to use.
`--trainer`	str	No	`accelerate`	Trainer backend.
`--overwrite`	flag	No	—	Overwrite existing benchmark results.

ogb autobench \
  --model yangheng/OmniGenome-186M \
  --benchmark RGB

Design RNA sequences for target secondary structures.

Parameter	Type	Required	Default	Description
`--structure`	str	✓	—	Target structure in dot-bracket notation.
`--model`	str	No	`yangheng/OmniGenome-186M`	Model for MLM-guided mutation.
`--mutation-ratio`	float	No	`0.1`	Fraction of nucleotides mutated each generation.
`--num-population`	int	No	`100`	GA population size.
`--num-generation`	int	No	`100`	Maximum generations.
`--output-file`	str	No	—	Save final sequences to file.

ogb rna_design \
  --structure "(((...)))" \
  --output-file designs.txt