CLI Reference
This page lists all available command-line parameters for the OmniGenBench (ogb) toolkit.
For usage examples, see the Command-Line Usage page.
Top-Level Command: ogb
The main CLI entry point.
Available subcommands:
autoinfer— Run inference with fine-tuned modelsautotrain— Train or fine-tune genomic modelsautobench— Benchmark models on curated datasetsrna_design— Design RNA sequences for target secondary structures
To view global options:
ogb autoinfer
Run inference with fine-tuned genomic foundation models.
Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
--model |
str | ✓ | — | Model name or path (e.g., yangheng/ogb_tfb_finetuned). |
--sequence |
str | No | — | Single sequence or comma-separated sequences. |
--input-file |
str | No | — | JSON/CSV/TXT file containing sequences. |
--output-file |
str | No | inference_results.json |
Output JSON file. |
--batch-size |
int | No | 32 |
Inference batch size. |
--device |
str | No | auto | Device to run inference (cuda:0, cpu). |
Note
At least one of --sequence or --input-file must be provided.
Example
ogb autoinfer \
--model yangheng/ogb_tfb_finetuned \
--sequence "ATCGATCGATCGATCGATCGATCGATCGATCG" \
--output-file tfb_predictions.json
ogb autotrain
Automatically fine-tune genomic foundation models.
Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
--dataset, -d |
str | ✓ | — | Dataset name or path. |
--model, -m |
str | ✓ | — | Base model name or path. |
--tokenizer |
str | No | model default | Tokenizer to use. |
--output-dir |
str | No | — | Directory to save fine-tuned model. |
--num-epochs |
int | No | — | Training epochs. |
--batch-size |
int | No | — | Training batch size. |
--learning-rate |
float | No | — | Learning rate. |
--overwrite |
flag | No | — | Overwrite output directory. |
--trainer |
str | No | accelerate |
Trainer backend. |
Example
ogb autotrain \
--dataset yangheng/tfb_promoters \
--model zhihan1996/DNABERT-2-117M \
--num-epochs 10
ogb autobench
Benchmark genomic models on standard datasets.
Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
--model, -m |
str | ✓ | — | Model to benchmark. |
--benchmark, -b |
str | ✓ | — | Benchmark dataset (RGB, GUE, PGB, BEACON). |
--tokenizer, -t |
str | No | model default | Tokenizer to use. |
--trainer |
str | No | accelerate |
Trainer backend. |
--overwrite |
flag | No | — | Overwrite existing benchmark results. |
Example
ogb rna_design
Design RNA sequences for target secondary structures.
Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
--structure |
str | ✓ | — | Target structure in dot-bracket notation. |
--model |
str | No | yangheng/OmniGenome-186M |
Model for MLM-guided mutation. |
--mutation-ratio |
float | No | 0.1 |
Fraction of nucleotides mutated each generation. |
--num-population |
int | No | 100 |
GA population size. |
--num-generation |
int | No | 100 |
Maximum generations. |
--output-file |
str | No | — | Save final sequences to file. |