Skip to main content

Chromosome Name Converter

ChrNameConverter

Different files using different chromosome naming systems for the same genome build presents a challenge during analysis. This tool is provided as a converter between chromosome naming systems for tab-delimited file formats such as BED and GFF.

Arabic Roman
chr1 chrI
chr2 chrII
chr3 chrIII
...
chr16 chrXVI

More specifically, the sacCer3 genome build uses an arabic numeral naming system (chr1 through chr16) or a roman numeral naming system (chrI through chrXVI). This script serves as a converter between these naming systems and includes options around the mitochondrial and 2-micron naming features.

chrM chrmt

Also, a mitochondiral chromosome name conversion is also included with the selection of a checkbox option.

Command Line Interface

Convert BED Chr Names

Usage:

java -jar ScriptManager.jar file-utilities convert-bed-genome [-ahmV] [-o=<output>] <coordFile>

Description:

Convert BED coordinate files between the standard SGD roman numeral chromosome names to the legacy SacCer3_cegr arabic numeral chromosome names.

Convert GFF Chr Names

Usage:

java -jar ScriptManager.jar file-utilities convert-gff-genome [-ahmV] [-o=<output>] <coordFile>

Description:

Convert GFF coordinate files between the standard SGD roman numeral chromosome names to the legacy SacCer3_cegr arabic numeral chromosome names.

Positional Input

Expects a single BED file or GFF file for input.

Output Options

OptionDescription
-o, --output=<output>specify output directory (name will be same as original with .bed ext)

Conversion Options

OptionDescription
-a, --to-arabicswitch converter to output arabic numeral chromsome names (default outputs roman numeral chrnames)
-m, --chrmtconverter will map "chrM" --> "chrmt" (default with no flag is "chrmt" --> "chrM")