Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

seqspec

github version pypi version python versions license

seqspec, short for “sequence specification” (pronounced “seek-speck”), is a file format that describes data generated from genomics experiments. Both the file format and seqspec tool enable uniform processing of genomics data.

alt text Figure 1: Anatomy of a seqspec file.

We have multiple tutorials to get you up and running with seqspec:

  1. Learn how to use seqspec to standardize your genomics data preprocessing.

  2. Understand how to manipulate seqspec files using the seqspec command-line tool.

Current release

seqspec 0.4.0 keeps the Python and Rust implementations aligned around the same core command set.

Citation

The seqspec format and tool are described in this publication. If you use seqspec please cite

Ali Sina Booeshaghi, Xi Chen, Lior Pachter, A machine-readable specification for genomics assays, Bioinformatics, Volume 40, Issue 4, April 2024, btae168.

seqspec was inspired by and builds off of the Teichmann Lab Single Cell Genomics Library Structure by Xi Chen.

Documentation

Rust implementation

The standalone Rust CLI supports library-ascii, seqspec-ascii, and seqspec-html in seqspec print. seqspec-png remains Python-only for now.

References
  1. Booeshaghi, A. S., Chen, X., & Pachter, L. (2024). A machine-readable specification for genomics assays. Bioinformatics, 40(4). 10.1093/bioinformatics/btae168
  2. Booeshaghi, A. S., Chen, X., & Pachter, L. (2024). A machine-readable specification for genomics assays. Bioinformatics, 40(4). 10.1093/bioinformatics/btae168