Current version: 0.5.2, Apr 15, 2022
Bioconvert¶
Bioconvert is a collaborative project to facilitate the interconversion of life science data from one format to another.
- contributions
Want to add a convertor ? Please join https://github.com/bioconvert/bioconvert/issues/1
- issues
Overview¶
Life science uses many different formats. They may be old, or with complex syntax and converting those formats may be a challenge. Bioconvert aims at providing a common tool / interface to convert life science data formats from one to another.
Many conversion tools already exist but they may be dispersed, focused on few specific formats, difficult to install, or not optimised. With Bioconvert, we plan to cover a wide spectrum of format conversions; we will re-use existing tools when possible and provide facilities to compare different conversion tools or methods via benchmarking. New implementations are provided when considered better than existing ones.
In March 2022, we had 48 formats, 98 direct conversions (125 different methods).

Installation¶
In order to install bioconvert, you can use pip:
pip install bioconvert
We also provide releases on bioconda (http://bioconda.github.io/):
conda install bioconvert
and Singularity container are available. See http://bioconvert.readthedocs.io/en/master/user_guide.html#installation for details.
Usage¶
From the command line, you can convert a FastQ file into a FastA file as follows (compressed or not):
bioconvert fastq2fasta input.fastq output.fasta
bioconvert fastq2fasta input.fq output.fasta
bioconvert fastq2fasta input.fq.gz output.fasta.gz
bioconvert fastq2fasta input.fq.gz output.fasta.bz2
When there is no ambiguity, you can be implicit:
bioconvert input.fastq output.fasta
For help, just type:
bioconvert --help
bioconvert fastq2fasta --help
From a Python shell:
# import a converter
from bioconvert.fastq2fasta import FASTQ2FASTA
# Instanciate with infile/outfile names
convert = FASTQ2FASTA(infile, outfile)
# the conversion itself
convert()
Available Converters¶
Converters |
CI testing |
Benchmarking |
---|---|---|
Contributors¶
Setting up and maintaining Bioconvert has been possible thanks to users and contributors. Thanks to all:
Changes¶
Version |
Description |
---|---|
0.5.2 |
|
0.5.1 |
|
0.5.0 |
|
Complete documentation including User and Developer Guides¶
- 1. Installation
- 2. User Guide
- 3. Tutorial
- 4. Developer guide
- 4.1. Installation for developers
- 4.2. How to add a new conversion
- 4.3. One-to-many and many-to-one conversions
- 4.4. How to add a new method to an existing converter
- 4.5. Decorators
- 4.6. How to add a test
- 4.7. How to add a test file
- 4.8. How to locally run the tests
- 4.9. How to benchmark your new method vs others
- 4.10. How to add you new converter to the main documentation ?
- 4.11. pep8 and conventions
- 4.12. Requirements files
- 4.13. How to update bioconvert on bioconda
- 4.14. Sphinx Documentation
- 4.15. Docker
- 5. Benchmarking
- 6. Gallery
- 7. References
- 8. Formats
- 8.1. TWOBIT
- 8.2. AGP
- 8.3. ABI
- 8.4. ASQG
- 8.5. BAI
- 8.6. BAM
- 8.7. BCF
- 8.8. BCL
- 8.9. BED for plink
- 8.10. BEDGRAPH
- 8.11. BED
- 8.12. BED3
- 8.13. BED4
- 8.14. BED5
- 8.15. BED6
- 8.16. BED12
- 8.17. BIGBED
- 8.18. BIGWIG
- 8.19. BIM
- 8.20. BZ2
- 8.21. COV
- 8.22. CRAM
- 8.23. CLUSTAL
- 8.24. CSV
- 8.25. DSRC
- 8.26. EMBL
- 8.27. FAM
- 8.28. FAA
- 8.29. FASTA
- 8.30. FastG
- 8.31. FastQ
- 8.32. GENBANK
- 8.33. GENPEPT
- 8.34. GFA
- 8.35. GFF
- 8.36. GZ
- 8.37. JSON
- 8.38. MAF (Mutation Annotation Format)
- 8.39. MAF (Multiple Alignement Format)
- 8.40. MAP
- 8.41. NEWICK
- 8.42. NEXUS
- 8.43. ODS
- 8.44. PAF (Pairwise mApping Format)
- 8.45. PDB
- 8.46. PED
- 8.47. PHYLOXML
- 8.48. PHYLIP
- 8.49. PLINK flat files (MAP/PED)
- 8.50. PLINK binary files (BED/BIM/FAM)
- 8.51. QUAL
- 8.52. SAM
- 8.53. SCF
- 8.54. SRA
- 8.55. TSV
- 8.56. STOCKHOLM
- 8.57. VCF
- 8.58. WIG
- 8.59. WIGGLE (WIG)
- 8.60. XLS
- 8.61. XLSX
- 8.62. XMFA
- 8.63. YAML
- 8.64. Others
- 8.65. IG
- 9. Glossary
- 10. Faqs
- 11. Bibliography
- 12. Whats' new, what has changed