Bioconvert is a collaborative project to facilitate the interconversion of life science data from one format to another.
|contributions:||Please join the team to contribute: https://github.com/biokit/bioconvert/issues/1|
Life science uses many different formats. They may be old, or with complex syntax and converting those formats may be a challenge. bioconvert aims at providing a common tool / interface to convert life science data formats from one to another.
Many convertion tools already exist but they may be dispersed, focused on few specific formats, difficult to install, or not optimised. With bioconvert, we plan to cover a wide spectrum of format convertions; we will re-use existing tools when possible and provide facilities to compare different conversion tools or methods via benchmarking. New implementations are provided when considered better than existing ones.
In Aug 2018, we had 79 formats, 129 direct conversions (much more are possible calling bioconvert several times).
In June 2018, we had 66 formats, 120 direct conversions (much more are possible calling bioconvert several times).
In order to install bioconvert, you can use pip:
pip install bioconvert
This command installs bioconvert and its Python dependencies. Note, however, that bioconvert may need extra non-Python dependencies as indicated in this requirements file.
Since Jan 2018 we also provide some versions on bioconda. If you already have bioconda setup on your system, just type:
conda install bioconvert
Or if you have never done so, please add those channels before hand (provided you have installed conda):
conda config --add channels r conda config --add channels defaults conda config --add channels conda-forge conda config --add channels bioconda
From the command line, you can convert a fastq to fasta as follows (compressed or not):
bioconvert fastq2fasta input.fastq output.fasta bioconvert fastq2fasta input.fq output.fasta bioconvert fastq2fasta input.fq.gz output.fasta.gz bioconvert fastq2fasta input.fq.gz output.fasta.bz2 bioconvert --help
From Python shell:
# import a converter from bioconvert.fastq2fasta import Fastq2Fasta # Instanciate with infile/outfile names convert = Fastq2Fasta(infile, outfile) # the conversion itself convert()
User and Developer Guides¶
- 1. User Guide
- 2. Developer guide
- 3. Benchmarking
- 4. Gallery
- 5. References
- 6. Formats
- 6.1. ABI
- 6.2. ASQG
- 6.3. BAM format
- 6.4. BED format
- 6.5. FastA
- 6.6. FastG
- 6.7. FastQ
- 6.8. GFA format
- 6.9. Nexus
- 6.10. PAF (Pairwise mApping Format)
- 6.11. PLINK flat files (MAP/PED)
- 6.12. PLINK binary files (BED/BIM/FAM)
- 6.13. SAM format
- 6.14. Trace File Format - Sequence Chromatogram Format (SCF)
- 6.15. Stockholm
- 6.16. Wiggle Track format (WIG)
- 7. Converters not provided
- 8. Faqs
- 9. Glossary
- 10. Whats’ new, what has changed