bs2csv

Description

Python script to extract metadata for a list of given BioSample ids from the NCBI BioSample database. Data is extracted from XML files and output to a CSV.

Note

CSV output works best when comparing metadata from different BioSamples in the same BioProject as the tag names in the XML will be consistent. Comparing runs from different BioProjects can result in a messy CSV output.

Local Usage

Setup

# [optional] create/load virtualenv
pip install -r requirements.txt

Example usage

python bs2csv.py input_ids.txt

input_ids.txt is a text file containing new line separated NCBI BioSample accession ids

Example usage with flags

python bs2csv.py input_ids.txt  -o metadata_output.csv -v values.txt

metadata_output.csv is the name of the desired output file. Defaults to biosample_metadata.csv
values.txt is a text file containing new line separated values that are used when extracting metadata
- only the information from tags found in values.txt will be stored in the output

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
LICENSE		LICENSE
README.md		README.md
bs2csv.py		bs2csv.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

bs2csv

Description

Local Usage

Setup

Example usage

Example usage with flags

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

serratus-bio/bs2csv

Folders and files

Latest commit

History

Repository files navigation

bs2csv

Description

Local Usage

Setup

Example usage

Example usage with flags

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages