pcannotate
Module¶
annotate_peptides¶
- proteoclade.pcannotate.annotate_peptides(file, db, pctaxa, taxon_levels=None, worker_threads=None)¶
Drives the taxonomic and gene annotation of peptide-containing files.
- Parameters
file (string) – csv or txt file containing wide-form, peptide entries
db (string) – PCDB file containing digested peptides to match w/ experiment
pctaxa (string) – PCTAXA file containing taxonomic mapping for species and above
taxon_levels (None, string, or tuple) – Which taxa to annotate above the organism level (default None)
worker_threads (None or integer) –
Number of worker threads to use. (default None)
if None: will use up to 6 threads.
Notes
Outputs csv or txt file with all data and appended taxonomic and gene annotations
‘annotated_’ + ‘denovo_matched’ + file
annotate_denovo¶
- proteoclade.pcannotate.annotate_denovo(file, db, pctaxa, method='dbconstrain', taxon_levels=None, worker_threads=None)¶
Drives the annotation of denovo/psm-containing files.
- Parameters
file (string) – csv or txt file containing long-form PSM entries
db (string) – PCDB file containing digested peptides to match w/ experiment
pctaxa (string) – PCTAXA file containing taxonomic mapping for species and above
method (string) –
“dbconstrain”: serially checks PSM candidates against the PCDB
”top”: only looks at top scoring PSM candidate (default: “dbconstrain”)
taxon_levels (None, string, or tuple) – Which taxa to annotate above the organism level (default None)
worker_threads (None or integer) –
Number of worker threads to use. (default None)
if None: will use up to 6 threads.
Notes
Output is csv or txt file with all data and appended taxonomic and gene annotations
‘denovo_matched_’ + file
‘annotated_’ + ‘denovo_matched’ + file
filter_taxa¶
- proteoclade.pcannotate.filter_taxa(file, taxon_levels, taxa, unique=False)¶
Filters peptide files based on desired taxa.
- Parameters
file (string) – csv or txt file containing wide-form, peptide entries
taxon_levels (string, list, or tuple) – Taxonomic ranks to include in file search. Must be annotated
taxa (string, list, or tuple) – Taxa to include in filter
unique (bool) – Whether specified taxa must be unique in their given taxonomic rank
Notes
Output is csv or txt file pared down by filter specifications.
‘filtered_’ + file name