Module:Bio::Graph::IO::psi_xml

From BioPerl


Pdoc documentation: Bio::Graph::IO::psi_xml
CPAN documentation: Bio::Graph::IO::psi_xml


PSI XML Specifications

For more information, see the HUPO Proteomics Standards Initiative page or the initial publication (first).

References

<biblio>
#first pmid=14755292
</biblio>

Usage Notes

HPRD

Individual PSI XML files from HPRD can't be parsed as is because the fullName of the organism of an interacting protein is not specific. HPRD uses values like 'Mammalia' rather than the required species names, so Bio::Species objects can't be constructed. Although I haven't performed an accurate count, a simple grep suggests that there are thousands of interacting proteins labelled 'Mammalia'. Since HPRD says that it is concerned exclusively with the human proteome, it may be that one can globally replace 'Mammalia' with 'Homo sapiens'. On the other hand, it may be that an interaction can be included in HPRD when only one of the interacting pair is human. The definitive test could be performed using the identifiers (BIO 18:16, 31 December 2005 (EST)).
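The count and the global replacement discussed above could be sketched as a pre-processing step that runs before the file is handed to the parser. This is only an illustration in Python using the standard library, not part of BioPerl. It assumes that the species name sits in a fullName element somewhere below an organism element, and the helper names (local_name, organism_full_names, count_organisms, replace_organism) and the sample data are made up for this sketch.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def local_name(tag):
    """Strip a leading '{namespace}' from an element tag, if present."""
    return tag.rsplit("}", 1)[-1]


def organism_full_names(root):
    """Yield every fullName element found below an organism element."""
    for organism in root.iter():
        if local_name(organism.tag) != "organism":
            continue
        for child in organism.iter():
            if local_name(child.tag) == "fullName":
                yield child


def count_organisms(root):
    """Count organism fullName values, e.g. how often 'Mammalia' occurs."""
    return Counter((el.text or "").strip() for el in organism_full_names(root))


def replace_organism(root, old="Mammalia", new="Homo sapiens"):
    """Rewrite matching organism fullName values in place; return how many changed."""
    changed = 0
    for el in organism_full_names(root):
        if (el.text or "").strip() == old:
            el.text = new
            changed += 1
    return changed


# Made-up sample data; real HPRD files are much larger.
SAMPLE = """<entrySet><entry><interactorList>
<proteinInteractor id="p1"><organism><names>
<shortLabel>mammalia</shortLabel><fullName>Mammalia</fullName>
</names></organism></proteinInteractor>
<proteinInteractor id="p2"><organism><names>
<shortLabel>human</shortLabel><fullName>Homo sapiens</fullName>
</names></organism></proteinInteractor>
</interactorList></entry></entrySet>"""

root = ET.fromstring(SAMPLE)
before = count_organisms(root)    # tally before any change
changed = replace_organism(root)  # number of fullName values rewritten
after = count_organisms(root)     # tally after the replacement
```

The replacement only touches fullName values inside organism elements, so protein names elsewhere in the file are left alone. It does not change any taxonomy identifier stored with the organism, and it is only appropriate if every 'Mammalia' protein really is human, which is the open question raised above.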

Another thing to notice about HPRD's individual "PSI-MI" files is that they begin with various blocks, such as protein and interaction, and the PSI entrySet section only starts somewhere in the middle of the file. BIO 13:33, 23 January 2006 (EST)
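One possible way to get at the entrySet section is to cut it out of the file text before parsing. The following Python sketch assumes that the section appears exactly once and is delimited by literal, unprefixed entrySet tags; the function name extract_entry_set and the sample text are invented for illustration.

```python
import re

# Non-greedy match from the opening entrySet tag to its closing tag.
ENTRY_SET = re.compile(r"<entrySet\b.*?</entrySet\s*>", re.DOTALL)


def extract_entry_set(text):
    """Return the first <entrySet>...</entrySet> span in text, or None."""
    match = ENTRY_SET.search(text)
    return match.group(0) if match else None


# Made-up sample mimicking a file whose entrySet starts in the middle.
SAMPLE = """<protein>leading block</protein>
<interaction>another leading block</interaction>
<entrySet level="1" version="1">
  <entry/>
</entrySet>
trailing text"""

section = extract_entry_set(SAMPLE)
```

A regular expression is used here rather than an XML parser because a file that starts with several top-level blocks may not be well-formed as a whole.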

HPRD PSI-XML Example

article1 article2

References

<biblio>
#article1 pmid=16381900
#article2 pmid=14681466
</biblio>

IntAct

PSI XML from IntAct has occasional errors: the fullName of the organism of some interacting proteins is absent when the shortLabel of the organism is 'in vitro'. These are usually short peptides. Because fullName is used as the source of species information, Bio::Species objects can't be constructed for them. In one file I examined there were only a few proteins like this, and they could be corrected by hand (BIO 18:16, 31 December 2005 (EST)).

Example: bovine_small.xml
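To find the proteins that need correcting by hand, one could list every interactor whose organism lacks a fullName. The Python sketch below is an illustration only, not part of BioPerl. It assumes that interacting proteins are proteinInteractor elements carrying an id attribute, with organism, shortLabel and fullName elements below them. The helper names, the placeholder namespace and the sample data are invented.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip a leading '{namespace}' from an element tag, if present."""
    return tag.rsplit("}", 1)[-1]


def find_text(element, name):
    """Return the stripped text of the first descendant called name, or ''."""
    for child in element.iter():
        if local_name(child.tag) == name:
            return (child.text or "").strip()
    return ""


def interactors_missing_full_name(root):
    """List (interactor id, organism shortLabel) pairs lacking an organism fullName."""
    missing = []
    for interactor in root.iter():
        if local_name(interactor.tag) != "proteinInteractor":
            continue
        for organism in interactor.iter():
            if local_name(organism.tag) != "organism":
                continue
            # An absent or empty fullName both count as missing.
            if not find_text(organism, "fullName"):
                missing.append(
                    (interactor.get("id"), find_text(organism, "shortLabel"))
                )
    return missing


# Made-up sample; the namespace URI is a placeholder, not the real one.
SAMPLE = """<entrySet xmlns="urn:example:placeholder"><entry><interactorList>
<proteinInteractor id="peptide-1"><organism><names>
<shortLabel>in vitro</shortLabel>
</names></organism></proteinInteractor>
<proteinInteractor id="protein-2"><organism><names>
<shortLabel>bovin</shortLabel><fullName>Bos taurus</fullName>
</names></organism></proteinInteractor>
</interactorList></entry></entrySet>"""

missing = interactors_missing_full_name(ET.fromstring(SAMPLE))
```

Matching on local element names keeps the check independent of whichever namespace a given file declares.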

MINT

PSI XML from MINT has occasional errors: the fullName of the organism of some interacting proteins is absent. BIO 13:46, 2 October 2006 (EDT)
