07 December 2005

Semantic Abstracts.

Many papers about text mining trying to extract the semantic value of pubmed the abstracts can now be found in the litterature.
I wish publishers could join a semantic abstract (RDF based ?) to their classic abstract, that would be an incredible (and safe) source of knowledge for "systems biology". I imagine the following paper:


Piron M, Vende P, Cohen J, Poncet D.
Rotavirus RNA-binding protein NSP3 interacts with eIF4GI and evicts the poly(A) binding protein from eIF4F.


would be turned into something like that...



<rdf:RDF (...)>
<bio:Protein
rdf:about='http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?db=protein&val=77120248'
ncbi:taxon='http://www.ncbi.nlm.nih.gov/htbin-post/Taxonomy/wgetorg?mode=Info&id=10923'
>
<rdfs:label>NSP3</rdfs:label>
<rdfs:description>Rotavirus Non Structural Protein 3</rdfs:description>
</bio:Protein>

<bio:Protein
rdf:about='http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?db=protein&val=38201627'
ncbi:taxon='http://www.ncbi.nlm.nih.gov/htbin-post/Taxonomy/wgetorg?mode=Info&id=9606'
>
<rdfs:label>EIF4G</rdfs:label>
<rdfs:description>eukaryotic translation initiation factor 4 gamma</rdfs:description>
</bio:Protein>

<bio:interaction rdf:ID='NSP3-EIF4G'>
<bio:entity rdf:resource='http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?db=protein&val=77120248'/>
<bio:entity rdf:resource='http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?db=protein&val=38201627'/>
<bio:method rdf:resource='#2YH'/>
<bio:reference rdf:resource='#paper'>
</bio:interaction>


<bio:Paper rdf:ID='paper'>
<dc:title>Rotavirus RNA-binding protein NSP3 interacts with eIF4GI and evicts the poly(A) binding protein from eIF4F</dc:title>
<rdf:seeAlso rdf:resource='http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=9755181'/>
</bio:Paper>

<bio:Method rdf:ID='2YH'>
<rdfs:label>2YH</rdfs:label>
<rdfs:description>Yeast two hybrid system</rdfs:description>
</bio:Method>

(...)
</rdf:RDF>




Please Mr Nature, Springer, Elsevier :-) ....

No comments: