The XBabelPhish MAGE-ML and XML translator

被引:2
作者
Maier, Don [1 ]
Wymore, Farrell [1 ]
Sherlock, Gavin [2 ]
Ball, Catherine A. [1 ]
机构
[1] Stanford Univ, Sch Med, Dept Biochem, Stanford, CA 94305 USA
[2] Stanford Univ, Sch Med, Dept Genet, Stanford, CA 94305 USA
关键词
Semantic Variant; Translation Step; Context Query; Document Domain; XQuery Engine;
D O I
10.1186/1471-2105-9-28
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: MAGE-ML has been promoted as a standard format for describing microarray experiments and the data they produce. Two characteristics of the MAGE-ML format compromise its use as a universal standard: First, MAGE-ML files are exceptionally large - too large to be easily read by most people, and often too large to be read by most software programs. Second, the MAGE-ML standard permits many ways of representing the same information. As a result, different producers of MAGE-ML create different documents describing the same experiment and its data. Recognizing all the variants is an unwieldy software engineering task, resulting in software packages that can read and process MAGE-ML from some, but not all producers. This Tower of MAGE-ML Babel bars the unencumbered exchange of microarray experiment descriptions couched in MAGE-ML. Results: We have developed XBabelPhish - an XQuery-based technology for translating one MAGE-ML variant into another. XBabelPhish's use is not restricted to translating MAGE-ML documents. It can transform XML files independent of their DTD, XML schema, or semantic content. Moreover, it is designed to work on very large (> 200 Mb.) files, which are common in the world of MAGE-ML. Conclusion: XBabelPhish provides a way to inter-translate MAGE-ML variants for improved interchange of microarray experiment information. More generally, it can be used to transform most XML files, including very large ones that exceed the capacity of most XML tools.
引用
收藏
页数:12
相关论文
共 10 条
  • [1] *GAL, XQUER IMPL DISCR HAK
  • [2] JAMES C, TRANG MULTIFORMAT SC
  • [3] JANOS D, 2007, NUCLEIC ACIDS RES, pD766
  • [4] MICHAEL K, BLOOMING FLWOR INTRO
  • [5] ArrayExpress - a public database of microarray experiments and gene expression profiles
    Parkinson, H.
    Kapushesky, M.
    Shojatalab, M.
    Abeygunawardena, N.
    Coulson, R.
    Farne, A.
    Holloway, E.
    Kolesnykov, N.
    Lilja, P.
    Lukk, M.
    Mani, R.
    Rayner, T.
    Sharma, A.
    William, E.
    Sarkans, U.
    Brazma, A.
    [J]. NUCLEIC ACIDS RESEARCH, 2007, 35 : D747 - D750
  • [6] PAUL TS, 2002, GENOME BIOL, V3
  • [7] *SHREX, COMP SOL XML REL MAP
  • [8] *XPATH, 1999, VERS 1 0 W3C REC
  • [9] *XSLT, 1999, VERS 1 0 W3C REC
  • [10] 2007, XQUERY 1 0 XML QUERY