Skip to main content

Table 1 Major features and their overall levels of simulation platforms to support provenance

From: A survey of simulation provenance systems: modeling, capturing, querying, visualization, and advanced utilization

Platforms Application domain Provenance subject Preservation (where to record) Modeling (how to represent) Querying (how to access) Visualization (how to show) Result reuse Reproduction Data mining
(Support level = [high \(\vert\) medium \(\vert\) low])
EDISON [3] Scientific simulation (domain-neutral) Simulation RDBMS Table (med) SQL (high) None (low) Limited (Manual) (med) Possible (high) None (low)
myGrid [90] Biology/bioinformatics Workflow RDBMS XML/HTML/RDF (med) SQL (high) Graph (high) Limited (Manual) (med) Possible (high) None (low)
Taverna [14] Biology/bioinformatics Workflow RDBMS XML (Scufl)+RDF (med) SQL (high) Graph (high) Unknown (low) Unknown (low) None (low)
Chimera [91] Physics/astronomy Workflow RDBMS VDL (med) SQL+VDL (high) Graph (high) Unknown (low) Unknown (low) Unknown (low)
CMCS [92] Chemistry Workflow File system XML/RDF (med) Via a browser (low) Graph (high) Unknown (low) Unknown (low) None (low)
PASOA [93,94,95,96,97,98] e-Science Service Memory/RDBMS/file system Unknown (low) Java-based API/XQuery (high) Unknown (low) Unknown (med) Possible (high) None (low)
ESSW [99] Earth science Workflow RDBMS XML (med) SQL (high) Graph (high) Unknown (low) Unknown (low) Unknown (low)
Kepler [16] General science Workflow File system MoML (XML) (med) File search (low) Unknown (low) Unknown (low) Possible (high) Unknown (low)
Kepler Distributed Provenance Framework [100] Kepler-extension Workflow based on MapReduce RDBMS (MySQL) Table (med) API/SQL (high) Unknown (low) Unknown (low) Unknown (low) Unknown (low)
RAMP [101] Distributed system Workflow based on MapReduce File system/key-value store File (low) API (med) Unknown (low) None (low) None (low) Unknown (low)
HadoopProv [102] Distributed system Workflow based on MapReduce File system/key-value store File (low) API (med) Graph (high) None (low) None (low) Unknown (med
Pig Lipstick [103] Distributed system Workflow based on MapReduce Pig latin OPM (high) Graph-based API (med) Graph (high) Unknown (low) Unknown (low) Unknown (low)
Karma [104] Weather forcast Dynamic workflow RDBMS XML (med) SQL (high) Graph (high) Unknown (low) Unknown (low) Unknown (low)
Pegasus [105,106,107] Distributed system Workflow VDS/RDBMS OWL (med) SPARQL/SQL (high) Unknown (low) Unknown (low) Unknown (low) Unknown (low)
REDUX [108] Windows system Workflow RDBMS Table (med) SQL (high) Unknown (low) Unknown (low) Possible (high) Unknown (low)
Swift [109,110,111] Distributed system Workflow File system File (low) File Search (low) Unknown (low) Unknown (low) Unknown (low) Unknown (low)
VisTrails [11, 12, 66] Computing system Workflow RDBMS Python Object (XML/Table) (med) System-defined query (med) Graph (high) Unknown (low) Unknown (low) Unknown (low)
PASS [112] Computing system Linux process Berkeley DB File (low) Graph-based query (nq) (med) Graph (high) None (low) None (low) Unknown (low)
ES3 [108] Computing system Linux process DBMS XML (med) SQL (high) Graph (high) None (low) None (low) Unknown (low)
CloudProv [113] Cloud system Real-time application Database File (low) API (med) Unknown (low) Unknown (low) Unknown (low) Unknown (low)
Milieu [114] Scientific simulation Workflow Database Table (med) SQL (high) Unknown (low) Unknown (low) Possible (high) Unknown (low)
Sumatra [115, 116] Scientific simulation Program File system (CSV) File (low) File search (low) Possible (med) Unknown (low) Possible (high) Unknown (low)
e-Science Central [117, 118] Scientific simulation Workflow/scientific data PaaS OPM (high) Web-based interface (low) Graph (high) Possible (med) Possible (high) None (low)