Skip to main content

Table 1 Major features and their overall levels of simulation platforms to support provenance

From: A survey of simulation provenance systems: modeling, capturing, querying, visualization, and advanced utilization

Platforms

Application domain

Provenance subject

Preservation (where to record)

Modeling (how to represent)

Querying (how to access)

Visualization (how to show)

Result reuse

Reproduction

Data mining

(Support level = [high \(\vert\) medium \(\vert\) low])

EDISON [3]

Scientific simulation (domain-neutral)

Simulation

RDBMS

Table (med)

SQL (high)

None (low)

Limited (Manual) (med)

Possible (high)

None (low)

myGrid [90]

Biology/bioinformatics

Workflow

RDBMS

XML/HTML/RDF (med)

SQL (high)

Graph (high)

Limited (Manual) (med)

Possible (high)

None (low)

Taverna [14]

Biology/bioinformatics

Workflow

RDBMS

XML (Scufl)+RDF (med)

SQL (high)

Graph (high)

Unknown (low)

Unknown (low)

None (low)

Chimera [91]

Physics/astronomy

Workflow

RDBMS

VDL (med)

SQL+VDL (high)

Graph (high)

Unknown (low)

Unknown (low)

Unknown (low)

CMCS [92]

Chemistry

Workflow

File system

XML/RDF (med)

Via a browser (low)

Graph (high)

Unknown (low)

Unknown (low)

None (low)

PASOA [93,94,95,96,97,98]

e-Science

Service

Memory/RDBMS/file system

Unknown (low)

Java-based API/XQuery (high)

Unknown (low)

Unknown (med)

Possible (high)

None (low)

ESSW [99]

Earth science

Workflow

RDBMS

XML (med)

SQL (high)

Graph (high)

Unknown (low)

Unknown (low)

Unknown (low)

Kepler [16]

General science

Workflow

File system

MoML (XML) (med)

File search (low)

Unknown (low)

Unknown (low)

Possible (high)

Unknown (low)

Kepler Distributed Provenance Framework [100]

Kepler-extension

Workflow based on MapReduce

RDBMS (MySQL)

Table (med)

API/SQL (high)

Unknown (low)

Unknown (low)

Unknown (low)

Unknown (low)

RAMP [101]

Distributed system

Workflow based on MapReduce

File system/key-value store

File (low)

API (med)

Unknown (low)

None (low)

None (low)

Unknown (low)

HadoopProv [102]

Distributed system

Workflow based on MapReduce

File system/key-value store

File (low)

API (med)

Graph (high)

None (low)

None (low)

Unknown (med

Pig Lipstick [103]

Distributed system

Workflow based on MapReduce

Pig latin

OPM (high)

Graph-based API (med)

Graph (high)

Unknown (low)

Unknown (low)

Unknown (low)

Karma [104]

Weather forcast

Dynamic workflow

RDBMS

XML (med)

SQL (high)

Graph (high)

Unknown (low)

Unknown (low)

Unknown (low)

Pegasus [105,106,107]

Distributed system

Workflow

VDS/RDBMS

OWL (med)

SPARQL/SQL (high)

Unknown (low)

Unknown (low)

Unknown (low)

Unknown (low)

REDUX [108]

Windows system

Workflow

RDBMS

Table (med)

SQL (high)

Unknown (low)

Unknown (low)

Possible (high)

Unknown (low)

Swift [109,110,111]

Distributed system

Workflow

File system

File (low)

File Search (low)

Unknown (low)

Unknown (low)

Unknown (low)

Unknown (low)

VisTrails [11, 12, 66]

Computing system

Workflow

RDBMS

Python Object (XML/Table) (med)

System-defined query (med)

Graph (high)

Unknown (low)

Unknown (low)

Unknown (low)

PASS [112]

Computing system

Linux process

Berkeley DB

File (low)

Graph-based query (nq) (med)

Graph (high)

None (low)

None (low)

Unknown (low)

ES3 [108]

Computing system

Linux process

DBMS

XML (med)

SQL (high)

Graph (high)

None (low)

None (low)

Unknown (low)

CloudProv [113]

Cloud system

Real-time application

Database

File (low)

API (med)

Unknown (low)

Unknown (low)

Unknown (low)

Unknown (low)

Milieu [114]

Scientific simulation

Workflow

Database

Table (med)

SQL (high)

Unknown (low)

Unknown (low)

Possible (high)

Unknown (low)

Sumatra [115, 116]

Scientific simulation

Program

File system

(CSV) File (low)

File search (low)

Possible (med)

Unknown (low)

Possible (high)

Unknown (low)

e-Science Central [117, 118]

Scientific simulation

Workflow/scientific data

PaaS

OPM (high)

Web-based interface (low)

Graph (high)

Possible (med)

Possible (high)

None (low)