Datasets
16 datasets spanning random graphs, scientific workflows, and IoT applications.
Random Graphs
Synthetically generated task graphs with controlled structure
In-Trees
RandomSource: SAGA random generator
Randomly generated task graphs with a converging (fan-in) tree structure. Tasks flow from multiple sources to a single sink. Generated with 2-4 levels and branching factor 2-3.
Out-Trees
RandomSource: SAGA random generator
Randomly generated task graphs with a diverging (fan-out) tree structure. Tasks flow from a single source to multiple sinks. Generated with 2-4 levels and branching factor 2-3.
Parallel Chains
RandomSource: SAGA random generator
Task graphs consisting of multiple parallel chains connected by a super-source and super-sink. Generated with 2-5 chains of length 2-5.
Scientific Workflows (WFCommons)
Real-world scientific application workflows
BLAST
WFCommonsSource: WFCommons
Basic Local Alignment Search Tool workflows for sequence alignment. One of the most widely-used bioinformatics tools.
BWA
WFCommonsSource: WFCommons
Burrows-Wheeler Aligner workflows for mapping sequencing reads to a reference genome.
Cycles
WFCommonsSource: WFCommons
Agroecosystem modeling workflows that simulate crop growth and soil processes.
Epigenomics
WFCommonsSource: WFCommons
Epigenomics analysis workflows for studying chemical modifications to DNA and histone proteins.
1000Genome
WFCommonsSource: WFCommons
Genomics workflows from the 1000 Genomes Project for identifying genetic variants across human populations.
Montage
WFCommonsSource: WFCommons
Astronomical image mosaicking workflows that combine multiple sky survey images into seamless composites.
Seismology
WFCommonsSource: WFCommons
Seismic data processing workflows for analyzing earthquake data.
SoyKB
WFCommonsSource: WFCommons
Soybean Knowledge Base workflows for genomic analysis of soybean varieties.
SRASearch
WFCommonsSource: WFCommons
Sequence Read Archive search workflows for querying large genomic databases.
IoT/Streaming (RIOTBench)
IoT data processing workflows on edge-fog-cloud networks
ETL
RIOTBenchSource: RIOTBench
Extract-Transform-Load workflows for IoT data processing in edge/fog computing environments.
Predict
RIOTBenchSource: RIOTBench
Machine learning prediction workflows for real-time IoT data inference pipelines.
Stats
RIOTBenchSource: RIOTBench
Statistical analysis workflows for IoT sensor data streams.
Train
RIOTBenchSource: RIOTBench
Machine learning training workflows for IoT data across edge, fog, and cloud nodes.