Open Causal (Beta)

Overview

The Open Causal platform proposes a standardized JSON format for sharing and exchanging causal graphs of all types, including DAGs, ADMGs, and more. This data standard is designed to capture the essential components of causal graphs, including nodes, edges, and metadata, in a structured and machine-readable format. This data standard enables interoperability across different tools, platforms, and research workflows, making causal knowledge more accessible and machine-readable.

Schema Specification (version 0.1)

The complete JSON schema for specifying causal graphs includes metadata, node definitions, and edge definitions. Below is the full structure with all possible fields:

Complete JSON Structure


{
  "graph": {
    "id": "unique-identifier",
    "date": "2024-01-01T00:00:00Z",
    "title": "Graph Title",
    "description": "A description of the causal graph",
    "creator": {
      "name": "Creator Name",
      "orcid": "0000-0000-0000-0000"
    },
    "contributors": [
      {
        "name": "Contributor Name",
        "orcid": "0000-0000-0000-0001",
        "role": "author"
      }
    ],
    "licence": "cc-by-4.0",
    "type": "dag",
    "cyclicity": "acyclic",
    "related_publication": [
      {
        "doi": "10.1234/example",
        "title": "Related Publication Title",
        "year": 2024
      }
    ],
    "statistical_unit": "individual"
  },
  "nodes": [
    {
      "id": "node1",
      "name": "Treatment",
      "description": "The treatment or exposure variable",
      "position": {"x": 100, "y": 150},
      "causal_role": "exposure",
      "variable_type": "continuous",
      "time_point": "baseline",
      "structural_model": "linear",
      "ontology_ids": ["ONTOLOGY:12345"]
    },
    {
      "id": "node2",
      "name": "Outcome",
      "description": "The primary outcome",
      "position": {"x": 300, "y": 150},
      "causal_role": "outcome",
      "variable_type": "binary"
    }
  ],
  "edges": [
    {
      "source": "node1",
      "target": "node2",
      "type": "directed",
      "position": {"curve": 0.5},
      "sign": "positive"
    }
  ]
}

Field Definitions

Graph Object

Field	Type	Required	Description
id	string	Yes	Unique identifier for the graph. Should be URL-safe and immutable.
date	ISO 8601 datetime	Yes	Creation or last modified date of the graph in ISO 8601 format.
title	string	Yes	Human-readable title of the causal graph.
description	string	No	Detailed description of the graph, its context, and assumptions.
creator	object	Yes	Person who created the graph. Contains 'name' (required) and 'orcid' (optional).
contributors	array of objects	No	List of people who contributed to the graph with their roles.
licence	string	Yes	License under which the graph is distributed (e.g., 'CC BY 4.0', 'CC BY-SA 4.0').
type	enum	Yes	Graph type: 'dag' (Directed Acyclic Graph), 'cyclic' (with cycles), 'undirected', 'mixed'.
cyclicity	enum	Yes	Cyclicity status: 'acyclic', 'cyclic', or 'unknown'.
related_publication	array of objects	No	List of publications that discuss or use this causal graph.
statistical_unit	string	No	Unit of analysis for the graph (e.g., 'individual', 'household', 'population', 'time-series').

Nodes Array

Field	Type	Required	Description
id	string	Yes	Unique identifier for the node within the graph.
name	string	Yes	Human-readable name of the variable or concept.
description	string	No	Detailed description of what the node represents.
position	object {x, y}	No	X and Y coordinates for visualization. Allows tools to preserve layout.
causal_role	enum	No	Role in the causal structure: 'exposure', 'outcome', 'confounder', 'mediator', 'collider', 'other'.
variable_type	enum	No	Data type: 'binary', 'categorical', 'continuous', 'count', 'survival', 'other'.
time_point	string	No	Time point at which the variable is measured (e.g., 'baseline', 'month_6', 'follow-up').
structural_model	string	No	Assumed structural model: 'linear', 'nonlinear', 'multiplicative', 'threshold', 'other'.
ontology_ids	array of strings	No	References to ontologies (e.g., SNOMED-CT, UMLS, MeSH) for standardized variable definitions.

Edges Array

Field	Type	Required	Description
source	string	Yes	ID of the source (from) node.
target	string	Yes	ID of the target (to) node.
type	enum	Yes	Edge type: 'directed', 'undirected', 'bidirected', 'dashed' (unknown direction).
position	object	No	Visualization metadata for edge layout (e.g., curve parameter for curved edges).
sign	enum	No	Direction of effect: 'positive' (increases), 'negative' (decreases), 'unknown'.

Example:

{
  "graph": {
    "id": "12345678",
    "date": "2026-04-13T00:00:00Z",
    "title": "Ice Cream Sales and Drowning Deaths",
    "description": "A DAG showing that ice cream sales and drowning deaths are correlated due to a common cause (warm weather), not a direct causal relationship.",
    "creator": {
      "name": "Abdullah Ademoğlu",
      "orcid": "0000-0000-0000-0000"
    },
    "licence": "cc-by-4.0",
    "type": "dag",
    "cyclicity": "acyclic",
    "statistical_unit": "population"
  },
  "nodes": [
    {
      "id": "weather",
      "name": "Warm Weather",
      "description": "Temperature and seasonality (summer months)",
      "position": {"x": 50, "y": 50},
      "causal_role": "confounder",
      "variable_type": "continuous"
    },
    {
      "id": "ice_cream",
      "name": "Ice Cream Sales",
      "description": "Weekly ice cream sales in units sold",
      "position": {"x": 200, "y": 150},
      "causal_role": "exposure",
      "variable_type": "continuous"
    },
    {
      "id": "drowning",
      "name": "Drowning Deaths",
      "description": "Weekly drowning fatalities",
      "position": {"x": 350, "y": 150},
      "causal_role": "outcome",
      "variable_type": "count"
    }
  ],
  "edges": [
    {
      "source": "weather",
      "target": "ice_cream",
      "type": "directed",
      "sign": "positive"
    },
    {
      "source": "weather",
      "target": "drowning",
      "type": "directed",
      "sign": "positive"
    },
    {
      "source": "ice_cream",
      "target": "drowning",
      "type": "dashed"
    }
  ]
}

Contributing

We welcome contributions from the research community to improve and extend this standard. If you have suggestions, feedback, or would like to contribute to the development of this data standard, please reach out to us.

Share & Discuss — Share your causal assumptions and get feedback from the community early in your study.
Citeable — Every graph will be assigned a Digital Object Identifier (DOI), to make them citable and trackable.
Versioning — Easily update your graph and publish a new version.
Findable — Search among thousands of nodes and edges from hundreds of graphs.
Accessible — Import a graph object directly into your favorite statistical package in Python or R.
Reusable — Build on top of existing causal knowledge. Don't start from scratch.

Open Causal Graph Data Standard