Elliptic Graph ML for Illicit Transaction Detection

Python PyTorch Geometric XGBoost Logistic regression TensorBoard Streamlit Calibration analysis

Source code is private. The public case study and article focus on the methodology, evaluation design, and operational tradeoffs.

Elliptic Graph ML for Illicit Transaction Detection supporting evidence

Business context

Fraud and compliance teams need to identify risky transaction flows early, but graph-based financial datasets are easy to evaluate incorrectly because of time leakage and unrealistic thresholds. This project focused on building a defensible detection workflow, not just chasing a strong offline score.

Outcome

Compares feature-only baselines against GCN, GraphSAGE, and GAT models.
Uses temporal splits, calibration, robustness checks, Precision@K, PR-AUC by timestep, and workload curves.
Documents SAGE-ResBN configurations as the strongest-performing setup in the repo’s experiments.
Includes interpretability and ensemble-analysis paths for both baseline and graph models.

Key decisions

Prioritized leakage-safe temporal splits before model tuning.
Benchmarked simpler baselines before claiming graph-model gains.
Used precision-at-investigation-budget style metrics instead of relying only on ROC-AUC.
Added calibration, robustness, and hub-ablation checks to test operational stability.

System design

Raw Elliptic CSVs are transformed into a processed graph artifact, then passed through baseline and GNN training pipelines. Analysis modules handle by-time drift, calibration, workload curves, bootstrap comparison, robustness checks, explanations, and optional ensembling.

Stack

Python, PyTorch Geometric, XGBoost, and logistic-regression baselines
TensorBoard and Streamlit for experiment presentation
Graph preprocessing, temporal evaluation, calibration, and explainability tooling

Elliptic Graph ML for Illicit Transaction Detection

Business context

Outcome

Key decisions

System design

Stack

Wes Lee

Error

Business context

Outcome

Key decisions

System design

Stack

Templates:

Error