Optimizing Communication Efficiency and Latency in Blockchain-Enabled Federated Learning Systems

Team

Supervisors

Table of Contents

  1. Abstract
  2. Research Problem
  3. Objectives
  4. Methodology
  5. System Architecture
  6. Experimental Setup and Implementation
  7. Results and Analysis
  8. Impact and Limitations
  9. Conclusion
  10. Publications
  11. Links

Abstract

Credit card fraud detection increasingly requires collaborative learning across multiple financial institutions in order to capture fraud patterns that are not visible within isolated organizational datasets. However, direct sharing of raw financial transaction data is restricted by privacy regulations, institutional policies, and security concerns. Federated Learning (FL) provides a privacy-preserving alternative by enabling institutions to train models collaboratively without exchanging raw data. Nevertheless, conventional blockchain-enabled federated learning systems often suffer from high communication overhead, increased end-to-end latency, and limited scalability when deployed in realistic multi-institution settings.

This project proposes a Hierarchical Collaborative Federated Learning (HCFL) framework for credit card fraud detection, designed to improve communication efficiency, reduce latency, and enhance trust in collaborative training. The framework organizes participants into institutional clusters, where branch-level model updates are first aggregated at headquarters (HQ) nodes before being coordinated globally. To provide decentralized trust and verifiability, the framework integrates Hyperledger Fabric as a permissioned blockchain layer. To reduce blockchain storage overhead, model artifacts are stored off-chain using IPFS, while only hashes, content identifiers (CIDs), validation outcomes, and trust metadata are recorded on-chain.

The system also incorporates resilience and optimization mechanisms, including deadline-aware collection, validation-based filtering, trust-weighted aggregation, backup failover handling, and lightweight blockchain verification. Experimental results show that the proposed architecture achieves strong fraud detection performance while maintaining low communication cost and acceptable end-to-end latency in a simulated multi-institution environment.


Research Problem

Fraud detection models trained independently within a single financial institution often fail to capture cross-institution fraud patterns. Although federated learning allows collaborative model training without sharing raw data, existing approaches still face several limitations:

This project addresses the challenge of designing a trustworthy, efficient, and scalable collaborative fraud detection framework that preserves privacy while reducing communication overhead and latency.


Objectives

The main objective of this research is to design and evaluate a blockchain-enabled hierarchical federated learning framework for collaborative credit card fraud detection.

The specific objectives are:


Methodology

The project follows a staged methodology to separately validate the learning framework and the blockchain-integrated system.

Stage 1: Hierarchical Federated Learning Core

In the first stage, the core fraud detection pipeline is developed and validated independently of the blockchain layer. This includes:

This stage ensures that the learning framework is numerically correct, robust under non-IID data, and effective for fraud detection before introducing distributed blockchain overhead.

Stage 2: Blockchain-Enabled HCFL

In the second stage, the HCFL pipeline is integrated with:

This stage evaluates the full distributed system under realistic coordination, verification, and infrastructure constraints.


System Architecture

The proposed architecture follows a two-tier hierarchical design.

Tier 1 – Institutional / Intra-Cluster Layer

Each participating financial institution is treated as an independent cluster. A cluster contains:

Branch nodes perform local model training on private transaction datasets. The HQ collects branch updates and generates a cluster-level aggregated model. The Backup node remains synchronized with the HQ and can take over in the event of HQ failure, reducing the risk of a single point of failure.

Tier 2 – Blockchain / Inter-Cluster Layer

The second tier consists of a permissioned blockchain network connecting only the HQ nodes of participating institutions. This layer is responsible for:

By limiting blockchain participation to HQ nodes instead of all branch nodes, the architecture significantly reduces consensus complexity, communication overhead, and latency.

Off-Chain Storage

To avoid storing large model files directly on-chain, model artifacts are uploaded to IPFS, and only compact metadata such as CID, hash, validation status, and trust information are stored on the blockchain ledger.


Experimental Setup and Implementation

The framework is implemented using the following technologies:

Dataset

Experiments use the Credit Card Fraud Detection dataset, partitioned across multiple simulated institutions and branches. The training setup reflects a realistic non-IID multi-institution environment, where each branch has access only to its local transaction subset.

Federation Setup

The experimental setup consists of:

Key Implemented Components


Results and Analysis

The proposed HCFL framework was evaluated using both machine learning and system-level performance metrics.

Model Performance

The framework achieved strong fraud detection capability on the held-out test data, with results including:

These results indicate that the hierarchical and trust-aware aggregation strategy is effective even in a distributed multi-institution setting.

System Efficiency

System-level evaluation demonstrated that the proposed design can support collaborative model training with manageable overhead:

The hierarchical structure reduced the number of global submissions, while IPFS minimized blockchain payload size by storing only references and integrity metadata on-chain.

Robustness Validation

The framework was also tested under adverse system conditions, including:

These experiments confirmed that the system remains operational and verifiable under fault and attack scenarios.


Impact and Limitations

Impact

This research contributes to the growing field of secure and scalable collaborative AI for finance by demonstrating that:

The proposed framework is relevant not only to credit card fraud detection, but also to other collaborative learning settings in regulated domains such as healthcare, insurance, and cybersecurity.

Limitations

Despite promising results, several limitations remain:

These limitations provide directions for future work.


Conclusion

This project presented a Hierarchical Collaborative Federated Learning (HCFL) framework for optimizing communication efficiency and latency in blockchain-enabled federated learning systems for credit card fraud detection. The framework combines hierarchical aggregation, trust-aware global coordination, Hyperledger Fabric–based verification, and IPFS-backed off-chain storage to support privacy-preserving and verifiable multi-institution learning.

The experimental results demonstrate that the proposed architecture can achieve strong fraud detection performance while maintaining low communication overhead and reasonable latency. The framework also improves resilience through validation filtering, trust scoring, and failover support. Overall, this work shows that blockchain-enabled federated learning can be made more practical for financial fraud detection through careful hierarchical design and communication-aware optimization.


Publications