High-Performance Matrix Exponential in C

This repository contains a highly optimized C implementation of the matrix exponential algorithm introduced by Awad H. Al-Mohy and Nicholas J. Higham. Designed for high performance, it employs advanced optimizations such as blocking, vectorization, and LU decomposition to deliver results competitive with state-of-the-art tools like SciPy.

Authors

Project Overview

Developed as part of the Advanced System Lab course at ETH Zürich (2023), this project tackles computational challenges in matrix exponential calculation using advanced optimizations. For detailed results, see our report.

Key Features

Algorithm: Al-Mohy & Higham's scaling and squaring method.
Optimizations:
- Strength reduction and precomputation.
- Instruction-level parallelism (ILP) and loop unrolling.
- AVX2-based vectorization for key operations.
- Blocking techniques for efficient memory utilization.
- Transition from Gaussian elimination to LU decomposition for complex system solving.

Motivation

Efficient matrix exponential computation is crucial in various fields such as:

Differential equations (linear and partial).
Quantum mechanics.
Control theory.
Network analysis.

Existing implementations often suffer from overscaling issues or are not optimized for high-performance systems. This project provides a fully optimized C implementation to address these gaps.

Installation

To compile and run the project, ensure the following dependencies are available:

C Compiler: GCC 10.2.1 or later.
BLAS Library: Intel Math Kernel Library (MKL) or equivalent.
Intel Tools: For roofline analysis and performance evaluation.

Ensure that your environment is properly configured for AVX2 and MKL optimizations to achieve peak performance.

Results

This implementation achieves:

Runtime reduction: From ~5s to 850ms for $1024 \times 1024$ matrices.
Peak performance: 10 flops/cycle, nearing BLAS's dgemm peak of 14 flops/cycle.
Competitiveness: Matches SciPy's linalg.expm up to $512 \times 512$ matrices.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
data_collection		data_collection
docs		docs
git_hooks		git_hooks
plots		plots
src		src
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
all_in_one.py		all_in_one.py
init.sh		init.sh
requirements.txt		requirements.txt
solver.py		solver.py
test_runner.sh		test_runner.sh
test_runner_data_collection.sh		test_runner_data_collection.sh
test_runner_flop.sh		test_runner_flop.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

High-Performance Matrix Exponential in C

Authors

Project Overview

Key Features

Motivation

Installation

Results

About

Releases

Packages

Languages

LorenzoPaleari/ASL-Matrix-Exponential-C

Folders and files

Latest commit

History

Repository files navigation

High-Performance Matrix Exponential in C

Authors

Project Overview

Key Features

Motivation

Installation

Results

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages