Periodic Reporting for period 1 - DeepBMS (Deep Reinforcement Learning-Based Battery Management System for Electric Vehicles)
Période du rapport: 2023-03-15 au 2025-03-14
To address these issues, this project focuses on next-generation Battery Management Systems (BMSs), with a particular emphasis on their software components. Specifically, it targets State-of-X estimation algorithms, where X can represent either the State of Charge (SoC) or the State of Health (SoH). SoC indicates the remaining charge of the battery, while SoH reflects the battery’s health and provides insight into its remaining useful life. Both metrics are critical to the safety and performance of EVs.
The proposed system, DeepBMS, adopts a hybrid approach that combines model-based and data-driven methods using reinforcement learning. This aims to develop algorithms that are both accurate and data-efficient. Traditional model-based approaches often suffer from low fidelity due to dynamically changing battery operating conditions—such as temperature fluctuations, driving patterns, and aging effects. On the other hand, purely data-driven approaches are typically data-intensive, requiring extensive lab testing, and often lack interpretability.
DeepBMS uses reinforcement learning to augment physical battery models with a streamlined data-driven component that compensates for model uncertainties. This enables accurate SoC and SoH estimation with significantly reduced data requirements. Moreover, the system is designed to be adaptable, capable of learning from new battery conditions and incorporating this knowledge into the BMS software to maintain estimation accuracy, even as the battery ages. The work carried out in DeepBMS to develop these state estimators include the whole design cycle from battery cell testing and dataset generation, designing and training the AI agents, optimizing the AI, to deployment of the trained agents in embedded systems.
Algorithmic innovation: Two types of state algorithms are designed with distinct philosophies. In the first approach, a pure model-based algorithm based on an extended Kalman filter (EKF) for battery state estimation is proposed as baseline. A recursive least squares algorithm (RLS) is also augmented with the EKF to enable real-time update of the model parameters to account for dynamic operating conditions. In addition, to account for model uncertainties arising from aging, a reinforcement learning agent is trained to learn how to adjust the covariance matrices in the EKF algorithm to achieve minimum state estimation error as the battery age. The first approach yields a maximum SoC estimation error below 2% and a root mean square error (RMSE) below 1% when tested on a dynamic driving condition. The philosophy of second approach is based on introducing a neural network directly into the architecture of the EKF algorithm. This mechanism, called KalmanNet, learns to adjust the gain of the Kalman algorithm through a recurrent neural network (RNN) with input features designed directly from cell measurements and previous state estimates. KalmanNet similarly achieved promising state estimation results in particular for deeply aged cells very close to the end-of-life condition where state estimation error was maintained as low as ~1%.
Cell testing: Two types of battery cells were considered for development and testing of the algorithms. This includes lithium-ion batteries based on nickel manganese cobalt oxide (NMC) and Lithium Titanate Oxide (LTO). Different standard tests based on IEC62660 including capacity test, hybrid pulse power characteristic test (HPPC), OCV-SoC characteristic test, and dynamic test replicating real driving situations were fulfilled to collect the necessary data for algorithmic works.
In addition, a test plan was developed to explore embedded implementation of the trained AI agents considering low-cost DSP setups such as TMS320F28379 and the battery monitoring ICs BQ79616 both from TI families. For these implementations, discretization, simplification (network pruning), and quantization are fulfilled and effects on final integration SW are assessed. A second implementation strategy using a cloud-based digital twin considering ThingSpeak platform is implemented for hierarchical execution of trained agents where a heavy version loads on the cloud and a light agent is embedded onboard. This later solution offers a scalable and flexible tool for DeepBMS implementation.