AI to Enable Accurate Modelling of Data Storage System Performance

Researchers at the HSE Faculty of Computer Science have developed a new approach to modelling data storage systems based on generative machine learning models. This approach makes it possible to accurately predict the key performance characteristics of such systems under various conditions. The results have been published in IEEE Access.
Data storage systems play an important role in today’s digital world, as they are responsible for the safety and prompt availability of vast amounts of information. These systems consist of many components, including controllers, hard disk drives (HDDs), solid-state drives (SSDs), and cache memory, which work together to ensure fast and efficient operation. To achieve optimal performance, it is essential to accurately predict how these systems will behave in different scenarios, such as when the load on the system changes.
The approach proposed by the researchers relies on generative machine learning models and provides high-precision predictions of two key performance characteristics of a storage system: the number of input/output operations per second (IOPS) and latency.
The modelling includes two stages. First, the scientists collect data by measuring the system’s performance under various loads and configurations. These measurements are then used to train two generative models: the CatBoost regression model and the normalizing flow model. CatBoost works well with tabular data and can accurately predict the mean values of the performance metrics and their deviations. The normalizing flow model produces a complete distribution of possible outcomes, taking into account the uncertainty and variability in the data.
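The idea of predicting a distribution of outcomes rather than a single number can be illustrated with a deliberately simplified sketch. It is not the authors' CatBoost or normalizing flow model: as a stand-in, it fits an independent Gaussian (mean and standard deviation) to the IOPS measured for each configuration and then samples from it. The configuration format, the measurement values, and the function names are all hypothetical.

```python
import random
import statistics
from collections import defaultdict


def fit_gaussian_per_config(measurements):
    """Estimate (mean, stdev) of IOPS for every configuration.

    measurements: iterable of (config, iops) pairs, where config is any
    hashable description of the load (hypothetical format).
    """
    grouped = defaultdict(list)
    for config, iops in measurements:
        grouped[config].append(iops)
    # statistics.stdev needs at least two measurements per configuration.
    return {
        config: (statistics.mean(values), statistics.stdev(values))
        for config, values in grouped.items()
    }


def sample_iops(model, config, n, rng):
    """Draw n plausible IOPS values for a configuration seen in training."""
    mean, stdev = model[config]
    return [rng.gauss(mean, stdev) for _ in range(n)]


# Hypothetical measurements: config = (block size in KiB, queue depth).
measurements = [
    ((4, 8), 10100.0), ((4, 8), 9900.0), ((4, 8), 10000.0),
    ((64, 8), 2050.0), ((64, 8), 1950.0), ((64, 8), 2000.0),
]

model = fit_gaussian_per_config(measurements)
rng = random.Random(0)  # fixed seed so the sketch is reproducible
samples = sample_iops(model, (4, 8), 1000, rng)
```

Unlike this lookup-style sketch, a learned model such as those described above can generalise to configurations that were never measured, and a normalizing flow is not restricted to Gaussian shapes.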
‘One of the main advantages of our method is that it does not require detailed knowledge of the internal structure of the system components. This is often impossible due to the manufacturers’ trade secrets. Instead, our generative models are trained directly on real-world data. For instance, in our study, we trained a model using 300,000 measurements. This makes our approach versatile and applicable to any type of data storage system,’ says study author Mikhail Hushchyn, a senior research fellow at the HSE Faculty of Computer Science.
The researchers tested the accuracy of the proposed approach using Little's law, a fundamental principle of queuing theory. According to the test results, the models' predictions are highly consistent with real observations: prediction errors range from 4% to 10% for IOPS and from 3% to 16% for latency, while the correlation with the observed values reaches 0.99.
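Little's law states that the mean number of requests in a system equals the arrival rate multiplied by the mean time a request spends in the system. For a storage system under steady load, this means that IOPS multiplied by latency should match the number of outstanding requests. The sketch below shows one plausible form of such a consistency check. The article does not describe the exact procedure used in the study, and all numbers and names here are hypothetical.

```python
def littles_law_queue_depth(iops, latency_ms):
    """Mean number of requests in flight implied by Little's law (L = lambda * W)."""
    return iops * (latency_ms / 1000.0)


def relative_deviation(implied, reference):
    """Relative difference between the implied and the reference value."""
    return abs(implied - reference) / reference


# Hypothetical example: the load generator keeps 32 requests outstanding,
# and a model predicts the following IOPS and mean latency for that load.
configured_queue_depth = 32
predicted_iops = 41_000.0
predicted_latency_ms = 0.8

implied_depth = littles_law_queue_depth(predicted_iops, predicted_latency_ms)
deviation = relative_deviation(implied_depth, configured_queue_depth)

print(f"implied queue depth: {implied_depth:.2f}")
print(f"deviation from configured depth: {deviation:.1%}")
```

A small deviation suggests that the predicted IOPS and latency are mutually consistent, even though neither prediction was explicitly constrained to satisfy the law.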
‘Our proposed approach opens up broad prospects for optimising and planning the operation of data centres. It makes it possible to predict the behaviour of the system amid load changes, identify potential performance issues, and optimise power consumption. Furthermore, expensive physical experiments are no longer required for accurate modelling,’ stated Aziz Temirkhanov, a junior research fellow at the Laboratory of Methods for Big Data Analysis.
The experimental code and measurements of the storage system performance are publicly available.
The research was carried out within the Mirror Laboratories project of HSE University on improving the efficiency of data centres and data storage systems using artificial intelligence methods.


