Machine Learning: Science and Technology

ISSN: 2632-2153

OPEN ACCESS

Machine Learning: Science and Technology is a multidisciplinary open access journal that bridges the application of machine learning across the sciences with advances in machine learning methods and theory as motivated by physical insights.

Submit an article opens in new tab Track my article opens in new tab

RSS

Median submission to first decision before peer review 3 days

Median submission to first decision after peer review 49 days

Impact factor 6.8

Citescore 7.1

Full list of journal metrics

Open all abstracts, in this tab

The following article is Open access

Towards XAI agnostic explainability to assess differential diagnosis for Meningitis diseases

Aya Messai et al 2024 Mach. Learn.: Sci. Technol. 5 025052

View article, Towards XAI agnostic explainability to assess differential diagnosis for Meningitis diseases PDF, Towards XAI agnostic explainability to assess differential diagnosis for Meningitis diseases

Meningitis, characterized by meninges and cerebrospinal fluid inflammation, poses diagnostic challenges due to diverse clinical manifestations. This work introduces an explainable AI automatic medical decision methodology that determines critical features and their relevant values for the differential diagnosis of various meningitis cases. We proceed with knowledge acquisition to define the rules for this research. Currently, we have established the etiological diagnosis of Meningococcaemia, Meningococcal Meningitis, Tuberculous Meningitis, Aseptic Meningitis, Haemophilus influenzae Meningitis, and Pneumococcal Meningitis. The data preprocessing was conducted after collecting data from samples with meningitis diseases at Setif Hospital in Algeria. Tree-based ensemble methods were then applied to assess the model's performance. Finally, we implement an XAI agnostic explainability approach based on the SHapley Additive exPlanations technique to attribute each feature's contribution to the model's output. Experiments were conducted on the collected dataset and the SINAN database, obtained from the Brazilian Government's Health Information System on Notifiable Diseases, which comprises 6729 patients aged over 18 years. The Extreme Gradient Boosting model was chosen for its superior performance metrics (Accuracy: 0.90, AUROC: 0.94, and F1-score: 0.98). Setif's hospital data revealed notable performance metrics (Accuracy: 0.7143, F1-Score: 0.7857). This study's findings showcase each feature's contribution to the model's predictions and diagnosis. It also reveals critical biomarker ranges associated with distinct types of Meningitis. Significant diagnostic effect was found for Meningococcal Meningitis with elevated neutrophil levels ( $\gt$ 40%) and balanced lymphocyte levels (40%–60%). Tuberculous Meningitis demonstrated low neutrophil levels ( $\lt$ 60%) and elevated lymphocyte levels ( $\gt$ 60%). H. influenzae meningitis exhibited a predominance of neutrophils ( $\gt$ 80%), while Aseptic meningitis showed lower neutrophil levels ( $\lt$ 40%) and lymphocyte levels within the range of 50%–60%. The majority of the AI automatic medical decision results are twinned with validation by our team of infectious disease experts, confirming the alignment of algorithmic diagnoses with clinical practices.

https://doi.org/10.1088/2632-2153/ad4a1f

The following article is Open access

Autoencoders for discovering manifold dimension and coordinates in data from complex dynamical systems

Kevin Zeng et al 2024 Mach. Learn.: Sci. Technol. 5 025053

View article, Autoencoders for discovering manifold dimension and coordinates in data from complex dynamical systems PDF, Autoencoders for discovering manifold dimension and coordinates in data from complex dynamical systems

While many phenomena in physics and engineering are formally high-dimensional, their long-time dynamics often live on a lower-dimensional manifold. The present work introduces an autoencoder framework that combines implicit regularization with internal linear layers and L₂ regularization (weight decay) to automatically estimate the underlying dimensionality of a data set, produce an orthogonal manifold coordinate system, and provide the mapping functions between the ambient space and manifold space, allowing for out-of-sample projections. We validate our framework's ability to estimate the manifold dimension for a series of datasets from dynamical systems of varying complexities and compare to other state-of-the-art estimators. We analyze the training dynamics of the network to glean insight into the mechanism of low-rank learning and find that collectively each of the implicit regularizing layers compound the low-rank representation and even self-correct during training. Analysis of gradient descent dynamics for this architecture in the linear case reveals the role of the internal linear layers in leading to faster decay of a 'collective weight variable' incorporating all layers, and the role of weight decay in breaking degeneracies and thus driving convergence along directions in which no decay would occur in its absence. We show that this framework can be naturally extended for applications of state-space modeling and forecasting by generating a data-driven dynamic model of a spatiotemporally chaotic partial differential equation using only the manifold coordinates. Finally, we demonstrate that our framework is robust to hyperparameter choices.

https://doi.org/10.1088/2632-2153/ad4ba5

The following article is Open access

Global system errors to simultaneously improve the identification of subsystems with mixed data Gaussian process regression

Cameron J LaMack and Eric M Schearer 2024 Mach. Learn.: Sci. Technol. 5 025051

View article, Global system errors to simultaneously improve the identification of subsystems with mixed data Gaussian process regression PDF, Global system errors to simultaneously improve the identification of subsystems with mixed data Gaussian process regression

This paper explores the use of Gaussian process regression for system identification in control engineering. It introduces two novel approaches that utilize the data from a measured global system error. The paper demonstrates these approaches by identifying a simulated system with three subsystems, a one degree of freedom mass with two antagonist muscles. The first approach uses this whole-system error data alone, achieving accuracy on the same order of magnitude as subsystem-specific data ( $9.28\pm0.87 \text{N } \text{vs. } 6.96\pm0.32 \text{N}$ of total model errors). This is significant, as it shows that the same data set can be used to identify unique subsystems, as opposed to requiring a set of data descriptive of only a single subsystem. The second approach demonstrated in this paper mixes traditional subsystem-specific data with the whole system error data, achieving up to 98.71% model improvement.

https://doi.org/10.1088/2632-2153/ad4e05

The following article is Open access

Enhancing particle string detection in electrorheological plasmas using asymmetrical kernel convolutional networks

M Klein et al 2024 Mach. Learn.: Sci. Technol. 5 025050

View article, Enhancing particle string detection in electrorheological plasmas using asymmetrical kernel convolutional networks PDF, Enhancing particle string detection in electrorheological plasmas using asymmetrical kernel convolutional networks

Under different plasma conditions and electric fields in a complex plasma the plasma particles organize themselves in a string-like or chain-like manner. A phase transition from string-like to an isotropic particle distribution is observed at different electrical conditions. The streaming of charged ions around plasma particles with the surrounding electric field gives the plasma its electrorheological properties. The visibility of individual particles in a complex plasma opens up the opportunity to examine properties and phase transitions of such electrorheological fluids in detail. Because of the limited one-dimensional symmetry, determining the configuration of a particle and recognizing strings in particle distributions is not always straightforward. Several approaches have already been used to analyse particle clouds while either considering each particle locally or considering the particle cloud as a whole without providing information about single particle configurations. This paper presents a new machine learning approach that takes advantage of particle distributions over the entire particle cloud and detects all string-like particles at once, using a convolutional neural network in form of an encoder-decoder network with asymmetric kernel convolutions. This not only enhances the result quality but also accelerates the evaluation process, possibly enabling real-time analyses on electrorheological phase transitions, while achieving an accuracy of over 95% on manually labelled data.

https://doi.org/10.1088/2632-2153/ad4d3e

The following article is Open access

Unsupervised learning of quantum many-body scars using intrinsic dimension

Harvey Cao et al 2024 Mach. Learn.: Sci. Technol. 5 025049

View article, Unsupervised learning of quantum many-body scars using intrinsic dimension PDF, Unsupervised learning of quantum many-body scars using intrinsic dimension

Quantum many-body scarred systems contain both thermal and non-thermal scar eigenstates in their spectra. When these systems are quenched from special initial states which share high overlap with scar eigenstates, the system undergoes dynamics with atypically slow relaxation and periodic revival. This scarring phenomenon poses a potential avenue for circumventing decoherence in various quantum engineering applications. Given access to an unknown scar system, current approaches for identification of special states leading to non-thermal dynamics rely on costly measures such as entanglement entropy. In this work, we show how two dimensionality reduction techniques, multidimensional scaling and intrinsic dimension estimation, can be used to learn structural properties of dynamics in the PXP model and distinguish between thermal and scar initial states. The latter method is shown to be robust against limited sample sizes and experimental measurement errors.

https://doi.org/10.1088/2632-2153/ad4d3f

Machine Learning: Science and Technology

Journal links

Journal information

Machine Learning: Science and Technology

Most read

Latest articles

Review articles

Accepted manuscripts

Trending

Trending on Altmetric

Journal links

Journal information