Demo

2025

Explainable machine learning models for classifying reactions within crowd noise during men's collegiate basketball games

Mitchell C. Cutler^‡, Jason Bickmore^‡, Mark K. Transtrum, Katrina Pedersen^†, Shannon Proksch^‡, Eli Farrer, and Kent Gee

J. Acoust. Soc. Am. 158 (4), 3456-3471 (2025).
Time-dependent Ginzburg-Landau framework for sample-specific simulation of superconductors for radio-frequency applications

Aiden V. Harbick^† and Mark K. Transtrum

Phys. Rev. B 112 (9), 094518 (2025).
Tailoring and verification of dynamic load models in power system studies: the WECC load example

Mark K. Transtrum, Aleksandar M. Stankovic, and Clifford C. Youn

Sustain. Energy Grids Netw. 44, 101970 (2025).

2024

Quantitative and Kinetic Proteomics Reveal ApoE Isoform-dependent Proteostasis Adaptations in Mouse Brain

Nathan R. Zuniga^†, Noah E. Earls^‡, Ariel E. A. Denos^‡, Jared M. Elison^‡, Benjamin S. Jones^‡, Ethan G. Smith^‡, Noah G. Moran^‡, … Gerome M. Romero^†, Chad D. Hyer^‡, Kimberly B. Wagstaff^‡, … Mark K. Transtrum, … (3 other authors)

PLoS Comput. Biol. 12 (20), 1-29 (2024).
Impact of submicron Nb₃Sn stoichiometric surface defects on high-field superconducting radiofrequency cavity performance

Sarah A. Willson, Aiden V. Harbick^†, Liana Shpani, Van Do, Helena Lew-Kiedrowska, Matthias U. Liepe, Mark K. Transtrum, and S. J. Sibener

Phys. Rev. Res. 6 (4), 043133 (2024).
Maximum entropy temperature selection via the equipartition theorem

Jacob R. Nuttall^‡, Tracianne B. Neilsen, and Mark K. Transtrum

185th Meeting of the Acoustical Society of America, (Sydney, Australia, December 2023). Proc. Meet. Acoust. 52 (1), 070004 (2024).
Information Geometry in Underwater Acoustics: Tutorial, Case Study, and Outlook

Jay C. Spendlove^†, Michael C. Mortenson^†, Tracianne B. Neilsen, and Mark K. Transtrum

J. Theor. Comput. Acoust. 32 (3), 2450011 (2024).
Learning Seabed Properties and Sensitivities from Ship Noise Spectrograms

Michael C. Mortenson^†, Tracianne B. Neilsen, Mark K. Transtrum, and David P. Knobles

OCEANS 2024 (Singapore, April 2024).
Toward a dynamic national transportation noise map: Modeling temporal variability of spectral traffic noise emission levels

Mylan R. Cook^†, Kent L. Gee, Mark. K. Transtrum, and Shane V. Lympany

J. Acoust. Soc. Am. 156 (3), 1693-1706 (2024).
Information geometry analysis example for absolute and relative transmission loss in a shallow ocean

Jay C. Spendlove^†, Tracianne B. Neilsen, and Mark K. Transtrum

JASA Express Letters 4 (7), 076001 (2024).
The training process of many deep networks explores the same low-dimensional manifold

Jialin Mao, Itay Griniasty, Han Kheng Teoh, Rahul Ramesh, Rubing Yang, Mark K. Transtrum, James P. Sethna, and Pratik Chaudhari

Proc. Natl. Acad. Sci. U. S. A. 121 (12), e2310002121 (2024).
Data-driven decomposition of crowd noise from indoor sporting events

Mitchell C. Cutler^‡, Mylan R. Cook^†, Mark K. Transtrum, and Kent L. Gee

J. Acoust. Soc. Am. 155 (2), 962-970 (2024).

2023

Sloppy model analysis identifies bifurcation parameters without normal form analysis

Christian N. K. Anderson^† and Mark K. Transtrum

Phys. Rev. E 108 (6), 064215 (2023).
Toward a dynamic national transportation noise map: Modeling temporal variability of traffic volume

Mylan R. Cook^†, Kent L. Gee, Mark. K. Transtrum, and Shane V. Lympany

J. Acoust. Soc. Am. 154 (5), 2950-2958 (2023).
Recurrence Quantification Analysis of Crowd Sound Dynamics

Shannon Proksch, Majerle Reeves, Kent Gee, Mark Transtrum, Chris Kello, and Ramesh Balasubramaniam

Cogn. Sci. 47 (10), e13363 (2023).
Field-dependent nanospin ordering in monolayers of Fe₃O₄ nanoparticles throughout the superparamagnetic blocking transition

Johnathon Rackham^†, Brittni Pratt^†, Dalton Griner^‡, Dallin Smith^‡, Yanping Cai^†, Roger G. Harrison, … Mark K. Transtrum, and Karine Chesnel (2 other authors)

Phys. Rev. B 108 (10), 104415 (2023).
Feature selection for a continental-scale geospatial model of environmental sound levels

Katrina Pedersen^†, Mark K. Transtrum, Kent L. Gee, Shane V. Lympany, Michael M. James, and Alexandria R. Salton

J. Acoust. Soc. Am. 154 (2), 1168-1178 (2023).
Enhanced Surface Superconductivity of Niobium by Zirconium Doping

Nathan S. Sitaraman, Zeming Sun, Benjamin L. Francis^†, Ajinkya C. Hire, Thomas Oseroff, Zhaslan Baraissov, … and Mark K. Transtrum (4 other authors)

Phys. Rev. Applied 20 (1), 014064 (2023).
K-Means Clustering of 51 Geospatial Layers Identified for Use in Continental-Scale Modeling of Outdoor Acoustic Environments

Katrina Pedersen^†, Ryan R. Jensen, Lucas K. Hall, Mitchell C. Cutler^‡, Mark K. Transtrum, Kent L. Gee, and Shane V. Lympany

Appl. Sci.-Basel 13 (14), 8123 (2023).
ZrNb(CO) RF Superconducting Thin Film with High Critical Temperature in the Theoretical Limit

Zeming Sun, Thomas Oseroff, Zhaslan Baraissov, Darrah K. Dare, Katrina Howard, Benjamin Francis^†, … Mark K. Transtrum, … (7 other authors)

Adv. Electron. Mater. 9 (8), 2300151 (2023).
An app for nationwide dynamic traffic noise prediction

Mylan Cook^†, Kent Gee, Mark Transtrum, and Shane Lympany

INTER-NOISE and NOISE-CON Congress and Conference Proceedings, NOISE-CON23, (Grand Rapids, MI, May 2023).
Accurate broadband gradient estimates enable local sensitivity analysis of ocean acoustic models

Michael C. Mortenson^†, Tracianne B. Neilsen, Mark K. Transtrum, and David P. Knobles

J. Theor. Comput. Acoust. 31 (2), 2250015 (2023).
Information geometry for multiparameter models: new perspectives on the origin of simplicity

Katherine N Quinn, Michael C Abbott, Mark K Transtrum, Benjamin B Machta, and James P Sethna

Rep. Prog. Phys. 86 (3), 035901 (2023).

2022

Toward improving road traffic noise characterization: A reduced-order model for representing hourly traffic volume dynamics

Mylan R. Cook^†, Kent L. Gee, Mark K. Transtrum, Shane V. Lympany, and Matthew F. Calton

181st Meeting of the Acoustical Society of America, (Seattle, WA, December 2021).
Selecting simple, transferable models with the supremum principle

Cody Petrie^‡, Christian Anderson^†, Casie Maekawa^‡, Travis Maekawa^‡, and Mark K. Transtrum

Phys. Rev. Res. 4 (3), L032044 (2022).
Utilizing Nonequilibrium Isotope Enrichments to Dramatically Increase Turnover Measurement Ranges in Single Biopsy Samples from Humans

Bradley C. Naylor^†, Christian N. K. Anderson^†, Marcus Hadfield^‡, David H. Parkinson^‡, Austin Ahlstrom^‡, Austin Hannemann^‡, Chad R. Quilling^‡, Kyle J. Cutler^‡, … Mark K. Transtrum, … (8 other authors)

J. Proteome Res. 21 (11), 2703-2714 (2022).
Bayesian, frequentist, and information geometric approaches to parametric uncertainty quantification of classical empirical interatomic potentials

Yonatan Kurniawan^†, Cody L. Petrie^‡, Kinamo J. Williams^‡, Mark K. Transtrum, Ellad B. Tadmor, Ryan S. Elliott, Daniel S. Karls, and Mingjian Wen

J. Chem. Phys. 156 (21), 214103 (2022).
Integration of Physics- and Data-Driven Power System Models in Transient Analysis After Major Disturbances

Aleksandar A. Sarić, Mark K. Transtrum, Andrija T. Sarić, and Aleksandar M. Stanković

IEEE Syst. J. 17 (1), 479-490 (2022).
Application of a spectral-based wind noise reduction method to acoustical measurements

Mylan R. Cook^†, Kent L. Gee, Mark K. Transtrum, Shane V. Lympany, and Matt Calton

181st Meeting of the Acoustical Society of America, (Seattle, WA, December 2021). Proc. Meet. Acoust. 45, 045002 (2022).
Modeling Live Crowd Emotion Dynamics for State Estimation and Prediction

Brooks A. Butler^†, Philip E. Paré^†, Mark K. Transtrum, and Sean Warnick

2021 IEEE Conference on Control Technology and Applications, (San Diego, CA, Aug 2021).
State Estimation Model Reduction through the Manifold Boundary Approximation Method

Vanja G. Svenda, Mark K Transtrum, Benjamin L. Francis, Andrija T. Saric, and Aleksandar M Stankovic

IEEE Trans. Power Syst. 37 (1), 272-281 (2022).

2021

Piecemeal Reduction of Models of Large Networks

Benjamin L. Francis^†, Mark K. Transtrum, Andrija T. Sarić, and Aleksandar M. Stanković

2021 60th IEEE Conference on Decision and Control (CDC), (Austin, TX, December 2021).
Validating two geospatial models of continental-scale environmental sound levels

Katrina Pedersen^†, Mark K. Transtrum, Kent L. Gee, Shane V. Lympany, Michael M. James, and Alexandria R. Salton

JASA Express Letters 1 (12), 122401 (2021).
Simultaneous Determination of Equilibrium Constants, Enthalpy Changes and Stoichiometries by Titration Calorimetry

Lee D. Hansen, Mark K. Transtrum, and Jason D. Kenealey

Chapter 5 in Gibbs Energy and Helmholtz Energy: Liquids, Solutions and Vapours, edited by Emmerich Wilhelm and Trevor Letcher, Royal Society of Chemistry (2021).
Automatic classification and reduction of wind noise in spectral data

Mylan R. Cook^†, Kent L. Gee, Mark K. Transtrum, Shane V. Lympany, and Matt Calton

JASA Express Letters 1 (6), 063602 (2021).
Examining wind noise reduction effects of windscreens and microphone elevation in outdoor acoustical measurements

Zachary Jones^†, Mylan R. Cook^†, Kent L. Gee, Mark K. Transtrum, Shane V. Lympany, Matthew F. Calton, and Michael M. James

179th Meeting of the Acoustical Society of America, Acoustics Virtually Everywhere (December 2020). Proc. Meet. Acoust. 42 (1), 045007 (2021).
Symbolic Regression for Data-Driven Dynamic Model Refinement in Power Systems

Andrija T. Sarić, Aleksandar A. Sarić, Mark K. Transtrum, and Aleksandar M. Stanković

IEEE Trans. Power Syst. 36 (3), 2390-2402 (2021).
Data-Driven Classification, Reduction, Parameter Identification and State Extension in Hybrid Power Systems

Andrija T. Sarić, Mark K. Transtrum, and Aleksandar M. Stanković

IEEE Trans. Power Syst. 36 (3), 2222-2233 (2021).
Effect of the density of states at the Fermi level on defect free energies and superconductivity: A case study of Nb₃Sn

Nathan S. Sitaraman, Michelle M. Kelley, Ryan D. Porter, Matthias U. Liepe, Tomás A. Arias, Jared Carlson^†, Alden R. Pack^†, Mark K. Transtrum, … (1 other authors)

Phys. Rev. B 103 (11), 115106 (2021).
Analysis of magnetic vortex dissipation in Sn-segregated boundaries in Nb₃Sn superconducting RF cavities

Jared Carlson, Alden Pack, Mark K. Transtrum, Jaeyel Lee, David N. Seidman, Danilo B. Liarte, … (6 other authors)

Phys. Rev. B 103 (2), 024516 (2021).

2020

Automatic detection of instances of focused crowd involvement at recreational events

Eric Todd^‡, Mylan R. Cook^†, Katrina Pedersen^†, David S. Woolworth, Brooks A. Butler^‡, Xin Zhao^‡, Colt Liu^‡, Kent L. Gee, Mark K. Transtrum, … (1 other authors)

178th Meeting of the Acoustical Society of America, (San Diego, CA, December 2019). Proc. Meet. Acoust. 39, 040003 (2020).
Interleaving physics- and data-driven models for power system transient dynamics

Aleksandar M. Stanković, Aleksandar A. Sarić, Andrija T. Sarić, and Mark K. Transtrum

Electr. Power Syst. Res. 189, 106824 (2020).
Probabilistic extension of flexible hybrid state estimation for cyber-physical systems

Vanja G. Švenda, Aleksandar M. Stanković, Andrija T. Sarić, and Mark K. Transtrum

Int. J. Electr. Power Energy Syst. 122, 106179 (2020).
K-Means clustering of inputs to a geospatial model for optimizing acoustic data collection

Brooks A. Butler^‡, Katrina Pedersen^†, Casie Maekawa, Kent L. Gee, Mark K. Transtrum, Michael M. James, and Alexandria R. Salton

176th Meeting of Acoustical Society of America, 2018 Acoustics Week in Canada (Victoria, Canada, November 2018). Proc. Meet. Acoust. 35 (1), 055008 (2020).
Data-driven Symbolic Regression for Identification of Nonlinear Dynamics in Power Systems

Alex M. Stanković, Aleksandar A. Sarić, Andrija T. Sarić, and Mark K. Transtrum

2020 IEEE Power & Energy Society General Meeting, (Montreal, Canada, August 2020).
Vortex nucleation in superconductors within time-dependent Ginzburg-Landau theory in two and three dimensions: Role of surface defects and material inhomogeneities

Alden R. Pack^†, Jared Carlson^‡, Spencer Wadsworth^‡, and Mark K. Transtrum

Phys. Rev. B 101 (14), 144504 (2020).
Flexible hybrid state estimation for power systems with communication irregularities

V. G. Švenda, A. M. Stanković, A. T. Sarić, and M. K. Transtrum

IET Gener. Transm. Distrib. 14 (11), 2111-2119 (2020).

2019

Model Boundary Approximation Method as a Unifying Framework for Balanced Truncation and Singular Perturbation Approximation

P. E. Paré, D. Grimsman, A. T. Wilson, M. K. Transtrum, and S. Warnick

IEEE Trans. Autom. Control 64 (11), 4796-4802 (2019).
Network Reduction in Transient Stability Models using Partial Response Matching

Benjamin L. Francis^‡, Jacob R. Nuttall^‡, Mark K. Transtrum, Andrija T. Sarić, and Aleksandar M. Stanković

2019 North American Power Symposium (NAPS), (Wichita, Kansas, October 2019).
Machine learning-based ensemble model predictions of outdoor ambient sound levels

Katrina Pedersen^†, Mark K. Transtrum, Kent L. Gee, Brooks A. Butler^‡, Michael M. James, and Alexandria R. Salton

176th Meeting of Acoustical Society of America; 2018 Acoustics Week in Canada, (Victoria, Canada, November 2018). Proc. Meet. Acoust. 35 (1), 022002 (2019).
Classifying crowd behavior at collegiate basketball games using acoustic data

Brooks A. Butler^‡, Katrina Pedersen^†, Mylan R. Cook^†, Spencer G. Wadsworth^‡, Eric Todd^‡, Dallen Stark^‡, Kent L. Gee, Mark K. Transtrum, … (1 other authors)

176th Meeting of Acoustical Society of America; 2018 Acoustics Week in Canada, (Victoria, Canada, November 2018). Proc. Meet. Acoust. 35 (1), 055006 (2019).
Data Classification and Parameter Identification in Power Systems by Manifold Learning

Andrija T. Sarić, Mark K. Transtrum, and Aleksandar M. Stanković

2019 IEEE Milan PowerTech, (Milan, Italy, June 2019).
Unwinding the model manifold: Choosing similarity measures to remove local minima in sloppy dynamical systems

Benjamin L. Francis^† and Mark K. Transtrum

Phys. Rev. E 100 (1), 012206 (2019).
Modeling inter-particle magnetic correlations in magnetite nanoparticle assemblies using x-ray magnetic scattering data

Johnathon Rackham^†, Brittni Newbold^‡, Steve Kotter^‡, Dallin Smith^‡, Dalton Griner^‡, Roger Harrison, … Mark Transtrum, and Karine Chesnel (1 other authors)

AIP Advances 9 (3), 035033 (2019).
Optimal experimental design for machine learning using the Fisher information

Tracianne B. Neilsen, David F. Van Komen^‡, Mark K. Transtrum, Makenzie B. Allen^‡, and David P. Knobles

Proc. Meet. Acoust. 35 (1), 055004 (2019).
Data-Driven Dynamic Equivalents for Power System Areas from Boundary Measurements

Andrija T. Sarić, Mark T. Transtrum, and Aleksandar M. Stanković

IEEE Trans. Power Syst. 34 (1), 360-370 (2019).

2018

Simultaneous Global Identification of Dynamic and Network Parameters in Transient Stability Studies

Mark K. Transtrum, Benjamin L. Francis^†, Andrija T. Saric, and Aleksandar M. Stankovic

2018 IEEE Power & Energy Society General Meeting (PESGM). (Portland, OR, August 2018).
Influence of Communication Irregularities and Co-simulation on Hybrid Power System State Estimation

Vanja G. Švenda, Alex M. Stanković, Andrija T. Sarić, and Mark K. Transtrum

2018 IEEE PES Innovative Smart Grid Technologies Conference Europe (ISGT-Europe). (Sarajevo, Bosnia-Herzegovina, October 2018).
Effect of extreme temperatures on soil: A calorimetric approach

Lee D. Hansen, Nieves Barros, Mark K. Transtrum, Jose A. Rodríguez-Añón, Jorge Proupín, Verónica Piñeiro, Ander Arias-González, and Nahia Gartzia

Thermochim. Acta 670, 128-135 (2018).
Geometrically Motivated Reparameterization for Identifiability Analysis in Power Systems Models

Mark K. Transtrum, Benjamin L. Francis^†, Clifford C. Youn, Andrija T. Saric, and Aleksander M. Stankovic

2018 North American Power Symposium (NAPS). (Fargo, ND, September 2018).
The Spectrum of Mechanism-Oriented Models and Methods for Explanations of Biological Phenomena

C. Anthony Hunt, Ahmet Erdemir, William W. Lytton, Feilim Mac Gabhann, Edward A. Sander, Mark K. Transtrum, and Lealem Mulugeta

Processes 6 (5), 56 (2018).
Experimental design and model reduction in systems biology

Jenny E. Jeong, Qinwei Zhuang, Mark K. Transtrum, Enlu Zhou, and Peng Qiu

Quant. Biol. 6 (4), 287-306 (2018).
Titration Calorimetry: From Concept to Application

Lee D. Hansen, Mark K. Transtrum, and Colette F. Quinn

Springer (2018)
Maximizing the information learned from finite data selects a simple model

Henry H. Mattingly, Mark K. Transtrum, Michael C. Abbott, and Benjamin B. Machta

Proc. Natl. Acad. Sci. U. S. A. 115 (8), 1760-1765 (2018).
Calorimetric Methods for Measuring Stability and Reusability of Membrane Immobilized Enzymes

Marco Mason^†, Matteo Scampicchio, Colette F. Quinn, Mark K. Transtrum, Nicholas Baker^‡, Lee D. Hansen, and Jason D. Kenealey

J. Food Sci. 83 (2), 326-331 (2018).
Hybrid power system state estimation with irregular sampling

Aleksandar M. Stanković, Vanja Švenda, Andrija T. Sarić, and Mark K. Transtrum

Power & Energy Society General Meeting, IEEE (Chicago, IL, July 2017)
SRF Theory Developments from the Center for Bright Beams

D. B. Liarte, T. Arias, D. L. Hall, M. Liepe, J. P. Sethna, N. Sitaraman, A. Pack^†, and M. K. Transtrum

18th International Conference on RF Superconductivity, (Lanzhou, China, 2017).
Information Geometry Approach to Verification of Dynamic Models in Power Systems

M. K. Transtrum, A. T. Saric, and A. M. Stankovic

IEEE Trans. Power Syst. 33 (1), 440-450 (2018).

2017

Information geometry for model identification and parameter estimation in renewable energy – DFIG plant case

Andrija T. Sarić, Mark K. Transtrum, and Aleksandar M. Stanković

IET Gener. Transm. Distrib. 12 (6), 1294-1302 (2017).
Information geometry for model reduction of dynamic loads in power systems

Clifford C. Youn, Andrija T. Sarić, Mark K. Transtrum, and Aleksandar M. Stanković

2017 IEEE Manchester PowerTech (Manchester, UK, June 2017).
Mechanisms of in vivo ribosome maintenance change in response to nutrient signals

Andrew D. Mathis^†, Bradley C. Naylor^†, Richard H. Carson^†, Eric Evans^‡, Justin Harwell^‡, Jared Knect^‡, Eric Hexem^‡, … Mark K. Transtrum, … (5 other authors)

Mol. Cell. Proteomics 16 (2), 243-254 (2017).
Theoretical estimates of maximum fields in superconducting resonant radio frequency cavities: stability theory, disorder, and laminates

Danilo B. Liarte, Sam Posen, Mark K. Transtrum, Gianluigi Catelani, Matthias Liepe, and James P. Sethna

Supercond. Sci. Technol. 30 (3), 033002 (2017).

2016

The Limitations of Model-Based Experimental Design and Parameter Estimation in Sloppy Systems

Andrew White^‡, Malachi Tolman^†, Howard D. Thames, Hubert Rodney Withers, Kathy A. Mason, and Mark K. Transtrum

PLoS Comput. Biol. 12, 1-12 (2016).
Ginzburg-Landau theory of the superheating field anisotropy of layered superconductors

Danilo B. Liarte, Mark K. Transtrum, and James P. Sethna

Phys. Rev. B 94 (14), 144504 (2016).
Measurement-Directed Reduction of Dynamic Models in Power Systems

Mark K. Transtrum , Andrija T. Saric, and Aleksandar M. Stankovic

IEEE Trans. Power Syst. 32 (3), 2243-2253 (2016).
Dynamic Model Estimation for Power System Areas from Boundary Measurements

Andrija T. Sarić, Mark K. Transtrum, and Aleksandar M. Stanković

IEEE Power in Energy and Society General Meeting, (Boston, MA, July 2016).
Information geometry for model verification in energy systems

Mark K. Transtrum, Andrija T. Saric, and Alex M. Stankovic

2016 Power Systems Computation Conference (PSCC), (Genova, Italy, June 2016).
Bridging Mechanistic and Phenomenological Models of Complex Biological Systems

Mark K. Transtrum and Peng Qiu

PLoS Comput. Biol. 12 (5), 1-34 (2016).
Enzyme-catalyzed and binding reaction kinetics determined by titration calorimetry

Lee D. Hansen, Mark K. Transtrum, Colette Quinn, and Neil Demarse

Biochim. Biophys. Acta-Gen. Subj. 1860 (5), 957-966 (2016).

2015

Shielding Superconductors with Thin Films as Applied to rf Cavities for Particle Accelerators

Sam Posen, Mark K. Transtrum, Gianluigi Catelani, Matthias U. Liepe, and James P. Sethna

Phys. Rev. Applied 4, 044019 (2015).
Sloppiness and the Geometry of Parameter Space

Brian K. Mannakee, Aaron P. Ragsdale, Mark K. Transtrum, and Ryan N. Gutenkunst

Uncertainty in Biology, Volume 17 of the series Studies in Mechanobiology, Tissue Engineering and Biomaterials
A unified view of Balanced Truncation and Singular Perturbation Approximations

Philip E. Paré, Alma T. Wilson, Mark K. Transtrum, and Sean C. Warnick

American Control Conference, (Chicago, IL, July 2015), pp 1984-1989.
Perspective: Sloppiness and emergent theories in physics, biology, and beyond

Mark K. Transtrum, Benjamin B. Machta, Kevin S. Brown, Bryan C. Daniels, Christopher R. Myers, and James P. Sethna

J. Chem. Phys. 143 (1), 010901 (2015).
Enzyme kinetics determined by single-injection isothermal titration calorimetry

Mark K. Transtrum, Lee D. Hansen, and Colette Quinn

Methods 76, 194-200 (2015).

2014

Model Reduction by Manifold Boundaries

Mark K. Transtrum and Peng Qiu

Phys. Rev. Lett. 113, 098701 (2014).

2013

Parameter Space Compression Underlies Emergent Theories and Predictive Models

Benjamin B. Machta, Ricky Chachra, Mark K. Transtrum, and James P. Sethna

Science 342 (6158), 604-607 (2013).
Use of the LQ model with large fraction sizes results in underestimation of isoeffect doses

Tommy Sheu, Jessica Molkentine, Mark K. Transtrum, Thomas A. Buchholz, Hubert Rodney Withers, Howard D. Thames, and Kathy A. Mason

Radiother. Oncol. 109 (1), 21-25 (2013).

2012

Structural susceptibility and separation of time scales in the van der Pol oscillator

Ricky Chachra, Mark K. Transtrum, and James P. Sethna

Phys. Rev. E 86 (2), 026712 (2012).
Optimal experiment selection for parameter estimation in biological differential equation models

Mark K Transtrum and Peng Qiu

BMC Bioinformatics 13 (1), 181 (2012).

2011

Comment on “Sloppy models, parameter uncertainty, and the role of experimental design”

Ricky Chachra, Mark K. Transtrum, and James P. Sethna

Mol. BioSyst. 7, 2522 (2011).
Superheating field of superconductors within Ginzburg-Landau theory

Mark K. Transtrum, Gianluigi Catelani, and James P. Sethna

Phys. Rev. B 83 (9), 094505 (2011).
Geometry of nonlinear least squares with applications to sloppy models and optimization

Mark K. Transtrum, Benjamin B. Machta, and James P. Sethna

Phys. Rev. E 83 (3), 036701 (2011).
Riemann manifold Langevin and Hamiltonian Monte Carlo methods

Mark K. Transtrum, Yanjiun Chen, Benjamin B. Machta, James P. Sethna, and Ryan Gutenkunst

Read before The Royal Statistical Society at a meeting organized by the Research Section (October 2010). See discussion on page 199. J. Roy. Stat. Soc. B 73 (2), 123-214 (2011).

2010

Why are Nonlinear Fits to Data so Challenging?

Mark K. Transtrum, Benjamin B. Machta, and James P. Sethna

Phys. Rev. Lett. 104 (6), 060201 (2010).

2009

An Analytic Iterative Approach to Solving the Time-Independent Schrodinger Equation

Chad Junkermeier^†, Mark Transtrum^‡, and Manuel Berrondo

Int. J. Quantum Chem. 109 (5), 982-998 (2009).

2007

Beyond the quantum in Snyder space

J. F. S. Van Huele and M. K. Transtrum^‡

Beyond the Quantum, (June 2006,Leiden, The Netherlands), 235-243.

2006

A perturbative approach to Snyder space with applications

Mark K. Transtrum^‡ and Jean-François S. Van Huele

J. Phys. A 39 (48), 14985-14996 (2006).

2005

Commutation relations for functions of operators

Mark K. Transtrum and Jean-François S. Van Huele

J. Math. Phys. 46 (6), 063510 (2005).

^‡Undergraduate Student Author

^†Graduate Student Author

Selected Publications

Vortex nucleation in superconductors within time-dependent Ginzburg-Landau theory in two and three dimensions: Role of surface defects and material inhomogeneities

Alden R. Pack, Jared Carlson, Spencer Wadsworth, and Mark K. Transtrum

We use time-dependent Ginzburg-Landau theory to study the nucleation of vortices in type-II superconductors in the presence of both geometric and material inhomogeneities. The superconducting Meissner state is metastable up to a critical magnetic field, known as the superheating field. For a uniform surface and homogeneous material, the superheating transition is driven by a nonlocal critical mode in which an array of vortices simultaneously penetrate the surface. In contrast, we show that even a small amount of disorder localizes the critical mode and can have a significant reduction in the effective superheating field for a particular sample. Vortices can be nucleated by either surface roughness or local variations in material parameters, such as

. Our approach uses a finite-element method to simulate a cylindrical geometry in two dimensions and a film geometry in two and three dimensions. We combine saddle-node bifurcation analysis along with a fitting procedure to evaluate the superheating field and identify the unstable mode. We demonstrate agreement with previous results for homogeneous geometries and surface roughness and extend the analysis to include variations in material properties. Finally, we show that in three dimensions, surface divots not aligned with the applied field can increase the superheating field. We discuss implications for fabrication and performance of superconducting resonant frequency cavities in particle accelerators.

Flexible hybrid state estimation for power systems with communication irregularities

M. K. Transtrum (et al.)

This study proposes a novel flexible hybrid state estimation (SE) algorithm when a realistic communication system with its irregularities is taken into account. This system is modelled by the Network Simulator 2 software tool, which is also used to calculate communication delays and packet drop probabilities. Within this setup, the system observability can be predicted, and the proposed SE can decide between using the static SE (SSE) or the discrete Kalman filter plus SSE-based measurements and time alignment (Forecasting-aided SE). Flexible hybrid SE (FHSE) incorporates both phasor measurement units and supervisory control and data acquisition-based measurements, with different time stamps. The proposed FHSE with detailed modelling of the communication system is motivated by: (i) well-known issues in SSE (time alignment of the measurements, frequent un-observability for fixed SE time stamps etc.); and (ii) the need to model a realistic communication system (calculated communication delays and packet drop probabilities are a part of the proposed FHSE). Application of the proposed algorithm is illustrated for examples with time-varying bus load/generation on two IEEE test cases: 14-bus and 300-bus.

Model Boundary Approximation Method as a Unifying Framework for Balanced Truncation and Singular Perturbation Approximation

M. K. Transtrum (et al.)

Model Boundary Approximation Method as a Unifying Framework for Balanced Truncation and Singular Perturbation Approximation

Network Reduction in Transient Stability Models using Partial Response Matching

Benjamin L. Francis, Jacob R. Nuttall, and Mark K. Transtrum (et al.)

We describe a method for simultaneously identifying and reducing dynamic power systems models in the form of differential-algebraic equations. Often, these models are large and complex, containing more parameters than can be identiﬁed from the available system measurements. We demonstrate our method on transient stability models, using the IEEE 14-bus test system.Ourapproachusestechniquesofinformationgeometryto remove unidentiﬁable parameters from the model. We examine the case of a networked system with 58 parameters using full observations throughout the network. We show that greater reduction can be achieved when only partial observations are available, including reduction of the network itself.

Machine learning-based ensemble model predictions of outdoor ambient sound levels

Katrina Pedersen, Mark K. Transtrum, Kent L. Gee, and Brooks A. Butler (et al.)

Outdoor ambient sound levels can be predicted from machine learning-based models derived from geospatial and acoustic training data. To improve modeling robustness, median predicted sound levels have been calculated from an ensemble of tuned models from different supervised machine learning modeling classes. The ensemble is used to predict ambient sound levels throughout the contiguous United States. The training data set consists of 607 unique sites, where various acoustic metrics, such as overall daytime L50 levels and one-third octave frequency band levels, have been obtained. Data for 117 geospatial features, which include metrics such as distance to the nearest road or airport, are used. The spread in the ensemble provides an estimate of the modeling accuracy. Results of an initial leave-one-out and leave-four-out validation study are presented.

Classifying crowd behavior at collegiate basketball games using acoustic data

Brooks A. Butler, Katrina Pedersen, Mylan R. Cook, Spencer G. Wadsworth, Eric Todd, Dallen Stark, Kent L. Gee, Mark K. Transtrum, and Sean Warnick

The relationship between crowd noise and crowd behavioral dynamics is a relatively unexplored field of research. Signal processing and machine learning (ML) may be useful in classifying and predicting crowd emotional state. This paper describes using both supervised and unsupervised ML methods to automatically differentiate between different types of crowd noise. Features used include A-weighted spectral levels, low-level audio signal parameters, and Mel-frequency cepstral coefficients. K-means clustering is used for the unsupervised approach with spectral levels, and six distinct clusters are found; four of these clusters correspond to different amounts of crowd involvement, while two correspond to different amounts of band or public announcement system noise. Random forests are used for the supervised approach, wherein validation and testing accuracies are found to be similar. These investigations are useful for differentiating between types of crowd noise, which is necessary for future work in automatically determining and classifying crowd emotional state.

Data Classification and Parameter Identification in Power Systems by Manifold Learning

Mark K. Transtrum (et al.)

This paper describes a manifold learning algorithm for big data classification and parameter identification in real-time operation of power systems. We assume a black-box setting, where only SCADA-based measurements at the point of interest are available. Data classification is based on diffusion maps, where an improved data-informed metric construction for partition trees is used. Data reduction is demonstrated on an hourly measurement tensor example, collected from the power flow solutions calculated for daily load/generation profiles. Parameter identification is performed on the same example, generated via randomly selected input parameters. The proposed method is illustrated on the case of the static part (ZIP) of a detailed WECC load model, connected to a single bus of a real-world 441-bus power system.

Unwinding the model manifold: Choosing similarity measures to remove local minima in sloppy dynamical systems

Benjamin L. Francis and Mark K. Transtrum

In this paper, we consider the problem of parameter sensitivity in models of complex dynamical systems through the lens of information geometry. We calculate the sensitivity of model behavior to variations in parameters. In most cases, models are sloppy, that is, exhibit an exponential hierarchy of parameter sensitivities. We propose a parameter classification scheme based on how the sensitivities scale at long observation times. We show that for oscillatory models, either with a limit cycle or a strange attractor, sensitivities can become arbitrarily large, which implies a high effective dimensionality on the model manifold. Sloppy models with a single fixed point have model manifolds with low effective dimensionality, previously described as a “hyper-ribbon.” In contrast, models with high effective dimensionality translate into multimodal fitting problems. We define a measure of curvature on the model manifold which we call the winding frequency that estimates the density of local minima in the model's parameter space. We then show how alternative choices of fitting metrics can “unwind” the model manifold and give low winding frequencies. This prescription translates the model manifold from one of high effective dimensionality into the hyper-ribbon structures observed elsewhere. This translation opens the door for applications of sloppy model analysis and model reduction methods developed for models with low effective dimensionality.

Modeling inter-particle magnetic correlations in magnetite nanoparticle assemblies using x-ray magnetic scattering data

Johnathon Rackham, Brittni Newbold, Steve Kotter, Dallin Smith, Dalton Griner, Roger Harrison, Mark Transtrum, and Karine Chesnel (et al.)

Magnetic nanoparticles are increasingly used in nanotechnologies and biomedical applications, such as drug targeting, MRI, bio-separation. Magnetite (Fe3O4) nanoparticles stand to be effective in these roles due to the non-toxic nature of magnetite and its ease of manufacture. To be more effective in these applications, a greater understanding of the magnetic behavior of a collection of magnetite nanoparticles is needed. This research seeks to discover the local magnetic ordering of ensembles of magnetite nanoparticles occurring under various external fields. To complete this study, we use x-ray resonant magnetic scattering (XRMS). Here we discuss the modeling of the magnetic scattering data using a one-dimensional chain of nanoparticles with a mix of ferromagnetic, anti-ferromagnetic, and random orders. By fitting the model to the experimental data, we extracted information about the magnetic correlations in the nanoparticle assembly.

Optimal experimental design for machine learning using the Fisher information

Tracianne B. Neilsen, David F. Van Komen, Mark K. Transtrum, and Makenzie B. Allen (et al.)

Optimal experimental design focuses on selecting experiments that minimize the statistical uncertainty in inferred parameter or predictions. In traditional optimizations, the experiment consists of input data, model parameters, and cost function. For machine learning and deep learning, the features, labels, and loss function define the experiment. One tool for optimal experimental design is the Fisher information, which gives an estimate of the relative uncertainty in and correlation among the model parameters based on the local curvature of the cost function. Using the Fisher information allows for rapid assessment of many different experimental conditions. In machine learning, the Fisher information can provide guidance as to which types of input features and labels maximize the gradients in the search space. This approach has been applied, for example, to systems biology models of biochemical reaction networks [Transtrum and Qiu, BMC Bioinformatics 13(1), 181 (2012)]. Preliminary application of the Fisher information to optimize experimental design for source localization in an uncertain ocean environment is a step towards finding an efficient machine learning algorithm that produces results with the least uncertainty in the quantities of interest.

Data-Driven Dynamic Equivalents for Power System Areas from Boundary Measurements

Mark T. Transtrum (et al.)

The paper describes an algorithm for parameter identification of a dynamic equivalent for an external subsystem, based solely on the available online measurements in boundary buses and branches. Static equivalent part is represented by equivalent impedances from boundary buses (ones that separate the internal and external subsystems), and calculated using the modified (minimum loss) REI (Radial, Equivalent and Independent) method. Parameter identification of Synchronous Generator (SG)-based equivalent (for predominantly production external areas), Dynamic Load (DL)-based equivalent (for predominantly load external areas) or (SG+DL)-based equivalent (for mixed external areas) in fictitious buses is performed by Levenberg-Marquardt Weighted Least-Square (WLS) nonlinear optimization, which minimizes the variances between available online measurements and transient responses of the reduced power system. The IEEE 14-bus and 441-bus real-world test systems are used to illustrate and test the proposed power system equivalent derivation technique.

Simultaneous Global Identification of Dynamic and Network Parameters in Transient Stability Studies

Mark K. Transtrum and Benjamin L. Francis (et al.)

The paper describes a global identification procedure for dynamic power system models in the form of differential and algebraic equations. Power system models have a number of features that makes their improvement challenging - they are multi-level, multi-user and multi-physics. Not surprisingly, they are nonlinear and time varying, both in terms of states (memory variables) and parameters, and discrete structures, such as graphs, are strongly blended with continuous dynamics, resulting in network dynamics. The transient stability models are used as a prototypical example. Our method is based on information geometry, and uses advances in computational differential geometry to characterize high-dimensional manifolds in the space of measurements. In the case of network parameters, a comparison is presented with circuit-theoretic techniques. The results are illustrated on the case of IEEE 14-bus test system with 58 parameters in our realization.

Influence of Communication Irregularities and Co-simulation on Hybrid Power System State Estimation

Mark K. Transtrum (et al.)

The paper explores the effects of sensor behavior and communication system (CS) irregularities on power system state estimation (SE). CS are modeled in Network Simulator 2 (NS-2), allowing the quantification of irregularities, including delays and dropped packets. The overall information is obtained combining SCADA measurements with phasor measurement unit (PMU) derived data, where time stamping (based on GPS or an equivalent local clock) for all measurements is assumed. To fully analyze the effects of irregularities, a detailed analysis of sensitivities to different communication system parameters is provided as well. Using the co-simulation environment PiccSIM, a SE with these irregularities is quantified for CS parameter variation, with detailed models of power and communication flows.

Effect of extreme temperatures on soil: A calorimetric approach

Mark K. Transtrum (et al.)

The relation between soil organic matter dynamics and temperature is an important research topic, poorly understood yet. This study focuses on the effect of temperature on the heat rate of soil organic matter decomposition using different soil types, by simulating an extreme heat wave with a calorimeter. Heat rates were measured with an automated step-scan of temperature upward at 20, 30, 40, 50, and 60 degrees C, and downward at 40 and 20 degrees C to monitor how soil recovers after the heat wave. The results show enzyme-catalyzed bioprocesses are not the only reactions in soil mineralization. Other reactions can be distinguished from the shape of the curve of the heat rate versus temperature. These reactions coexist at normal environmental temperatures, and their relative contribution to soil organic matter mineralization rates varies with soil type.

Geometrically Motivated Reparameterization for Identifiability Analysis in Power Systems Models

Mark K. Transtrum and Benjamin L. Francis (et al.)

This paper describes a geometric approach to parameter identifiability analysis in models of power systems dynamics. When a model of a power system is to be compared with measurements taken at discrete times, it can be interpreted as a mapping from parameter space into a data or prediction space. Generically, model mappings can be interpreted as manifolds with dimensionality equal to the number of structurally identifiable parameters. Empirically it is observed that model mappings often correspond to bounded manifolds. We propose a new definition of practical identifiability based the topological definition of a manifold with boundary. In many ways, our proposed definition extends the properties of structural identifiability. We construct numerical approximations to geodesics on the model manifold and use the results, combined with insights derived from the mathematical form of the equations, to identify combinations of practically identifiable and unidentifiable parameters. We give several examples of application to dynamic power systems models.

The Spectrum of Mechanism-Oriented Models and Methods for Explanations of Biological Phenomena

Mark K. Transtrum (et al.)

Developing and improving mechanism-oriented computational models to better explain biological phenomena is a dynamic and expanding frontier. As the complexity of targeted phenomena has increased, so too has the diversity in methods and terminologies, often at the expense of clarity, which can make reproduction challenging, even problematic. To encourage improved semantic and methodological clarity, we describe the spectrum of Mechanism-oriented Models being used to develop explanations of biological phenomena. We cluster explanations of phenomena into three broad groups. We then expand them into seven workflow-related model types having distinguishable features. We name each type and illustrate with examples drawn from the literature. These model types may contribute to the foundation of an ontology of mechanism-based biomedical simulation research. We show that the different model types manifest and exert their scientific usefulness by enhancing and extending different forms and degrees of explanation. The process starts with knowledge about the phenomenon and continues with explanatory and mathematical descriptions. Those descriptions are transformed into software and used to perform experimental explorations by running and examining simulation output. The credibility of inferences is thus linked to having easy access to the scientific and technical provenance from each workflow stage.

Experimental design and model reduction in systems biology

Mark K. Transtrum (et al.)

Background: In systems biology, the dynamics of biological networks are often modeled with ordinary differential equations (ODEs) that encode interacting components in the systems, resulting in highly complex models. In contrast, the amount of experimentally available data is almost always limited, and insufficient to constrain the parameters. In this situation, parameter estimation is a very challenging problem. To address this challenge, two intuitive approaches are to perform experimental design to generate more data, and to perform model reduction to simplify the model. Experimental design and model reduction have been traditionally viewed as two distinct areas, and an extensive literature and excellent reviews exist on each of the two areas. Intriguingly, however, the intrinsic connections between the two areas have not been recognized.

Results: Experimental design and model reduction are deeply related, and can be considered as one unified framework. There are two recent methods that can tackle both areas, one based on model manifold and the other based on profile likelihood. We use a simple sum-of-two-exponentials example to discuss the concepts and algorithmic details of both methods, and provide Matlab-based code and implementation which are useful resources for the dissemination and adoption of experimental design and model reduction in the biology community.

Conclusions: From a geometric perspective, we consider the experimental data as a point in a high-dimensional data space and the mathematical model as a manifold living in this space. Parameter estimation can be viewed as a projection of the data point onto the manifold. By examining the singularity around the projected point on the manifold, we can perform both experimental design and model reduction. Experimental design identifies new experiments that expand the manifold and remove the singularity, whereas model reduction identifies the nearest boundary, which is the nearest singularity that suggests an appropriate form of a reduced model. This geometric interpretation represents one step toward the convergence of experimental design and model reduction as a unified framework.

Titration Calorimetry: From Concept to Application

Lee D. Hansen and Mark K. Transtrum (et al.)

This Brief describes the calibration of titration calorimeters (ITCs) and calculation of stoichiometry, equilibrium constants, enthalpy changes, and rate constants for reactions in solution. A framework/methodology for model development for analysis of ITC data is presented together with methods for assessing the uncertainties in determined parameters and test data sets. This book appeals to beginners, as well as to researchers and professionals in the field.

Maximizing the information learned from finite data selects a simple model

Mark K. Transtrum (et al.)

We use the language of uninformative Bayesian prior choice to study the selection of appropriately simple effective models. We advocate for the prior which maximizes the mutual information between parameters and predictions, learning as much as possible from limited data. When many parameters are poorly constrained by the available data, we find that this prior puts weight only on boundaries of the parameter space. Thus, it selects a lower-dimensional effective theory in a principled way, ignoring irrelevant parameter directions. In the limit where there are sufficient data to tightly constrain any number of parameters, this reduces to the Jeffreys prior. However, we argue that this limit is pathological when applied to the hyperribbon parameter manifolds generic in science, because it leads to dramatic dependence on effects invisible to experiment.

Calorimetric Methods for Measuring Stability and Reusability of Membrane Immobilized Enzymes

Marco Mason, Mark K. Transtrum, Nicholas Baker, Lee D. Hansen, and Jason D. Kenealey (et al.)

The aim of this work is to develop calorimetric methods for characterizing the activity and stability of membrane immobilized enzymes. Invertase immobilized on a nylon-6 nanofiber membrane is used as a test case. The stability of both immobilized and free invertase activity was measured by spectrophotometry and isothermal titration calorimetry (ITC). Differential scanning calorimetry was used to measure the thermal stability of the structure and areal concentration of invertase on the membrane. This is the 1st demonstration that ITC can be used to determine activity and stability of an enzyme immobilized on a membrane. ITC and spectrophotometry show maximum activity of free and immobilized invertase at pH 4.5 and 45 to 55 °C. ITC determination of the activity as a function of temperature over an 8-h period shows a similar decline of activity of both free and immobilized invertase at 55 °C.

Hybrid power system state estimation with irregular sampling

Mark K. Transtrum (et al.)

The paper proposes a power system state estimation algorithm in the presence of irregular sensor sampling and random communication delays. Our state estimator incorporates Phasor Measurement Units (PMU) and SCADA measurements. We use an Extended Kalman filter based algorithm for time alignment of measurements and state variables. Time stamps are assumed for PMU, SCADA and state estimation. Application of the proposed algorithm is illustrated for hourly/daily load/generation variations on two test examples: 14-bus and 118-bus.

SRF Theory Developments from the Center for Bright Beams

A. Pack and M. K. Transtrum (et al.)

Information Geometry Approach to Verification of Dynamic Models in Power Systems

M. K. Transtrum (et al.)

The paper describes a new class of system identification procedures that are tailored to electric power systems, in particular to synchronous generators (SGs) and other dynamic components. Our procedure builds on computational advances in differential geometry, and offers a new, global characterization of challenges frequently encountered in system identification of electric power systems. The approach also benefits from increasing availability of high-quality measurements. While the proposed procedure is illustrated on SG example in a multi-machine benchmark (IEEE 14-bus and real-world 441-bus power systems), it is equally applicable to identification of other system components, such as loads.

Information geometry for model identification and parameter estimation in renewable energy – DFIG plant case

Mark K. Transtrum (et al.)

This study describes a new class of system identification procedures, tailored to electric power systems with renewable resources. The procedure described here builds on computational advances in differential geometry, and offers a new, global, and intrinsic characterisation of challenges in data-derived identification of electric power systems. The approach benefits from increased availability of high-quality measurements. The procedure is illustrated on the multi-machine benchmark example of IEEE 14-bus system with renewable resources, but it is equally applicable to identification of other components and systems (e.g. dynamic loads). The authors consider doubly-fed induction generators (DFIG) operating in a wind farm with system level proportional–integral controllers.

Information geometry for model reduction of dynamic loads in power systems

Mark K. Transtrum (et al.)

Load modeling has been extensively studied in power systems. The problem is intrinsically hard, as a simple description is sought for a large collection of heterogeneous physical devices. One aspect of model simplification has to do with the number of parameters needed to describe a dynamic load. With the rich tapestry of methods proposed in the literature as a backdrop, this paper introduces a new approach to simplify the load models and estimate the parameters. Our method is based on information geometry which combines information theory with computational differential geometry to derive global estimation results and shed a new light on difficulties commonly encountered when fitting widely used models to the measurement data. The results are compared with the literature using simulations on the IEEE 14 bus benchmark system.

Mechanisms of in vivo ribosome maintenance change in response to nutrient signals

Andrew D. Mathis, Bradley C. Naylor, Richard H. Carson, Eric Evans, Justin Harwell, Jared Knect, Eric Hexem, Mark K. Transtrum, Benjamin T. Bikman, and John C. Price (et al.)

Control of protein homeostasis is fundamental to the health and longevity of all organisms. Because the rate of protein synthesis by ribosomes is a central control point in this process, regulation and maintenance of ribosome function could have amplified importance in the overall regulatory circuit. Indeed, ribosomal defects are commonly associated with loss of protein homeostasis, aging and disease, whereas improved protein homeostasis, implying optimal ribosomal function, is associated with disease resistance and increased lifespan. To maintain a high quality ribosome population within the cell, dysfunctional ribosomes are targeted for autophagic degradation. It is not known if complete degradation is the only mechanism for eukaryotic ribosome maintenance or if they might also be repaired by replacement of defective components. We used stable-isotope feeding and protein mass-spectrometry to measure the kinetics of turnover of ribosomal RNA (rRNA) and 71 ribosomal proteins (r-proteins) in mice. The results indicate that exchange of individual proteins and whole ribosome degradation both contribute to ribosome maintenance in vivo. In general, peripheral r-proteins and those with more direct roles in peptide-bond formation are replaced multiple times during the lifespan of the assembled structure, presumably by exchange with a free cytoplasmic pool, whereas the majority of r-proteins are stably incorporated for the lifetime of the ribosome. Dietary signals impact the rates of both new ribosome assembly and component exchange. Signal-specific modulation of ribosomal repair and degradation could provide a mechanistic link in the frequently observed associations among diminished rates of protein synthesis, increased autophagy, and greater longevity.

Theoretical estimates of maximum fields in superconducting resonant radio frequency cavities: stability theory, disorder, and laminates

Mark K. Transtrum (et al.)

Theoretical limits to the performance of superconductors in high magnetic fields parallel to their surfaces are of key relevance to current and future accelerating cavities, especially those made of new higher-Tc materials such as Nb3Sn, NbN, and MgB2. Indeed, beyond the so-called superheating field , flux will spontaneously penetrate even a perfect superconducting surface and ruin the performance. We present intuitive arguments and simple estimates for , and combine them with our previous rigorous calculations, which we summarize. We briefly discuss experimental measurements of the superheating field, comparing to our estimates. We explore the effects of materials anisotropy and the danger of disorder in nucleating vortex entry. Will we need to control surface orientation in the layered compound MgB2? Can we estimate theoretically whether dirt and defects make these new materials fundamentally more challenging to optimize than niobium? Finally, we discuss and analyze recent proposals to use thin superconducting layers or laminates to enhance the performance of superconducting cavities. Flux entering a laminate can lead to so-called pancake vortices; we consider the physics of the dislocation motion and potential re-annihilation or stabilization of these vortices after their entry.

The Limitations of Model-Based Experimental Design and Parameter Estimation in Sloppy Systems

Andrew White, Malachi Tolman, and Mark K. Transtrum (et al.)

We explore the relationship among experimental design, parameter estimation, and systematic error in sloppy models. We show that the approximate nature of mathematical models poses challenges for experimental design in sloppy models. In many models of complex biological processes it is unknown what are the relevant physical mechanisms that must be included to explain system behaviors. As a consequence, models are often overly complex, with many practically unidentifiable parameters. Furthermore, which mechanisms are relevant/irrelevant vary among experiments. By selecting complementary experiments, experimental design may inadvertently make details that were ommitted from the model become relevant. When this occurs, the model will have a large systematic error and fail to give a good fit to the data. We use a simple hyper-model of model error to quantify a model’s discrepancy and apply it to two models of complex biological processes (EGFR signaling and DNA repair) with optimally selected experiments. We find that although parameters may be accurately estimated, the discrepancy in the model renders it less predictive than it was in the sloppy regime where systematic error is small. We introduce the concept of a sloppy system–a sequence of models of increasing complexity that become sloppy in the limit of microscopic accuracy. We explore the limits of accurate parameter estimation in sloppy systems and argue that identifying underlying mechanisms controlling system behavior is better approached by considering a hierarchy of models of varying detail rather than focusing on parameter estimation in a single model.

Ginzburg-Landau theory of the superheating field anisotropy of layered superconductors

Mark K. Transtrum (et al.)

We investigate the effects of material anisotropy on the superheating field of layered superconductors. We provide an intuitive argument both for the existence of a superheating field, and its dependence on anisotropy, for κ=λ/ξ (the ratio of magnetic to superconducting healing lengths) both large and small. On the one hand, the combination of our estimates with published results using a two-gap model for MgB2 suggests high anisotropy of the superheating field near zero temperature. On the other hand, within Ginzburg-Landau theory for a single gap, we see that the superheating field shows significant anisotropy only when the crystal anisotropy is large and the Ginzburg-Landau parameter κ is small. We then conclude that only small anisotropies in the superheating field are expected for typical unconventional superconductors near the critical temperature. Using a generalized form of Ginzburg Landau theory, we do a quantitative calculation for the anisotropic superheating field by mapping the problem to the isotropic case, and present a phase diagram in terms of anisotropy and κ, showing type I, type II, or mixed behavior (within Ginzburg-Landau theory), and regions where each asymptotic solution is expected. We estimate anisotropies for a number of different materials, and discuss the importance of these results for radio-frequency cavities for particle accelerators.

Measurement-Directed Reduction of Dynamic Models in Power Systems

Mark K. Transtrum (et al.)

The paper describes a new model reduction procedure tailored to power systems. It uses measurement data to devise a family of reduced order nonlinear models while retaining physical interpretability of parameters and equations. The Manifold Boundary Approximation Method (MBAM) uses the Fisher Information Matrix calculated from measurements to identify the least relevant parameter combination in the original model. Next, it numerically constructs a geodesic on the corresponding statistical manifold originating from the initial parameters in the least relevant parameter direction until a manifold boundary is found. MBAM then identifies a limiting approximation in the mathematical form of the model and removes one parameter combination. The simplified model is recalibrated by fitting its behavior to that of the original model, and the process is repeated as appropriate. MBAM is demonstrated on the example of a synchronous generator (SG), which has been treated extensively in the literature. Implications of the proposed model reduction procedure on large power system models are illustrated on a 441-bus, 72-SG dynamical model.

Dynamic Model Estimation for Power System Areas from Boundary Measurements

Mark K. Transtrum (et al.)

The paper describes a REI-based procedure for estimating parameters of a dynamic model from measurements in the boundary buses/branches. Parameter identification of equivalent synchronous generators in fictitious buses is performed by Weighted Least Square (WLS) nonlinear optimization to minimize the difference between online measurements and transient responses of reduced power system.

Information geometry for model verification in energy systems

Mark K. Transtrum (et al.)

The paper describes a new class of system identification procedures that are tailored to electric power systems with renewable sources. Our procedure builds on computational advances in differential geometry, and offers a new, global, intrinsic characterization of challenges frequently encountered in system identification of electric power systems. The approach also benefits from increased availability of high-quality measurements. While the proposed procedure is illustrated on renewable source IEEE 14-bus based example in a multi-machine benchmark power system, it is equally applicable to identification of other system components (for example, dynamic loads).

Bridging Mechanistic and Phenomenological Models of Complex Biological Systems

Mark K. Transtrum (et al.)

The inherent complexity of biological systems gives rise to complicated mechanistic models with a large number of parameters. On the other hand, the collective behavior of these systems can often be characterized by a relatively small number of phenomenological parameters. We use the Manifold Boundary Approximation Method (MBAM) as a tool for deriving simple phenomenological models from complicated mechanistic models. The resulting models are not black boxes, but remain expressed in terms of the microscopic parameters. In this way, we explicitly connect the macroscopic and microscopic descriptions, characterize the equivalence class of distinct systems exhibiting the same range of collective behavior, and identify the combinations of components that function as tunable control knobs for the behavior. We demonstrate the procedure for adaptation behavior exhibited by the EGFR pathway. From a 48 parameter mechanistic model, the system can be effectively described by a single adaptation parameter τ characterizing the ratio of time scales for the initial response and recovery time of the system which can in turn be expressed as a combination of microscopic reaction rates, Michaelis-Menten constants, and biochemical concentrations. The situation is not unlike modeling in physics in which microscopically complex processes can often be renormalized into simple phenomenological models with only a few effective parameters. The proposed method additionally provides a mechanistic explanation for non-universal features of the behavior.

Enzyme-catalyzed and binding reaction kinetics determined by titration calorimetry

Lee D. Hansen and Mark K. Transtrum (et al.)

Background

Isothermal calorimetry allows monitoring of reaction rates via direct measurement of the rate of heat produced by the reaction. Calorimetry is one of very few techniques that can be used to measure rates without taking a derivative of the primary data. Because heat is a universal indicator of chemical reactions, calorimetry can be used to measure kinetics in opaque solutions, suspensions, and multiple phase systems, and does not require chemical labeling. The only significant limitation of calorimetry for kinetic measurements is that the time constant of the reaction must be greater than the time constant of the calorimeter which can range from a few seconds to a few minutes. Calorimetry has the unique ability to provide both kinetic and thermodynamic data.

Scope of Review

This article describes the calorimetric methodology for determining reaction kinetics and reviews examples from recent literature that demonstrate applications of titration calorimetry to determine kinetics of enzyme-catalyzed and ligand binding reactions.

Major Conclusions

A complete model for the temperature dependence of enzyme activity is presented. A previous method commonly used for blank corrections in determinations of equilibrium constants and enthalpy changes for binding reactions is shown to be subject to significant systematic error.

General Significance

Methods for determination of the kinetics of enzyme-catalyzed reactions and for simultaneous determination of thermodynamics and kinetics of ligand binding reactions are reviewed. This article is part of a Special Issue entitled Microcalorimetry in the BioSciences - Principles and Applications, edited by Fadi Bou-Abdallah.

Shielding Superconductors with Thin Films as Applied to rf Cavities for Particle Accelerators

Mark K. Transtrum (et al.)

Determining the optimal arrangement of superconducting layers to withstand large-amplitude ac magnetic fields is important for certain applications such as superconducting radio-frequency cavities. In this paper, we evaluate the shielding potential of the superconducting-film–insulating-film–superconductor (SIS ′ ) structure, a configuration that could provide benefits in screening large ac magnetic fields. After establishing that, for high-frequency magnetic fields, flux penetration must be avoided, the superheating field of the structure is calculated in the London limit both numerically and, for thin films, analytically. For intermediate film thicknesses and realistic material parameters, we also solve numerically the Ginzburg-Landau equations. It is shown that a small enhancement of the superheating field is possible, on the order of a few percent, for the SIS′ structure relative to a bulk superconductor of the film material, if the materials and thicknesses are chosen appropriately.

Sloppiness and the Geometry of Parameter Space

Mark K. Transtrum (et al.)

When modeling complex biological systems, exploring parameter space is critical, because parameter values are typically poorly known a priori. This exploration can be challenging, because parameter space often has high dimension and complex structure. Recent work, however, has revealed universal structure in parameter space of models for nonlinear systems. In particular, models are often sloppy, with strong parameter correlations and an exponential range of parameter sensitivities. Here we review the evidence for universal sloppiness and its implications for parameter fitting, model prediction, and experimental design. In principle, one can transform parameters to alleviate sloppiness, but a parameterization-independent information geometry perspective reveals deeper universal structure. We thus also review the recent insights offered by information geometry, particularly in regard to sloppiness and numerical methods.

A unified view of Balanced Truncation and Singular Perturbation Approximations

Mark K. Transtrum and Sean C. Warnick (et al.)

This paper demonstrates that both Balanced Truncation and Balanced Singular Perturbation Approximations can be viewed as limiting approximations of the same parameterization of Linear Time Invariant (LTI) systems. First, we introduce a specific parameterization of LTI systems that distinguishes dynamic and structural parameters. Next, we apply the Model Boundary Approximation Method (MBAM) [1] to particular parameters to achieve different approximations. This unified view of these popular model reduction techniques, which themselves can result in quite different approximations, illustrates that each approximation corresponds to a particular boundary element on a manifold, the “model manifold,” which is associated with the specific choice of model parameterization and is embedded in a sample space of measured outputs.

Perspective: Sloppiness and emergent theories in physics, biology, and beyond

Mark K. Transtrum (et al.)

Large scale models of physical phenomena demand the development of new statistical and computational tools in order to be effective. Many such models are “sloppy,” i.e., exhibit behavior controlled by a relatively small number of parameter combinations. We review an information theoretic framework for analyzing sloppy models. This formalism is based on the Fisher information matrix, which is interpreted as a Riemannian metric on a parameterized space of models. Distance in this space is a measure of how distinguishable two models are based on their predictions. Sloppy model manifolds are bounded with a hierarchy of widths and extrinsic curvatures. The manifold boundary approximation can extract the simple, hidden theory from complicated sloppy models. We attribute the success of simple effective models in physics as likewise emerging from complicated processes exhibiting a low effective dimensionality. We discuss the ramifications and consequences of sloppy models for biochemistry and science more generally. We suggest that the reason our complex world is understandable is due to the same fundamental reason: simple theories of macroscopic behavior are hidden inside complicated microscopic processes.

Enzyme kinetics determined by single-injection isothermal titration calorimetry

Mark K. Transtrum and Lee D. Hansen (et al.)

The purposes of this paper are (a) to examine the effect of calorimeter time constant (τ) on heat rate data from a single enzyme injection into substrate in an isothermal titration calorimeter (ITC), (b) to provide information that can be used to predict the optimum experimental conditions for determining the rate constant (k₂), Michaelis constant (K_M), and enthalpy change of the reaction (Δ_RH), and (c) to describe methods for evaluating these parameters. We find that K_M, k₂ and Δ_RH can be accurately estimated without correcting for the calorimeter time constant, τ, if (k₂E/K_M), where E is the total active enzyme concentration, is between 0.1/τ and 1/τ and the reaction goes to at least 99% completion. If experimental conditions are outside this domain and no correction is made for τ, errors in the inferred parameters quickly become unreasonable. A method for fitting single-injection data to the Michaelis–Menten or Briggs–Haldane model to simultaneously evaluate K_M, k₂, Δ_RH, and τ is described and validated with experimental data. All four of these parameters can be accurately inferred provided the reaction time constant (k₂E/K_M) is larger than 1/τ and the data include enzyme saturated conditions.

Model Reduction by Manifold Boundaries

Mark K. Transtrum (et al.)

Understanding the collective behavior of complex systems from their basic components is a difficult yet fundamental problem in science. Existing model reduction techniques are either applicable under limited circumstances or produce “black boxes” disconnected from the microscopic physics. We propose a new approach by translating the model reduction problem for an arbitrary statistical model into a geometric problem of constructing a low-dimensional, submanifold approximation to a high-dimensional manifold. When models are overly complex, we use the observation that the model manifold is bounded with a hierarchy of widths and propose using the boundaries as submanifold approximations. We refer to this approach as the manifold boundary approximation method. We apply this method to several models, including a sum of exponentials, a dynamical systems model of protein signaling, and a generalized Ising model. By focusing on parameters rather than physical degrees of freedom, the approach unifies many other model reduction techniques, such as singular limits, equilibrium approximations, and the renormalization group, while expanding the domain of tractable models. The method produces a series of approximations that decrease the complexity of the model and reveal how microscopic parameters are systematically “compressed” into a few macroscopic degrees of freedom, effectively building a bridge between the microscopic and the macroscopic descriptions.

Parameter Space Compression Underlies Emergent Theories and Predictive Models

Mark K. Transtrum (et al.)

The microscopically complicated real world exhibits behavior that often yields to simple yet quantitatively accurate descriptions. Predictions are possible despite large uncertainties in microscopic parameters, both in physics and in multiparameter models in other areas of science. We connect the two by analyzing parameter sensitivities in a prototypical continuum theory (diffusion) and at a self-similar critical point (the Ising model). We trace the emergence of an effective theory for long-scale observables to a compression of the parameter space quantified by the eigenvalues of the Fisher Information Matrix. A similar compression appears ubiquitously in models taken from diverse areas of science, suggesting that the parameter space structure underlying effective continuum and universal theories in physics also permits predictive modeling more generally.

Use of the LQ model with large fraction sizes results in underestimation of isoeffect doses

Mark K. Transtrum (et al.)

Purpose To test the appropriateness of the linear-quadratic (LQ) model to describe survival of jejunal crypt clonogens after split doses with variable (small 1–6Gy, large 8–13Gy) first dose, as a model of its appropriateness for both small and large fraction sizes. Methods C3Hf/KamLaw mice were exposed to whole body irradiation using 300kVp X-rays at a dose rate of 1.84Gy/min, and the number of viable jejunal crypts was determined using the microcolony assay. 14Gy total dose was split into unequal first and second fractions separated by 4h. Data were analyzed using the LQ model, the lethal potentially lethal (LPL) model, and a repair-saturation (RS) model. Results Cell kill was greater in the group receiving the larger fraction first, creating an asymmetry in the plot of survival vs size of first dose, as opposed to the prediction of the LQ model of a symmetric response. There was a significant difference in the estimated βs (higher β after larger first doses), but no significant difference in the αs, when large doses were given first vs small doses first. This difference results in underestimation (based on present data by approximately 8%) of isoeffect doses using LQ model parameters based on small fraction sizes. While the LPL model also predicted a symmetric response inconsistent with the data, the RS model results were consistent with the observed asymmetry. Conclusion The LQ model underestimates doses for isoeffective crypt-cell survival with large fraction sizes (in the present setting, >9Gy).

Structural susceptibility and separation of time scales in the van der Pol oscillator

Mark K. Transtrum (et al.)

We use an extension of the van der Pol oscillator as an example of a system with multiple time scales to study the susceptibility of its trajectory to polynomial perturbations in the dynamics. A striking feature of many nonlinear, multiparameter models is an apparently inherent insensitivity to large-magnitude variations in certain linear combinations of parameters. This phenomenon of "sloppiness" is quantified by calculating the eigenvalues of the Hessian matrix of the least-squares cost function. These typically span many orders of magnitude. The van der Pol system is no exception: Perturbations in its dynamics show that most directions in parameter space weakly affect the limit cycle, whereas only a few directions are stiff. With this study, we show that separating the time scales in the van der Pol system leads to a further separation of eigenvalues. Parameter combinations which perturb the slow manifold are stiffer and those which solely affect the jumps in the dynamics are sloppier.

Optimal experiment selection for parameter estimation in biological differential equation models

Mark K Transtrum (et al.)

We show that by an appropriate choice of experiments, one can, in principle, efficiently and accurately estimate all the parameters of gene regulatory network. In addition, we demonstrate that appropriate experiment selection can also allow one to restrict model predictions without constraining the parameters using many fewer experiments. We suggest that predicting model behaviors and inferring parameters represent two different approaches to model calibration with different requirements on data and experimental cost.

Comment on “Sloppy models, parameter uncertainty, and the role of experimental design”

Mark K. Transtrum (et al.)

We explain that part of the reduction in the parameter uncertainties in the computations of Apgar et al. (Mol. Biosyst. 2010, 6, 1890–900) is due to a greatly increased number of effective data points.

Superheating field of superconductors within Ginzburg-Landau theory

Mark K. Transtrum (et al.)

We study the superheating field of a bulk superconductor within Ginzburg-Landau theory, which is valid near the critical temperature. We calculate, as functions of the Ginzburg-Landau parameter., the superheating field H(sh) and the critical momentum k(c) characterizing the wavelength of the instability of the Meissner state to flux penetration. By mapping the two-dimensional linear stability theory into a one-dimensional eigenfunction problem for an ordinary differential equation, we solve the problem numerically. We demonstrate agreement between the numerics and analytics, and show convergence to the known results at both small and large.. We discuss the implications of the results for superconducting rf cavities used in particle accelerators.

Geometry of nonlinear least squares with applications to sloppy models and optimization

Mark K. Transtrum (et al.)

Parameter estimation by nonlinear least-squares minimization is a common problem that has an elegant geometric interpretation: the possible parameter values of a model induce a manifold within the space of data predictions. The minimization problem is then to find the point on the manifold closest to the experimental data. We show that the model manifolds of a large class of models, known as sloppy models, have many universal features; they are characterized by a geometric series of widths, extrinsic curvatures, and parameter-effect curvatures, which we describe as a hyper-ribbon. A number of common difficulties in optimizing least-squares problems are due to this common geometric structure. First, algorithms tend to run into the boundaries of the model manifold, causing parameters to diverge or become unphysical before they have been optimized. We introduce the model graph as an extension of the model manifold to remedy this problem. We argue that appropriate priors can remove the boundaries and further improve the convergence rates. We show that typical fits will have many evaporated parameters unless the data are very accurately known. Second, "bare" model parameters are usually ill-suited to describing model behavior; cost contours in parameter space tend to form hierarchies of plateaus and long narrow canyons. Geometrically, we understand this inconvenient parametrization as an extremely skewed coordinate basis and show that it induces a large parameter-effect curvature on the manifold. By constructing alternative coordinates based on geodesic motion, we show that these long narrow canyons are transformed in many cases into a single quadratic, isotropic basin. We interpret the modified Gauss-Newton and Levenberg-Marquardt fitting algorithms as an Euler approximation to geodesic motion in these natural coordinates on the model manifold and the model graph, respectively. By adding a geodesic acceleration adjustment to these algorithms, we alleviate the difficulties from parameter-effect curvature, improving both efficiency and success rates at finding good fits.

Riemann manifold Langevin and Hamiltonian Monte Carlo methods

Mark K. Transtrum (et al.)

The paper proposes Metropolis adjusted Langevin and Hamiltonian Monte Carlo sampling methods defined on the Riemann manifold to resolve the shortcomings of existing Monte Carlo algorithms when sampling from target densities that may be high dimensional and exhibit strong correlations. The methods provide fully automated adaptation mechanisms that circumvent the costly pilot runs that are required to tune proposal densities for Metropolis–Hastings or indeed Hamiltonian Monte Carlo and Metropolis adjusted Langevin algorithms. This allows for highly efficient sampling even in very high dimensions where different scalings may be required for the transient and stationary phases of the Markov chain. The methodology proposed exploits the Riemann geometry of the parameter space of statistical models and thus automatically adapts to the local structure when simulating paths across this manifold, providing highly efficient convergence and exploration of the target density. The performance of these Riemann manifold Monte Carlo methods is rigorously assessed by performing inference on logistic regression models, log-Gaussian Cox point processes, stochastic volatility models and Bayesian estimation of dynamic systems described by non-linear differential equations. Substantial improvements in the time-normalized effective sample size are reported when compared with alternative sampling approaches. MATLAB code that is available from http://www.ucl.ac.uk/statistics/research/rmhmc allows replication of all the results reported.

Why are Nonlinear Fits to Data so Challenging?

Mark K. Transtrum (et al.)

Fitting model parameters to experimental data is a common yet often challenging task, especially if the model contains many parameters. Typically, algorithms get lost in regions of parameter space in which the model is unresponsive to changes in parameters, and one is left to make adjustments by hand. We explain this difficulty by interpreting the fitting process as a generalized interpolation procedure. By considering the manifold of all model predictions in data space, we find that cross sections have a hierarchy of widths and are typically very narrow. Algorithms become stuck as they move near the boundaries. We observe that the model manifold, in addition to being tightly bounded, has low extrinsic curvature, leading to the use of geodesics in the fitting process. We improve the convergence of the Levenberg-Marquardt algorithm by adding geodesic acceleration to the usual step.

An Analytic Iterative Approach to Solving the Time-Independent Schrodinger Equation

Chad Junkermeier, Mark Transtrum, and Manuel Berrondo

In this article, we introduce a simple analytic method for obtaining approximate solutions of the Schrodinger equation for a wide range of potentials in one- and two-dimensions. We define an operator, called the iteration operator, which will be used to solve for the lowest order state(s) of a system. The method is simple in that it does not require the computation of any integrals in order to obtain a solution. We use this method on several potentials which are well understood or even exactly solvable in order to demonstrate the strengths and weaknesses of this method. (C) 2008 Wiley Periodicals, Inc. Int J Quantum Chem 109: 982-998, 2009

Beyond the quantum in Snyder space

J. F. S. Van Huele and M. K. Transtrum

A perturbative approach to Snyder space with applications

Mark K. Transtrum and Jean-François S. Van Huele

In this paper, we present a method for studying systems in the modified formulation of quantum mechanics known as Snyder space, proposed by Snyder (1947 Phys. Rev. 71 38-41). Snyder space predicts a modified commutation algebra for position and momentum operators. The method described in this paper introduces operators satisfying the canonical commutation relations and relates them to the position and momentum operators of Snyder space, effectively mapping a problem in Snyder space into a similar problem in standard quantum mechanics. The method is applied to the simple harmonic oscillator (SHO) in one and two dimensions as well as to the one-dimensional infinite square well. The energy spectra are calculated perturbatively for the SHO. We also find an exact spectrum for the one-dimensional infinite square well potential. These results are shown to agree with similar results found elsewhere in the literature.

Commutation relations for functions of operators

Mark K. Transtrum and Jean-François S. Van Huele

We derive an expression for the commutator of functions of operators with constant commutations relations in terms of the partial derivatives of these functions. This result extends the well-known commutation relation between one operator and a function of another operator. We discuss the range of applicability of the formula with examples in quantum mechanics. (C) 2005 American Institute of Physics.

Explainable machine learning models for classifying reactions within crowd noise during men's collegiate basketball games

Mitchell C. Cutler, Jason Bickmore, Mark K. Transtrum, Katrina Pedersen, Shannon Proksch, and Kent Gee (et al.)

Crowds at collegiate basketball games react acoustically to events on the court in many ways, including applauding, chanting, cheering, and making distracting noises. Acoustic features can be extracted from recordings of crowds at basketball games to train machine learning models to classify crowd reactions. Such models may help identify crowd mood, which could help players secure fair contracts, venues refine fan experience, and safety personnel improve emergency response services or to minimize conflict in policing. By exposing the key features in these models, feature selection highlights physical insights about crowd noise, reduces computational costs, and often improves model performance. Feature selection is performed using random forests and least absolute shrinkage and selection operator logistic regression to identify the most useful acoustic features for identifying and classifying crowd reactions. The importance of including short-term feature temporal histories in the feature vector is also evaluated. Features related to specific 1/3-octave band shapes, sound level, and tonality are highly relevant for classifying crowd reactions. Additionally, the inclusion of feature temporal histories can increase classifier accuracies by up to 12%. Interestingly, some features are better predictors of future crowd reactions than current reactions. Reduced feature sets are human-interpretable on a case-by-case basis for the crowd reactions they predict.

Time-dependent Ginzburg-Landau framework for sample-specific simulation of superconductors for radio-frequency applications

Aiden V. Harbick and Mark K. Transtrum

Modern superconducting radio frequency (SRF) applications demand precise control over material properties across multiple length scales—from microscopic composition, to mesoscopic defect structures, to macroscopic cavity geometry. We present a time-dependent Ginzburg-Landau (TDGL) framework that incorporates spatially varying parameters derived from experimental measurements and ab initio calculations, enabling realistic, sample-specific simulations. As a demonstration, we model Sn-deficient islands in Nb₃Sn and calculate the field at which vortex nucleation first occurs for various defect configurations. These thresholds serve as a predictive tool for identifying defects likely to degrade SRF cavity performance. We then simulate the resulting dissipation and show how aggregate contributions from multiple small defects can reproduce trends consistent with high-field 𝑄-slope behavior observed experimentally. Our results offer a pathway for connecting microscopic defect properties to macroscopic SRF performance using a computationally efficient mesoscopic model.

Tailoring and verification of dynamic load models in power system studies: the WECC load example

Mark K. Transtrum (et al.)

Load modeling is a primary activity in deriving verifiable models of power systems. It is often argued that the uncertainty in load models exceeds that of other components by a wide margin. The problem is intrinsically challenging, as the acceptable solution consists of many heterogeneous and even disparate physical components. The number of parameters needed to describe a composite dynamic load captures one quantitative aspect of model simplification. This paper uses information geometry as the main tool in a two-step process–model simplification followed by parameter determination. The method offers global results in parameter estimation and quantifies the common challenges in fitting standard models to measurement data. We use a very detailed WECC composite load model embedded in the real world 441-bus benchmark system to illustrate the procedure and provide recommendations.

Quantitative and Kinetic Proteomics Reveal ApoE Isoform-dependent Proteostasis Adaptations in Mouse Brain

Nathan R. Zuniga, Noah E. Earls, Jared M. Elison, Benjamin S. Jones, Ethan G. Smith, Noah G. Moran, Gerome M. Romero, Chad D. Hyer, Kimberly B. Wagstaff, Haifa M. Almughamsi, Mark K. Transtrum, and John C. Price (et al.)

Apolipoprotein E (ApoE) polymorphisms modify the risk of Alzheimer’s disease with ApoE4 strongly increasing and ApoE2 modestly decreasing risk relative to the control ApoE3. To investigate how ApoE isoforms alter risk, we measured changes in proteome homeostasis in transgenic mice expressing a human ApoE gene (isoform 2, 3, or 4). The regulation of each protein’s homeostasis is observed by measuring turnover rate and abundance for that protein. We identified 4849 proteins and tested for ApoE isoform-dependent changes in the homeostatic regulation of ~2700 ontologies. In the brain, we found that ApoE4 and ApoE2 both lead to modified regulation of mitochondrial membrane proteins relative to the wild-type control ApoE3. In ApoE4 mice, lack of cohesion between mitochondrial membrane and matrix proteins suggests that dysregulation of proteasome and autophagy is reducing protein quality. In ApoE2, proteins of the mitochondrial matrix and the membrane, including oxidative phosphorylation complexes, had a similar increase in degradation which suggests coordinated replacement of the entire organelle. In the liver we did not observe these changes suggesting that the ApoE-effect on proteostasis is amplified in the brain relative to other tissues. Our findings underscore the utility of combining protein abundance and turnover rates to decipher proteome regulatory mechanisms and their potential role in biology.

Impact of submicron Nb₃Sn stoichiometric surface defects on high-field superconducting radiofrequency cavity performance

Aiden V. Harbick and Mark K. Transtrum (et al.)

Nb₃Sn film coatings have the potential to drastically improve the accelerating performance of Nb superconducting radiofrequency (SRF) cavities in next-generation linear particle accelerators. Unfortunately, persistent Nb₃Sn stoichiometric material defects formed during fabrication limit the cryogenic operating temperature and accelerating gradient by nucleating magnetic vortices that lead to premature cavity quenching. The SRF community currently lacks a predictive model that can explain the impact of chemical and morphological properties of Nb₃Sn defects on vortex nucleation and maximum accelerating gradients. Both experimental and theoretical studies of the material and superconducting properties of the first 100 nm of Nb₃Sn surfaces are complicated by significant variations in the volume distribution and topography of stoichiometric defects. This work contains a coordinated experimental study with supporting simulations to identify how the observed chemical composition and morphology of certain Sn-rich and Sn-deficient surface defects can impact the SRF performance. Nb₃Sn films were prepared with varying degrees of stoichiometric defects, and the film surface morphologies were characterized. Both Sn-rich and Sn-deficient regions were identified in these samples. For Sn-rich defects, we focus on elemental Sn islands that are partially embedded into the Nb₃Sn film. Using finite element simulations of the time-dependent Ginzburg-Landau equations, we estimate vortex nucleation field thresholds at Sn islands of varying size, geometry, and embedment. We find that these islands can lead to significant SRF performance degradation that could not have been predicted from the ensemble stoichiometry alone. For Sn-deficient Nb₃Sn surfaces, we experimentally identify a periodic nanoscale surface corrugation that likely forms because of extensive Sn loss from the surface. Simulation results show that the surface corrugations contribute to the already substantial drop in the vortex nucleation field of Sn-deficient Nb₃Sn surfaces. This work provides a systematic approach for future studies to further detail the relationship between experimental Nb₃Sn growth conditions, stoichiometric defects, geometry, and vortex nucleation. These findings have technical implications that will help guide improvements to Nb₃Sn fabrication procedures. Our outlined experiment-informed theoretical methods can assist future studies in making additional key insights about Nb₃Sn stoichiometric defects that will help build the next generation of SRF cavities and support related superconducting materials development efforts.

Maximum entropy temperature selection via the equipartition theorem

Jacob R. Nuttall, Tracianne B. Neilsen, and Mark K. Transtrum

Maximum entropy is an approach for obtaining posterior probability distributions of modeling parameters. This approach, based on a cost function that quantifies the data-model mismatch, relies on an estimate of an appropriate temperature. Selection of this ”statistical temperature” is related to estimating the noise covariance. A method for selecting the ”statistical temperature” is derived from analogies with statistical mechanics, including the equipartition theorem. Using the equipartition-theorem estimate, the statistical temperature can be obtained for a single data sample instead of via the ensemble approach used previously. Examples of how the choice of temperature impacts the posterior distributions are shown using a toy model. The examples demonstrate the impact of the choice of the temperature on the resulting posterior probability distributions and the advantages of using the equipartition-theorem approach for selecting the temperature.

Information Geometry in Underwater Acoustics: Tutorial, Case Study, and Outlook

Jay C. Spendlove, Michael C. Mortenson, Tracianne B. Neilsen, and Mark K. Transtrum

This tutorial demonstrates the use of information geometry tools in analyzing environmental parameter sensitivities in underwater acoustics. Sensitivity analyses quantify how well data can constrain model parameters, with application to inverse problems like geoacoustic inversion. A review of examples of parameter sensitivity methods and their application to problems in underwater acoustics is given, roughly grouped into “local” and “non-local” methods. Local methods such as Fisher information and Cramér-Rao bounds have important connections to information geometry. Information Geometry combines the fields of information theory and differential geometry by interpreting a model as a Riemannian manifold, known as the model manifold, that encodes both local and global parameter sensitivities. As an example, 2-dimensional model manifold slices are constructed for the Pekeris waveguide with sediment attenuation, for a vertical array of hydrophones. This example demonstrates how effective, reduced-order models emerge in certain parameter limits, which correspond to boundaries of the model manifold. This example also demonstrates how the global structure of the model manifold influences the local sensitivities quantified by the Fisher information matrix. This paper motivates future work to utilize information geometry methods for experimental design and model reduction applied to more complex modeling scenarios in underwater acoustics.

Learning Seabed Properties and Sensitivities from Ship Noise Spectrograms

Michael C. Mortenson, Tracianne B. Neilsen, and Mark K. Transtrum (et al.)

In this work, deep neural networks are trained on synthetic spectrograms of transiting ships to find properties of the seafloor and uncertainty labels associated with those predictions. The spectrograms are labeled with the values of sediment layer thickness, sound speed, density, and attenuation; and a measure of the parameter information content or sensitivity for each parameter obtained from the Cramér-Rao bound (CRB). The CRB values for a given ocean environment and source-receiver geometry can be calculated and expressed as a relative uncertainty, for each sediment property. To obtain uncertainty labels for each spectrogram, the relative CRB is calculated for ten equally spaced frequencies for each sediment parameter of interest. These relative CRB values are divided into uncertainty classes, and the mode class across frequency is assigned as the uncertainty label for the spectrogram. The labeled synthetic spectrograms are used to train ResNet-18 networks, which can then tested on measured spectrograms. Comparisons are made between validation performance of networks that are trained to learn only the parameter value labels and those trained to learn both seabed parameter values and sensitivity class. The validation results indicate the potential to predict not only sediment properties but uncertainty in those predictions.

Toward a dynamic national transportation noise map: Modeling temporal variability of spectral traffic noise emission levels

Mylan R. Cook, Kent L. Gee, and Mark. K. Transtrum (et al.)

The National Transportation Noise Map predicts time-averaged road traffic noise across the continental United States (CONUS) based on annual average daily traffic counts. However, traffic noise can vary greatly with time. This paper outlines a method for predicting nationwide hourly varying source traffic sound emissions called the Vehicular Reduced-Order Observation-based Model (VROOM). The method incorporates three models that predict temporal variability of traffic volume, predict temporal variability of different traffic classes, and use Traffic Noise Model (TNM) 3.0 equations to give traffic noise emission levels based on vehicle numbers and class mix. Location-specific features are used to predict average class mix across CONUS. VROOM then incorporates dynamic traffic class mix data to obtain dynamic traffic class mix. TNM 3.0 equations then give estimated equivalent sound level emission spectra near roads with up to hourly resolution. Important temporal traffic noise characteristics are modeled, including diurnal traffic patterns, rush hours in urban locations, and weekly and yearly variation. Examples of the temporal variability are depicted and possible types of uncertainties are identified. Altogether, VROOM can be used to map national transportation noise with temporal and spectral variability.

Information geometry analysis example for absolute and relative transmission loss in a shallow ocean

Jay C. Spendlove, Tracianne B. Neilsen, and Mark K. Transtrum

The model manifold, an information geometry tool, is a geometric representation of a model that can quantify the expected information content of modeling parameters. For a normal-mode sound propagation model in a shallow ocean environment, transmission loss (TL) is calculated for a vertical line array and model manifolds are constructed for both absolute and relative TL. For the example presented in this paper, relative TL yields more compact model manifolds with seabed environments that are less statistically distinguishable than manifolds of absolute TL. This example illustrates how model manifolds can be used to improve experimental design for inverse problems.

The training process of many deep networks explores the same low-dimensional manifold

Mark K. Transtrum (et al.)

We develop information-geometric techniques to analyze the trajectories of the predictions of deep networks during training. By examining the underlying highdimensional probabilistic models, we reveal that the training process explores an effectively low-dimensional manifold. Networks with a wide range of architectures, sizes, trained using different optimization methods, regularization techniques, data augmentation techniques, and weight initializations lie on the same manifold in the prediction space. We study the details of this manifold to find that networks with different architectures follow distinguishable trajectories, but other factors have a minimal influence; larger networks train along a similar manifold as that of smaller networks, just faster; and networks initialized at very different parts of the prediction space converge to the solution along a similar manifold.

Data-driven decomposition of crowd noise from indoor sporting events

Mitchell C. Cutler, Mylan R. Cook, Mark K. Transtrum, and Kent L. Gee

Separating crowd responses from raw acoustic signals at sporting events is challenging because recordings contain complex combinations of acoustic sources, including crowd noise, music, individual voices, and public address (PA) systems. This paper presents a data-driven decomposition of recordings of 30 collegiate sporting events. The decomposition uses machine-learning methods to find three principal spectral shapes that separate various acoustic sources. First, the distributions of recorded one-half-second equivalent continuous sound levels from men's and women's basketball and volleyball games are analyzed with regard to crowd size and venue. Using 24 one-third-octave bands between 50 Hz and 10 kHz, spectrograms from each type of game are then analyzed. Based on principal component analysis, 87.5% of the spectral variation in the signals can be represented with three principal components, regardless of sport, venue, or crowd composition. Using the resulting three-dimensional component coefficient representation, a Gaussian mixture model clustering analysis finds nine different clusters. These clusters separate audibly distinct signals and represent various combinations of acoustic sources, including crowd noise, music, individual voices, and the PA system.

Sloppy model analysis identifies bifurcation parameters without normal form analysis

Christian N. K. Anderson and Mark K. Transtrum

Bifurcation phenomena are common in multidimensional multiparameter dynamical systems. Normal form theory suggests that bifurcations are driven by relatively few combinations of parameters. Models of complex systems, however, rarely appear in normal form, and bifurcations are controlled by nonlinear combinations of the bare parameters of differential equations. Discovering reparameterizations to transform complex equations into a normal form is often very difficult, and the reparameterization may not even exist in a closed form. Here we show that information geometry and sloppy model analysis using the Fisher information matrix can be used to identify the combination of parameters that control bifurcations. By considering observations on increasingly long timescales, we find those parameters that rapidly characterize the system's topological inhomogeneities, whether the system is in normal form or not. We anticipate that this novel analytical method, which we call time-widening information geometry (TWIG), will be useful in applied network analysis.

Toward a dynamic national transportation noise map: Modeling temporal variability of traffic volume

Mylan R. Cook, Kent L. Gee, and Mark. K. Transtrum (et al.)

The National Transportation Noise Map (NTNM) gives time-averaged traffic noise across the continental United States (CONUS) using annual average daily traffic. However, traffic noise varies significantly with time. This paper outlines the development and utility of a traffic volume model which is part of VROOM, the Vehicular Reduced-Order Observation-based model, which, using hourly traffic volume data from thousands of traffic monitoring stations across CONUS, predicts nationwide hourly varying traffic source noise. Fourier analysis finds daily, weekly, and yearly temporal traffic volume cycles at individual traffic monitoring stations. Then, principal component analysis uses denoised Fourier spectra to find the most widespread cyclic traffic patterns. VROOM uses nine principal components to represent hourly traffic characteristics for any location, encapsulating daily, weekly, and yearly variation. The principal component coefficients are predicted across CONUS using location-specific features. Expected traffic volume model sound level errors—obtained by comparing predicted traffic counts to measured traffic counts—and expected NTNM-like errors, are presented. VROOM errors are typically within a couple of decibels, whereas NTNM-like errors are often inaccurate, even exceeding 10 decibels. This work details the first steps towards creation of a temporally and spectrally variable national transportation noise map.

Recurrence Quantification Analysis of Crowd Sound Dynamics

Kent Gee and Mark Transtrum (et al.)

When multiple individuals interact in a conversation or as part of a large crowd, emergent structures and dynamics arise that are behavioral properties of the interacting group rather than of any individual member of that group. Recent work using traditional signal processing techniques and machine learning has demonstrated that global acoustic data recorded from a crowd at a basketball game can be used to classify emergent crowd behavior in terms of the crowd's purported emotional state. We propose that the description of crowd behavior from such global acoustic data could benefit from nonlinear analysis methods derived from dynamical systems theory. Such methods have been used in recent research applying nonlinear methods to audio data extracted from music and group musical interactions. In this work, we used nonlinear analyses to extract features that are relevant to the behavioral interactions that underlie acoustic signals produced by a crowd attending a sporting event. We propose that recurrence dynamics measured from these audio signals via recurrence quantification analysis (RQA) reflect information about the behavioral dynamics of the crowd itself. We analyze these dynamics from acoustic signals recorded from crowds attending basketball games, and that were manually labeled according to the crowds' emotional state across six categories: angry noise, applause, cheer, distraction noise, positive chant, and negative chant. We show that RQA measures are useful to differentiate the emergent acoustic behavioral dynamics between these categories, and can provide insight into the recurrence patterns that underlie crowd interactions.

Field-dependent nanospin ordering in monolayers of Fe₃O₄ nanoparticles throughout the superparamagnetic blocking transition

Johnathon Rackham, Brittni Pratt, Dalton Griner, Dallin Smith, Yanping Cai, Roger G. Harrison, Mark K. Transtrum, and Karine Chesnel (et al.)

We report on magnetic orderings of nanospins in self-assemblies of Fe₃O₄ nanoparticles (NPs), occurring at various stages of the magnetization process throughout the superparamagnetic (SPM)-blocking transition. Essentially driven by magnetic dipole couplings and by Zeeman interaction with a magnetic field applied out-of-plane, these magnetic orderings include a mix of long-range parallel and antiparallel alignments of nanospins, with the antiparallel correlation being the strongest near the coercive point below the blocking temperature. The magnetic ordering is probed via x-ray resonant magnetic scattering (XRMS), with the x-ray energy tuned to the Fe−L₃ edge and using circular polarized light. By exploiting dichroic effects, a magnetic scattering signal is isolated from the charge scattering signal. We measured the nanospin ordering for two different sizes of NPs, 5 and 11 nm, with blocking temperatures TB of 28 and 170 K, respectively. At 300 K, while the magnetometry data essentially show SPM and absence of hysteresis for both particle sizes, the XRMS data reveal the presence of nonzero (up to 9%) antiparallel ordering when the applied field is released to zero for the 11 nm NPs. These antiparallel correlations are drastically amplified when the NPs are cooled down below TB and reach up to 12% for the 5 nm NPs and 48% for the 11 nm NPs, near the coercive point. The data suggest that the particle size affects the prevalence of the antiparallel correlations over the parallel correlations by a factor ∼1.6 to 3.8 higher when the NP size increases from 5 to 11 nm.

Feature selection for a continental-scale geospatial model of environmental sound levels

Katrina Pedersen, Mark K. Transtrum, and Kent L. Gee (et al.)

Modeling environmental sound levels over continental scales is difficult due to the variety of geospatial environments. Moreover, current continental-scale models depend upon machine learning and therefore face additional challenges due to limited acoustic training data. In previous work, an ensemble of machine learning models was used to predict environmental sound levels in the contiguous United States using a training set composed of 51 geospatial layers (downselected from 120) and acoustic data from 496 geographic sites from Pedersen, Transtrum, Gee, Lympany, James, and Salton [JASA Express Lett. 1(12), 122401 (2021)]. In this paper, the downselection process, which is based on factors such as data quality and inter-feature correlations, is described in further detail. To investigate additional dimensionality reduction, four different feature selection methods are applied to the 51 layers. Leave-one-out median absolute deviation cross-validation errors suggest that the number of geospatial features can be reduced to 15 without significant degradation of the model's predictive error. However, ensemble predictions demonstrate that feature selection results are sensitive to variations in details of the problem formulation and, therefore, should elicit some skepticism. These results suggest that more sophisticated dimensionality reduction techniques are necessary for problems with limited training data and different training and testing distributions.

Enhanced Surface Superconductivity of Niobium by Zirconium Doping

Benjamin L. Francis and Mark K. Transtrum (et al.)

Superconducting radio-frequency (SRF) cavities currently rely on niobium (Nb), and could benefit from a higher-Tc surface, which would enable a higher operating temperature, lower surface resistance, and higher maximum fields. Surface zirconium (Zr) doping is one option for improvement, which has not previously been explored, likely because bulk alloy experiments showed only mild Tc enhancements of 1–2 K relative to Nb. Our ab initio results reveal a more nuanced picture: an ideal bcc Nb-Zr alloy would have Tc over twice that of niobium, but displacements of atoms away from the high-symmetry bcc positions due to the Jahn-Teller-Peierls effect almost completely eliminates this enhancement in typical disordered alloy structures. Ordered Nb-Zr alloy structures, in contrast, are able to avoid these atomic displacements and achieve higher calculated Tc up to a theoretical limit of 17.7 K. Encouraged by this, we tested two deposition methods: a physical-vapor Zr deposition method, which produced Nb-Zr surfaces with Tc values of 13.5 K, and an electrochemical deposition method, which produced surfaces with a possible 16-K Tc. An rf test of the highest-Tc surface showed a mild reduction in BCS surface resistance relative to Nb, demonstrating the potential value of this material for RF devices. Finally, our Ginzburg-Landau theory calculations show that realistic surface doping profiles should be able to reach the maximum rf fields necessary for next-generation applications, such as the ground-breaking LCLS-II accelerator. Considering the advantages of Nb-Zr compared to other candidate materials such as Nb3Sn and Nb-Ti-N, including a simple phase diagram with relatively little sensitivity to composition, and a stable, insulating ZrO2 native oxide, we conclude that Nb-Zr alloy is an excellent candidate for next-generation, high-quality-factor superconducting rf devices.

K-Means Clustering of 51 Geospatial Layers Identified for Use in Continental-Scale Modeling of Outdoor Acoustic Environments

Katrina Pedersen, Mitchell C. Cutler, Mark K. Transtrum, and Kent L. Gee (et al.)

Applying machine learning methods to geographic data provides insights into spatial patterns in the data as well as assists in interpreting and describing environments. This paper investigates the results of k-means clustering applied to 51 geospatial layers, selected and scaled for a model of outdoor acoustic environments, in the continental United States. Silhouette and elbow analyses were performed to identify an appropriate number of clusters (eight). Cluster maps are shown and the clusters are described, using correlations between the geospatial layers and clusters to identify distinguishing characteristics for each cluster. A subclustering analysis is presented in which each of the original eight clusters is further divided into two clusters. Because the clustering analysis used geospatial layers relevant to modeling outdoor acoustics, the geospatially distinct environments corresponding to the clusters may aid in characterizing acoustically distinct environments. Therefore, the clustering analysis can guide data collection for the problem of modeling outdoor acoustic environments by identifying poorly sampled regions of the feature space (i.e., clusters which are not well-represented in the training data).

ZrNb(CO) RF Superconducting Thin Film with High Critical Temperature in the Theoretical Limit

Benjamin Francis and Mark K. Transtrum (et al.)

Superconducting radio-frequency (SRF) resonators are critical components for particle accelerator applications, such as free-electron lasers, and for emerging technologies in quantum computing. Developing advanced materials and their deposition processes to produce RF superconductors that yield n & omega; surface resistances is a key metric for the wider adoption of SRF technology. Here, ZrNb(CO) RF superconducting films with high critical temperatures (T-c) achieved for the first time under ambient pressure are reported. The attainment of a T-c near the theoretical limit for this material without applied pressure is promising for its use in practical applications. A range of T-c, likely arising from Zr doping variation, may allow a tunable superconducting coherence length that lowers the sensitivity to material defects when an ultra-low surface resistance is required. The ZrNb(CO) films are synthesized using a low-temperature (100 - 200 & DEG;C) electrochemical recipe combined with thermal annealing. The phase transformation as a function of annealing temperature and time is optimized by the evaporated Zr-Nb diffusion couples. Through phase control, one avoids hexagonal Zr phases that are equilibrium-stable but degrade T-c. X-ray and electron diffraction combined with photoelectron spectroscopy reveal a system containing cubic & beta;-ZrNb mixed with rocksalt NbC and low-dielectric-loss ZrO2. Proof-of-concept RF performance of ZrNb(CO) on an SRF sample test system is demonstrated. BCS resistance trends lower than reference Nb, while quench fields occur at approximately 35 mT. The results demonstrate the potential of ZrNb(CO) thin films for particle accelerators and other SRF applications.

An app for nationwide dynamic traffic noise prediction

Mylan Cook, Kent Gee, and Mark Transtrum (et al.)

Despite being so pervasive, road traffic noise can be difficult to model and predict on a national scale. Detailed road traffic noise predictions can be made on small geographic scales using the US Federal Highway Administration's Traffic Noise Model (TNM), but TNM becomes infeasible for the typical user on a nationwide scale because of the complexity and computational cost. Incorporating temporal and spectral variability also greatly increases complexity. To address this challenge, physics-based models are made using reported hourly traffic counts at locations across the country together with published traffic trends. Using these models together with TNM equations for spectral source emissions, a streamlined app has been created to efficiently predict traffic noise at roads across the nation with temporal and spectral variability. This app, which presently requires less than 700 MB of stored geospatial data and models, incorporates user inputs such as location, time period, and frequency, and gives predicted spectral levels within seconds.

Accurate broadband gradient estimates enable local sensitivity analysis of ocean acoustic models

Michael C. Mortenson, Tracianne B. Neilsen, and Mark K. Transtrum (et al.)

Sensitivity analysis is a powerful tool for analyzing multi-parameter models. For example, the Fisher information matrix (FIM) and the Cramer-Rao bound (CRB) involve derivatives of a forward model with respect to parameters. However, these derivatives are difficult to estimate in ocean acoustic models. This work presents a frequency-agnostic methodology for accurately estimating numerical derivatives using physics-based parameter preconditioning and Richardson extrapolation. The methodology is validated on a case study of transmission loss in the 50-400Hz band from a range-independent normal mode model for parameters of the sediment. Results demonstrate the utility of this methodology for obtaining Cramer-Rao bound (CRB) related to both model sensitivities and parameter uncertainties, which reveal parameter correlation in the model. This methodology is a general tool that can inform model selection and experimental design for inverse problems in different applications.

Information geometry for multiparameter models: new perspectives on the origin of simplicity

Mark K Transtrum (et al.)

Complex models in physics, biology, economics, and engineering are often sloppy, meaning that the model parameters are not well determined by the model predictions for collective behavior. Many parameter combinations can vary over decades without significant changes in the predictions. This review uses information geometry to explore sloppiness and its deep relation to emergent theories. We introduce the model manifold of predictions, whose coordinates are the model parameters. Its hyperribbon structure explains why only a few parameter combinations matter for the behavior. We review recent rigorous results that connect the hierarchy of hyperribbon widths to approximation theory, and to the smoothness of model predictions under changes of the control variables. We discuss recent geodesic methods to find simpler models on nearby boundaries of the model manifold-emergent theories with fewer parameters that explain the behavior equally well. We discuss a Bayesian prior which optimizes the mutual information between model parameters and experimental data, naturally favoring points on the emergent boundary theories and thus simpler models. We introduce a 'projected maximum likelihood' prior that efficiently approximates this optimal prior, and contrast both to the poor behavior of the traditional Jeffreys prior. We discuss the way the renormalization group coarse-graining in statistical mechanics introduces a flow of the model manifold, and connect stiff and sloppy directions along the model manifold with relevant and irrelevant eigendirections of the renormalization group. Finally, we discuss recently developed 'intensive' embedding methods, allowing one to visualize the predictions of arbitrary probabilistic models as low-dimensional projections of an isometric embedding, and illustrate our method by generating the model manifold of the Ising model.

Toward improving road traffic noise characterization: A reduced-order model for representing hourly traffic volume dynamics

Mylan R. Cook, Kent L. Gee, and Mark K. Transtrum (et al.)

The National Transportation Noise Map predicts time-averaged road traffic noise across the continental United States (CONUS) based on average annual daily traffic counts. However, traffic counts may vary significantly with time. Since traffic noise is correlated with traffic counts, a more detailed temporal representation of traffic noise requires knowledge of the time-varying traffic counts. Each year, the Federal Highway Administration tabulates the hourly traffic counts recorded at more than 5000 traffic monitoring sites across CONUS. Each site records up to 8760 traffic counts corresponding to each hour of the year. The hourly traffic counts can be treated as time-dependent signals upon which signal processing techniques can be applied. First, Fourier analysis is used to find the daily, weekly, and yearly temporal cycles present at each traffic monitoring site. Next, principal component analysis is applied to the peaks in the Fourier spectra. A reduced-order model using only nine principal components represents much of the temporal variability in traffic counts while requiring only 0.1% as many values as the original hourly traffic counts. This reduced-order model can be used in conjunction with sound mapping tools to predict traffic noise on hourly, rather than time-averaged, timescales. [Work supported by U.S. Army SBIR.]

Selecting simple, transferable models with the supremum principle

Cody Petrie, Christian Anderson, Casie Maekawa, Travis Maekawa, and Mark K. Transtrum

We consider how mathematical models enable predictions for conditions that are qualitatively different from the training data. We propose techniques based on information topology to find models that can apply their learning in regimes for which there is no data. The first step is to use the manifold boundary approximation method to construct simple, reduced models of target phenomena in a data-driven way. We consider the set of all such reduced models and use the topological relationships among them to reason about model selection for new, unobserved phenomena. Given minimal models for several target behaviors, we introduce the supremum principle as a criterion for selecting a new, transferable model. The supremal model, i.e., the least upper bound, is the simplest model that reduces to each of the target behaviors. We illustrate how to discover supremal models with several examples; in each case, the supremal model unifies causal mechanisms to transfer successfully to new target domains. We use these examples to motivate a general algorithm that has formal connections to theories of analogical reasoning in cognitive psychology.

Utilizing Nonequilibrium Isotope Enrichments to Dramatically Increase Turnover Measurement Ranges in Single Biopsy Samples from Humans

Bradley C. Naylor, Christian N. K. Anderson, Marcus Hadfield, David H. Parkinson, Austin Ahlstrom, Austin Hannemann, Chad R. Quilling, Kyle J. Cutler, Russell L. Denton, Rebecca S. Burlett, Paul S. Hafen, John. C. Dallon, Mark K. Transtrum, Robert D. Hyldahl, and John C. Price (et al.)

The synthesis of new proteins and the degradation of old proteins in vivo can be quantified in serial samples using metabolic isotope labeling to measure turnover. Because serial biopsies in humans are impractical, we set out to develop a method to calculate the turnover rates of proteins from single human biopsies. This method involved a new metabolic labeling approach and adjustments to the calculations used in previous work to calculate protein turnover. We demonstrate that using a nonequilibrium isotope enrichment strategy avoids the time dependent bias caused by variable lag in label delivery to different tissues observed in traditional metabolic labeling methods. Turnover rates are consistent for the same subject in biopsies from different labeling periods, and turnover rates calculated in this study are consistent with previously reported values. We also demonstrate that by measuring protein turnover we can determine where proteins are synthesized. In human subjects a significant difference in turnover rates differentiated proteins synthesized in the salivary glands versus those imported from the serum. We also provide a data analysis tool, DeuteRater-H, to calculate protein turnover using this nonequilibrium metabolic ²H₂O method.

Bayesian, frequentist, and information geometric approaches to parametric uncertainty quantification of classical empirical interatomic potentials

Yonatan Kurniawan, Cody L. Petrie, Kinamo J. Williams, and Mark K. Transtrum (et al.)

In this paper, we consider the problem of quantifying parametric uncertainty in classical empirical interatomic potentials (IPs) using both Bayesian (Markov Chain Monte Carlo) and frequentist (profile likelihood) methods. We interface these tools with the Open Knowledgebase of Interatomic Models and study three models based on the Lennard-Jones, Morse, and Stillinger-Weber potentials. We confirm that IPs are typically sloppy, i.e., insensitive to coordinated changes in some parameter combinations. Because the inverse problem in such models is ill-conditioned, parameters are unidentifiable. This presents challenges for traditional statistical methods, as we demonstrate and interpret within both Bayesian and frequentist frameworks. We use information geometry to illuminate the underlying cause of this phenomenon and show that IPs have global properties similar to those of sloppy models from fields, such as systems biology, power systems, and critical phenomena. IPs correspond to bounded manifolds with a hierarchy of widths, leading to low effective dimensionality in the model. We show how information geometry can motivate new, natural parameterizations that improve the stability and interpretation of uncertainty quantification analysis and further suggest simplified, less-sloppy models.

Integration of Physics- and Data-Driven Power System Models in Transient Analysis After Major Disturbances

Mark K. Transtrum (et al.)

The article explores the analysis of transient phenomena in large-scale power systems subjected to major disturbances from the aspect of interleaving, coordinating, and refining physics- and data-driven models. Major disturbances can lead to cascading failures and ultimately to the partial power system blackout. Our primary interest is in a framework that would enable coordinated and seamlessly integrated use of the two types of models in engineered systems. Parts of this framework include: 1) optimized compressed sensing, 2) customized finite-dimensional approximations of the Koopman operator, and 3) gray-box integration of physics-driven (equation-based) and data-driven (deep neural network-based) models. The proposed three-stage procedure is applied to the transient stability analysis on the multimachine benchmark example of a 441-bus real-world test system, where the results are shown for a synchronous generator with local measurements in the connection point.

Application of a spectral-based wind noise reduction method to acoustical measurements

Mylan R. Cook, Kent L. Gee, and Mark K. Transtrum (et al.)

Wind-induced microphone self-noise is a non-acoustic signal that may contaminate outdoor acoustical measurements, particularly at low frequencies, even when using a windscreen. A recently developed method [Cook et al., JASA Express Lett. 1, 063602 (2021)] uses the characteristic spectral slope of wind noise in the inertial subrange for screened microphones to automatically classify and reduce wind noise in acoustical measurements in the lower to middling frequency range of human hearing. To explore its uses and limitations, this method is applied to acoustical measurements which include both natural and anthropogenic noise sources. The method can be applied to one-third octave band spectral data with different frequency ranges and sampling intervals. By removing the shorter timescale data at frequencies where wind noise dominates the signal, the longer timescale acoustical environment can be more accurately represented. While considerations should be made about the specific applicability of the method to particular datasets, the wind reduction method allows for simple classification and reduction of wind-noise-contaminated data in large, diverse datasets.

Modeling Live Crowd Emotion Dynamics for State Estimation and Prediction

Brooks A. Butler, Philip E. Paré, Mark K. Transtrum, and Sean Warnick

Crowd violence and the repression of free speech have become increasingly relevant concerns in recent years. This paper considers a new application of crowd control, namely, keeping the public safe during large scale demonstrations by anticipating the evolution of crowd emotion dynamics through state estimation. This paper takes a first step towards solving this problem by formulating a crowd state prediction problem in consideration of recent work involving crowd psychology and opinion modeling. We propose a nonlinear crowd behavior model incorporating parameters of agent personality, opinion, and relative position to simulate crowd emotion dynamics. This model is then linearized and used to build a state observer whose effectiveness is then tested on system outputs from both nonlinear and linearized models. We show that knowing the value of the equilibrium point for the full nonlinear system is a necessary condition for convergence of this class of estimators, but otherwise not much information about the crowd is needed to obtain good estimates. In particular, zero-error convergence is possible even when the estimator erroneously uses nominal or average personality parameters in its model for each member of the crowd.

State Estimation Model Reduction through the Manifold Boundary Approximation Method

Mark K Transtrum (et al.)

This paper presents a procedure for estimating the systems state when considerable Information and Communication Technology (ICT) component outages occur, leaving entire system areas un-observable. For this task, a novel method for analyzing system observability is proposed based on the Manifold Boundary Ap-proximation Method (MBAM). By utilizing information geome-try, MBAM analyzes boundaries of models in data space, thus detecting unidentifiable system parameters and states based on available data. This approach extends local, matrix-based meth-ods to a global perspective, making it capable of detecting both structurally unidentifiable parameters as well as practically uni-dentifiable parameters (i.e., identifiable with low accuracy). Be-yond partitioning identifiable/unidentifiable states, MBAM also reduces the model to remove reference to the unidentifiable state variables. To test this procedure, cyber-physical system (CPS) simulation environments are constructed by co-simulating the physical and cyber system layers.

Piecemeal Reduction of Models of Large Networks

Benjamin L. Francis and Mark K. Transtrum (et al.)

Many systems can be modeled as an intricate network of interacting components. Often the level of detail in the model exceeds the richness of the available data, makes the model difficult to learn, or makes it difficult to interpret. Such models can be improved by reducing their complexity. If a model of a network is very large, it may be desirable to split it into pieces and reduce them separately, recombining them after reduction. Such a distributed procedure would also have other advantages in terms of speed and data privacy. We discuss piecemeal reduction of a model in the context of the Manifold Boundary Approximation Method (MBAM), including its advantages over other reduction methods. MBAM changes the model reduction problem into one of selecting an appropriate element from a partially ordered set (poset) of reduced models. We argue that the prime factorization of this poset provides a natural decomposition of the network for piecemeal model reduction via MBAM. We demonstrate on an example network and show that MBAM finds a reduced model that introduces less bias than similar models with randomly selected reductions.

Validating two geospatial models of continental-scale environmental sound levels

Katrina Pedersen, Mark K. Transtrum, and Kent L. Gee (et al.)

Modeling outdoor environmental sound levels is a challenging problem. This paper reports on a validation study of two continental-scale machine learning models using geospatial layers as inputs and the summer daytime A-weighted L-50 as a validation metric. The first model was developed by the National Park Service while the second was developed by the present authors. Validation errors greater than 20 dBA are observed. Large errors are attributed to limited acoustic training data. Validation environments are geospatially dissimilar to training sites, requiring models to extrapolate beyond their training sets. Results motivate further work in optimal data collection and uncertainty quantification.

Simultaneous Determination of Equilibrium Constants, Enthalpy Changes and Stoichiometries by Titration Calorimetry

Mark K. Transtrum (et al.)

This chapter reviews the history, key developments in instrumentation and data analysis and representative applications of titration calorimetry to the simultaneous determination of equilibrium constants, enthalpy changes and stoichiometries for reactions in solution. Statistical methods for error analysis and optimizing operating conditions are developed and illustrated. Examples of applications of titration calorimetric methods to solve problems in biophysics are presented.

Automatic classification and reduction of wind noise in spectral data

Mylan R. Cook, Kent L. Gee, and Mark K. Transtrum (et al.)

Outdoor acoustic data often include non-acoustic pressures caused by atmospheric turbulence, particularly below a few hundred Hz in frequency, even when using microphone windscreens. This paper describes a method for automatic wind-noise classification and reduction in spectral data without requiring measured wind speeds. The method finds individual frequency bands matching the characteristic decreasing spectral slope of wind noise. Uncontaminated data from several short-timescale spectra can be used to obtain a decontaminated long-timescale spectrum. This method is validated with field-test data and can be applied to large datasets to efficiently find and reduce the negative impact of wind noise contamination.

Examining wind noise reduction effects of windscreens and microphone elevation in outdoor acoustical measurements

Zachary Jones, Mylan R. Cook, Kent L. Gee, and Mark K. Transtrum (et al.)

Jones et al. [J. Acoust. Soc. Am. 146, 2912 (2019)] compared an elevated (1.5 m) acoustical measurement configuration that used a standard commercial windscreen for outdoor measurements with a ground-based configuration with a custom windscreen. That study showed that the ground-based measurement method yielded superior wind noise rejection, presumably due to the larger windscreen and lower wind speeds experienced near the ground. This study further examines those findings by attempting to decouple the effects of windscreens and microphone elevation using measurements at 1.5 m and near the ground with and without windscreens. Simultaneous wind speed measurements at 1.5 m and near the ground were also made for correlation purposes. Results show that the insertion of the custom windscreen reduces wind noise more than placing the microphone near the ground, and that the ground-based setup is again preferable for obtaining broadband outdoor acoustic measurements.

Symbolic Regression for Data-Driven Dynamic Model Refinement in Power Systems

Mark K. Transtrum (et al.)

This paper describes a data-driven symbolic regression identification method tailored to power systems and demonstrated on different synchronous generator (SG) models. In this work, we extend the sparse identification of nonlinear dynamics (SINDy) modeling procedure to include the effects of exogenous signals (measurements), nonlinear trigonometric terms in the library of elements, equality, and boundary constraints of expected solution. We show that the resulting framework requires fairly little in terms of data, and is computationally efficient and robust to noise, making it a viable candidate for online identification in response to rapid system changes. The SINDy-based model identification is integrated with the manifold boundary approximation method (MBAM) for the reduction of the differential-algebraic equations (DAE)-based SG dynamic models (decrease in the number of states and parameters). The proposed procedure is illustrated on an SG example in a real-world 441-bus and 67-machine benchmark.

Data-Driven Classification, Reduction, Parameter Identification and State Extension in Hybrid Power Systems

Mark K. Transtrum (et al.)

The paper describes a manifold learning-based algorithm for big data classification and reduction, as well as parameter identification in real-time operation of a power system. Both black-box and gray-box settings for SCADA- and PMU-based measurements are examined. Data classification is based on diffusion maps, where an improved data-informed metric construction for partition trees is used. Data classification and reduction is demonstrated on the measurement tensor example of calculated transient dynamics between two SCADA refreshing scans. Interpolation/extension schemes for state extension of restriction (from data to reduced space) and lifting (from reduced to data space) operators are proposed. The method is illustrated on the single-phase Motor D example from very detailed WECC load model, connected to the single bus of a real-world 441-bus power system.

Effect of the density of states at the Fermi level on defect free energies and superconductivity: A case study of Nb₃Sn

Jared Carlson, Alden R. Pack, and Mark K. Transtrum (et al.)

Although often ignored in first-principles studies of material behavior, electronic free energy can have a profound effect in systems with a high-temperature threshold for kinetics and a high Fermi-level density of states (DOS). Nb₃Sn and many other members of the technologically important A15 class of superconductors meet these criteria. This is no coincidence: both electronic free energy and superconducting transition temperature Tc are closely linked to the electronic density of states at the Fermi level. Antisite defects are known to have an adverse effect on Tc in these materials because they disrupt the high Fermi-level density of states. We observe that this also locally reduces electronic free energy, giving rise to large temperature-dependent terms in antisite defect formation and interaction free energies. This work explores the effect of electronic free energy on antisite defect behavior in the case of Nb₃Sn. Using ab initio techniques, we perform a comprehensive study of antisite defects in Nb₃Sn, and find that their effect on the Fermi-level DOS plays a key role determining their thermodynamic behavior, their interactions, and their effect on superconductivity. Based on our findings, we calculate the A15 region of the Nb-Sn phase diagram and show that the phase boundaries depend critically the electronic free energy of antisite defects. In particular, we show that extended defects such as grain boundaries alter the local phase diagram by suppressing electronic free-energy effects, explaining experimental measurements of grain boundary antisite defect segregation. Finally, we quantify the effect of antisite defects on superconductivity with the first ab initio study of Tc in Nb₃Sn as a function of composition, focusing on tin-rich compositions observed in segregation regions around grain boundaries. As tin-rich compositions are not observed in bulk, their properties cannot be directly measured experimentally; our calculations therefore enable quantitative Ginzburg-Landau simulations of grain boundary superconductivity in Nb₃Sn. We discuss the implications of these results for developing new growth processes to improve the properties of Nb₃Sn thin films.

Analysis of magnetic vortex dissipation in Sn-segregated boundaries in Nb₃Sn superconducting RF cavities

Mark K. Transtrum (et al.)

We study mechanisms of vortex nucleation in Nb₃Sn superconducting RF (SRF) cavities using a combination of experimental, theoretical, and computational methods. Scanning transmission electron microscopy imaging and energy dispersive spectroscopy of some Nb3Sn cavities show Sn segregation at grain boundaries in Nb₃Sn with Sn concentration as high as ∼35 at. % and widths ∼3 nm in chemical composition. Using ab initio calculations, we estimate the effect excess tin has on the local superconducting properties of the material. We model Sn segregation as a lowering of the local critical temperature. We then use time-dependent Ginzburg-Landau theory to understand the role of segregation on magnetic vortex nucleation. Our simulations indicate that the grain boundaries act as both nucleation sites for vortex penetration and pinning sites for vortices after nucleation. Depending on the magnitude of the applied field, vortices may remain pinned in the grain boundary or penetrate the grain itself. We estimate the superconducting losses due to vortices filling grain boundaries and compare with observed performance degradation with higher magnetic fields. We estimate that the quality factor may decrease by an order of magnitude (10¹⁰ to 10⁹) at typical operating fields if 0.03% of the grain boundaries actively nucleate vortices. We additionally estimate the volume that would need to be filled with vortices to match experimental observations of cavity heating.

Automatic detection of instances of focused crowd involvement at recreational events

Eric Todd, Mylan R. Cook, Katrina Pedersen, Brooks A. Butler, Xin Zhao, Colt Liu, Kent L. Gee, Mark K. Transtrum, and Sean Warnick (et al.)

This paper describes the development of an automated classification algorithm for detecting instances of focused crowd involvement present in crowd cheering. The purpose of this classification system is for situations where crowds are to be rewarded for not just the loudness of cheering, but for a concentrated effort, such as in Mardi Gras parades to attract bead throws or during critical moments in sports matches. It is therefore essential to separate non-crowd noise, general crowd noise, and focused crowd cheering efforts from one another. The importance of various features—both spectral and low-level audio processing features—are investigated. Data from both parades and sporting events are used for comparison of noise from different venues. This research builds upon previous clustering analyses of crowd noise from collegiate basketball games, using hierarchical clustering as an unsupervised machine learning approach to identify low-level features related to focused crowd involvement. For Mardi Gras crowd data we use a continuous thresholding approach based on these key low-level features as a method of identifying instances where the crowd is particularly active and engaged.

Interleaving physics- and data-driven models for power system transient dynamics

Mark K. Transtrum (et al.)

The paper explores interleaved and coordinated refinement of physicsand data-driven models in describing transient phenomena in large-scale power systems. We develop and study an integrated analytical and computational data-driven gray box environment needed to achieve this aim. Main ingredients include computational differential geometry-based model reduction, optimization-based compressed sensing, and a finite approximation of the Koopman operator. The proposed two-step procedure (the model reduction by differential geometric (information geometry) tools, and data refinement by the compressed sensing and Koopman theory based dynamics prediction) is illustrated on the multi-machine benchmark example of IEEE 14-bus system with renewable sources, where the results are shown for doubly-fed induction generator (DFIG) with local measurements in the connection point. The algorithm is directly applicable to identification of other dynamic components (for example, dynamic loads).

Probabilistic extension of flexible hybrid state estimation for cyber-physical systems

Mark K. Transtrum (et al.)

This paper proposes a probabilistic extension to flexible hybrid state estimation (FHSE) for cyber-physical systems (CPSs). The main goal of the algorithm is improvement of the system state tracking when realistic communications are taken into account, by optimizing information and communication technology (ICT) usage. These advancements result in: 1) coping with ICT outages and inevitable irregularities (delay, packet drop and bad measurements); 2) determining the optimized state estimation execution frequencies based on expected measurement refresh times. Additionally, information about CPSs is gathered from both the phasor measurement units (PMU) and SCADA-based measurements. This measurement transfer introduces two network observability types, which split the system into observable (White) and unobservable (Grey) areas, based on 1) deployed measuring instruments (MIs) and 2) received measurements. A two-step bad data detection (BDD) method is introduced for ICT irregularities and outages. The proposed algorithm benefits are shown on two IEEE test cases with time-varying load/generation: 14-bus and 300-bus.

K-Means clustering of inputs to a geospatial model for optimizing acoustic data collection

Brooks A. Butler, Katrina Pedersen, Kent L. Gee, and Mark K. Transtrum (et al.)

Outdoor ambient acoustical environments may be predicted through machine learning using geospatial features as inputs. However, collecting sufficient training data is an expensive process, particularly when attempting to improve the accuracy of models based on supervised learning methods over large, geospatially diverse regions. Unsupervised machine learning methods, such as K-Means clustering analysis, enable a statistical comparison between the geospatial diversity represented in the current training dataset versus the predictor locations. In this case, 117 geospatial features that represent the contiguous United States have been clustered using K-Means clustering. Results show that most geospatial clusters group themselves according to a relatively small number of prominent geospatial features. It is shown that the available acoustic training dataset has a relatively low geospatial diversity because most training data sites reside in a few clusters. This analysis informs the selection of new site locations for data collection that improve the statistical similarity of the training and input datasets.

Data-driven Symbolic Regression for Identification of Nonlinear Dynamics in Power Systems

Mark K. Transtrum (et al.)

The paper describes a data-driven system identification method tailored to power systems and demonstrated on models of synchronous generators (SGs). In this work, we extend the recent sparse identification of nonlinear dynamics (SINDy) modeling procedure to include the effects of exogenous signals and nonlinear trigonometric terms in the library of elements. We show that the resulting framework requires fairly little in terms of data, and is computationally efficient and robust to noise, making it a viable candidate for online identification in response to rapid system changes. The proposed method also shows improved performance over linear data-driven modeling. While the proposed procedure is illustrated on a SG example in a multi-machine benchmark, it is directly applicable to the identification of other system components (e.g., dynamic loads) in large power systems.

Theses, Captstones, and Dissertations

Asymptotic and Bayesian Approaches to Uncertainty Quantification of Interatomic Models (by Kinamo Williams)

Interatomic models (IMs) are used in molecular modeling to predict material properties of interest. The development of an IM can take several months to years and relies on expert intuition, and yet these potentials are usually only valid for a particular application of interest. Extending existing IMs to new applications is an active area of research. Quantifying the uncertainty of an IM can tell us how much we can trust the predictions it makes. I discuss here two methods for analyzing uncertainty: Fisher Information Matrix (FIM) and Markov Chain Monte Carlo (MCMC). Using MCMC methods, I sample from the posterior distribution of the parameters when trained on data. I demonstrate this method on Lennard-Jones and Morse potentials fit to triclinic crystal configurations from the OpenKIM database. In particular, IMs are often sloppy, i.e., have likelihood surfaces with long, narrow canyons and broad, flat plateaus. I will be comparing the benefits and drawbacks of the two methods.

Two Reduced Models of Nerve Behavior (by Joshua Rasband)

Predictive models are key to understanding the behavior of physical systems. Effective models can facilitate understanding of a complicated system. Ineffective models may have a large number of parameters, leading to the phenomenon of sloppiness, characterized by large uncertainties in estimating parameter values from data. Sloppiness has previously been observed in many fields, including power systems, chemical kinetics, and systems biology. We observe that the Hodgkin- Huxley model, a canonical model of the action potential in the giant squid axon, is a sloppy model. We describe the Manifold Boundary Approximation Method (MBAM), a technique for general model reduction. We use MBAM to construct minimal versions of the Hodgkin-Huxley model of the action potential for two example behaviors. These minimal models can better inform large- scale simulation of neurons in addition to lending important insight into biologically conserved characteristics of the neuron.

Information Geometry and Model Reduction in Oscillatory and Networked Systems (by Benjamin Lane Francis)

In this dissertation, I consider the problem of model reduction in both oscillatory and networked systems. Previously, the Manifold Boundary Approximation Method (MBAM) has been demonstrated as a data-driven tool for reducing the parametric complexity of so-called sloppy models. To be effective, MBAM requires the model manifold to have low curvature. I show that oscillatory models are characterized by model manifolds with high curvature in one or more directions. I propose methods for transforming the model manifolds of these models into ones with low curvature and demonstrate on a couple of test systems. I demonstrate MBAM as a tool for data-driven network reduction on a small model from power systems. I derive multiple effective networks for the model, each tailored to a specific choice of system observations. I find several important types of parameter reductions, including network reductions, which can be used in large power systems models. Finally, I consider the problem of piecemeal reduction of large systems. When a large system is split into pieces that are to be reduced separately using MBAM, there is no guarantee that the reduced pieces will be compatible for reassembly. I propose a strategy for reducing a system piecemeal while guaranteeing that the reduced pieces will be compatible. I demonstrate the reduction strategy on a small resistor network.

Superconductivity at its Limit: Simulating Superconductor Dynamics Near the Superconducting Superheating Field in Eilenberger and Ginzburg-Landau Theory (by Alden Roy Pack)

We computationally explore the dynamics of superconductivity near the superheating field in two ways. First, we use a finite element method to solve the time-dependent Ginzburg-Landau equations of superconductivity. We present a novel way to evaluate the superheating field Hsh and the critical mode that leads to vortex nucleation using saddle-node bifurcation theory. We simulate how surface roughness, grain boundaries, and islands of deficient Sn change those results in 2 and 3 spatial dimensions. We study how AC magnetic fields and heat waves impact vortex movement. Second, we use automatic differentiation to abstract away the details of deriving the equations of motion and stability for Ginzburg-Landau and Eilenberger theory. We present calculations of Hsh and the critical wavenumber using linear stability analysis.

Simulations of Magnetic Vortex Nucleation at Grain Boundaries in RF Superconducting Cavities (by Jared Carlson)

Superconducting Radio Frequency (SRF) cavities are important components of particle accelerators. SRF cavity performance is limited by a maximum allowed applied magnetic field, known as the superheating field ($H_{\rm sh}$) at which magnetic vortices spontaneously enter the material and cause the superconducting material to quench. Previous work has calculated the theoretical maximum field a superconductor can withstand. However, this calculation assumed a perfectly smooth surface with no material inhomogeneities or surface roughness. Real world cavities are polycrystalline (typically Nb or Nb$_3$Sn) and exhibit surface defects near grain boundaries. Cavity preparation methods also lead to material inhomogeneities. I use the time-dependent Ginzburg-Landau theory and finite element methods to model the role of surface defects and material inhomogeneities in magnetic vortex nucleation. Results show the amount by which Hsh is reduced depends on the concentration of impurities as well as the physical dimensions of the defect. Reducing the size of grain boundaries and material inhomogeneities found therein has the potential to significantly increase SRF cavity performance.

Using Machine Learning to Accurately Predict Ambient Soundscapes from Limited Data Sets (by Katrina Lynn Pedersen)

The ability to accurately characterize the soundscape, or combination of sounds, of diverse geographic areas has many practical implications. Interested parties include the United States military and the National Park Service, but applications also exist in areas such as public health, ecology, community and social justice noise analyses, and real estate. I use an ensemble of machine learning models to predict ambient sound levels throughout the contiguous United States. Our data set consists of 607 training sites, where various acoustic metrics, such as overall daytime L50 levels and one-third octave frequency band levels, have been obtained. I have data for 117 geospatial features for the entire contiguous United States, which include metrics such as distance to the nearest road or airport, and the percentage of industrialization or forest in a specific area. I discuss initial model predictions in the spatial, frequency, and temporal domains, and the statistical advantages of using an ensemble of machine learning models, particularly for limited data sets. I comment on uncertainty quantification for machine learning models originating from limited data sets.

Parameter Reduction of the Hodgkin-Huxley Model of Action Potential Propagation (by Tyler Bahr)

In 1952 Hodgkin and Huxley formulated the fundamental biophysical model of how neurons integrate input and fire electric spikes. With 25 parameters and 4 dynamical variables, the model is quite complex. Using information theory, we analyze the model complexity and demonstrate that it is unnecessarily complex for many neural modeling tasks. Using the manifold boundary approximation method of model reduction, we perform a series of parameter reductions on the original 25-parameter model and create a series of spiking Hodgin-Huxley models, each with decreasing parameter number. We analyze the physical meaning of some key approximations uncovered by our systematic reduction methods, which are "blind" to the real physical processes the model is intended to capture. We then evaluate the behavior of the most greatly reduced 14-parameter model under different experimental conditions, including networks of neurons. We also discuss new questions that have arisen as a result of our work

Computational exploration of vortex nucleation in type II superconductors using a finite element method in Ginzburg-Landau theory (by Alden Roy Pack)

Using a finite element method, we numerically solve the time-dependent Ginzburg-Landau equations of superconductivity to explore vortex nucleation in type II superconductors. We consider a cylindrical geometry and simulate the transition from a superconducting state to a mixed state. Using saddle-node bifurcation theory we evaluate the superheating field for a cylinder. We explore how surface roughness and thermal fluctuations influence vortex nucleation. This allows us to simulate material inhomogeneities that may lead to instabilities in superconducting resonant frequency cavities used in particle accelerators.

Algorithms for Inferring the Information Topology of Statistical Mechanics Models (by Kolten Barfuss)

Many-parameter models of complex systems are ubiquitous, yet often difficult to interpret. To gain insight, these models are often simplified, sacrificing some of their global considerations as well as versatility. The task of finding a model that balances these features is of particular interest in statistical mechanics. Our group addresses the problem through a novel approach—the Manifold Boundary Approximation Method (MBAM). As the central step to this approach, we interpret models geometrically as manifolds. Many model manifolds have a set of boundary cells arranged in a hierarchy of dimension. Each of these boundary cells is itself a manifold which corresponds to a simpler version of the original model, with fewer parameters. Thus, a complete picture of all the manifold’s boundary cells—the boundary complex—yields a corresponding family of simplified models. It also characterizes the relationships among the extreme behaviors of the original model, as well as relationships among minimal models that relate subsets of these extreme behaviors. This global picture of the boundary complex for a model is termed the model’s manifold topology. Beginning in the context of statistical mechanics, this thesis defines a class of models— Superficially Determined Lattice (SDL) models—whose manifold topologies can be ascertained algorithmically. This thesis presents two algorithms. Given an SDL model, the Reconstruction Algorithm determines its manifold topology from minimal information. Given a model and desired extreme behaviors, the Minimal Model Algorithm finds the simplified model (with fewest parameters) that interpolates between all of the behaviors.

Adaptation in Enzyme Networks: Searching for Minimal Mechanisms (by Merrill Asp)

Adaptation is an important biological function that can be achieved through networks of enzyme reactions. These networks can be modelled by systems of coupled differential equations. There has been recent interest in identifying what aspect of a network allows it to achieve adaptation. We ask what design principles are necessary for a network to adapt to an external stimulus. We use an information geometric approach that begins with a fully connected network and uses automated model reduction to remove unnecessary combinations of components, effectively constructing and tuning the network to the simplest form that still can achieve adaptation. We interpret the simplified network and combinations of parameters that arise in our model reduction to identify minimal mechanisms of adaptation in enzyme networks, and we consider the applications of these methods to other fields.

Improving the Efficiency of the Levenberg-Marquardt Algorithm Using Partial-Rank Jacobian Updates (by Michael Zarian)

Fitting non-linear models to data is a notoriously difficult problem. The standard algorithm, known as Levenberg-Marquardt (LM), is a gradient search algorithm based on a trust region approach that interpolates between gradient decent and the Gauss-Newton methods. Algorithms (including LM) often get lost in parameter space and take an unreasonable amount of time to converge, especially for models with many parameters. The computational challenge and bottleneck is calculating the derivatives of the model with respect to each parameter to construct the so-called Jacobian matrix. We explore methods for improving the efficiency of LM by approximating the Jacobian using partial-rank updates. We construct an update method that reduces the computational cost of the standard Levenberg-Marquardt routine by a factor of .64 on average for a set of test problems.

Computational Models of Film and Bulk Superconductors in Time Varying Magnetic Fields (by Alden Pack)

We numerically study the time-dependent Ginzburg-Landau equations of superconductivity using a Galerkin method implemented in FEniCS, an automated differential equation solver. We consider geometries for both a bulk material (line from zero to infinity) and a film (half-line), corresponding to mixed and Neumann boundary conditions respectively. We simulate quenching by switching on an external magnetic field, allowing the material to approach a steady state, and then switching on a greater field. Our solutions exhibit the Meissner effect, convergence to the steady state solution, and quenching of superconductors.

Computational Modeling of Magnetic Field Interaction with Superconductors Under High-Field and High-Dissipation Conditions: A TDGL Approach Applied to Nb and Nb3Sn (by Aiden Victor Harbick)

Superconducting radiofrequency (SRF) cavities are essential components in modern particle accelerators, enabling the efficient acceleration of charged particles for various applications in physics, medicine, materials science, and beyond. The performance of these cavities is significantly influenced by the properties of superconducting materials, such as niobium (Nb) and triniobium-tin (Nb3Sn), and the defects and surface features present within the material. This dissertation presents a computational study focused on understanding the behavior of SRF cavities, using a sample-specific time-dependent Ginzburg-Landau (TDGL) framework to simulate their performance under realistic material conditions. The research integrates experimental data and density functional theory (DFT) calculations to model the impact of various defects, including hydrides, Sn-deficient islands, grain boundaries, and surface roughness. The calculations reveal how these defects contribute to performance degradation, particularly in terms of dissipation and quality factor (Q). We also investigate the impact of surface layers and roughness on the behavior of Nb3Sn, finding that surface features play a significant role in influencing cavity performance. In addition, the dissertation explores the generalized TDGL (GTDGL) model, which offers an extension to traditional TDGL theory and enables improved predictions of frequency-dependent phenomena. This work contributes to the development of more accurate computational tools for analyzing SRF cavity performance, providing insights that can guide future efforts in material optimization and accelerator technology.

Information Geometric Approaches to Optimal Experimental Design and Reduced-Order Modeling in Ocean Acoustics (by Jay Clark Spendlove)

This thesis demonstrates the application of information geometry to problems in underwater acoustics. Information geometry combines the fields of information theory and differential geometry by interpreting a multiparameter model as a Riemannian manifold in an ambient data space. Information geometry tools are especially powerful in context of problems of experimental design and model selection in ocean acoustics. The application area specifically considered in this research is geoacoustic inversion, where seabed parameter values are inferred from acoustical data. Chapter 2 contains a paper submitted to the Journal of Theoretical and Computational Acoustics, which introduces information geometry tools such as the model manifold and Fisher information in context of a review of work in underwater acoustics doing parameter sensitivity analysis. An example constructing model manifolds for a sound propagation model is given in the second half of the paper. Chapter 3 contains a paper submitted to the Journal of the Acoustical Society of America, Express Letters, which constructs model manifolds for a sound propagation model and compares the information content of absolute and relative transmission loss in regards to seabed parameters, demonstrating how information geometry can be use to inform experimental design. This thesis contains the initial application of information geometry to ocean acoustics, with many more advances that can be pursued in future work.

Stability, Longevity, and Regulatory Bionetworks (by Christian N Anderson)

Genome-wide studies of diseases and chronic conditions frequently fail to uncover marked or consistent differences in RNA or protein concentrations. However, the developing field of kinetic proteomics has made promising discoveries in differences in the turnover rate of these same proteins, even when concentrations were not necessarily different. The situation can theoretically be modeled mathematically using bifurcation equations, but uncovering the proper form of these is difficult. To this end, we developed TWIG, a method for characterizing bifurcations that leverages information geometry to identify drivers of complex systems. Using this, we characterized the bifurcation and stability properties of all 132 possible 3- and 22,662 possible 4-node subgraphs (motifs) of protein-protein interaction networks. Analyzing millions of real world protein networks indicates that natural selection has little preference for motifs that are stable per se, but a great preference for motifs who have parameter regions that are exclusively stable, rather than poorly constrained mixtures of stability and instability. We apply this knowledge to mice on calorie restricted (CR) diets, demonstrating that changes in their protein turnover rates do indeed make their protein networks more stable, explaining why CR is the most robust way known to extend lifespan.

Quantum Decoherence in Time-Dependent Anharmonic Systems (by Ty Spencer Beus)

This dissertation studies quantum decoherence in anharmonic oscillator systems to monitor and understand the way the systems evolve. It also explores methods to control the systems' evolution, and the effects of decoherence when applicable. We primarily do this by finding the time evolution of the systems using their Lie algebraic structures. We solve for a generalized Caldirola-Kanai Hamiltonian, and propose a general way to produce a desired evolution of the system. We apply the analysis to the effects of Dirac delta fluctuations in mass and frequency, both separately and simultaneously. We also numerically demonstrate control of the generalized Caldirola-Kanai system for the case of timed Gaussian fluctuations in the mass term. This is done in a way that can be applied to any system that is made up of a Lie algebra. We also explore the evolution of an optomechanical coupled mirror-laser system while maintaining a second order coupling. This system creates anharmonic effects that can produce cat states which can be used for quantum computing. We find that the decoherence in this system causes a rotational smearing effect in the Husimi function which, with the second order term added, causes rotational smearing after a squeezing effect. Finally, we also address the dynamic evolution and decoherence of an anharmonic oscillator with infinite coupling using the Born-Markov master equation. This is done by using the Lie algebraic structure of the Born-Markov master equation's superoperators when applying a strategic mean field approximation to maintain dynamic flexibility. The system is compared to the Born-Markov master equation for the harmonic oscillator, the regular anharmonic oscillator, and the dynamic double anharmonic oscillator. Throughout, Husimi plots are provided to visualize the dynamic decoherence of these systems.

Efficacy of Enforced Parsimony on a Machine Learning Model of Gravity (by Joey Ehlert)

Many machine learning models are often overly complicated and require simplification, making it difficult to use them to discover fundamental physical laws. We examine the role of parsimony in the scientific process using a 14-parameter, model of gravity created by the SirIsaac algorithm, an S-Systems model. S-Systems, a universal function approximator for dynamical systems, are an interesting case study because they include true gravity, i.e., the inverse square law, as a special case. We explore the question whether model reduction methods can find true gravity as an optimal approximation to the machine-learned SirIsaac model. We use the Manifold Boundary Approximation Method (MBAM) as a parameter reduction algorithm. MBAM is a computational approach based on the information geometry of the model. We found that MBAM produces a reduced model of SirIsaac that accurately describes the four orbits of Newtonian gravity (circular, elliptical, parabolic, and hyperbolic). The final reduced model is different than Newtonian gravity, although the two reduction paths share four limits. By using two subsets (bound and unbound orbits, respectively) of the data, we identified, via MBAM, a model that accurately fit each subset. We find that all the limits necessary for Newtonian gravity appear in at least one of the reduction paths of the bound and unbound orbits.

Analyzing Frequency Spectra of Crowd Speech (by Sam Greenwell)

The purpose of this project is to determine significant acoustic features or conducive analyzation methods for crowd speech detection for subsequent extrapolation to crowd sentiment detection. This project found that crowd speech when treated as a general noise is differentiable from other prevalent crowd noises and that speech is easily overpowered by these other noises. However, differentiating singular letters may not be possible by using frequency spectra alone.

Bayesian, Frequentist, and Information Geometry Approaches to Parametric Uncertainty Quantification of Classical Empirical Interatomic Potentials (by Yonatan Kurniawan)

Uncertainty quantification (UQ) is an increasingly important part of materials modeling. In this paper, we consider the problem of quantifying parametric uncertainty in classical empirical interatomic potentials (IPs). Previous work based on local sensitivity analysis using the Fisher Information has shown that IPs are sloppy, i.e., are insensitive to coordinated changes of many parameter combinations. We confirm these results and further explore the non-local statistics in the context of sloppy model analysis using both Bayesian (MCMC) and Frequentist (profile likelihood) methods. We interface these tools with the Knowledgebase of Interatomic Models (OpenKIM) and study three models based on the Lennard-Jones, Morse, and Stillinger-Weber potentials, respectively. We confirm that IPs have global properties similar to those of sloppy models from fields such as systems biology, power systems, and critical phenomena. These models exhibit a low effective dimensionality in which many of the parameters are unidentifiable, i.e., do not encode any information when fit to data. Because the inverse problem in such models is ill-conditioned, unidentifiable parameters present challenges for traditional statistical methods. In the Bayesian approach, Monte Carlo samples can depend on the choice of prior in subtle ways. In particular, they often "evaporate" parameters into high-entropy, sub-optimal regions of the parameter space. For profile likelihoods, confidence regions are extremely sensitive to the choice of confidence level. To get a better picture of the relationship between data and parametric uncertainty, we sample the Bayesian posterior at several sampling temperatures and compare the results with those of Frequentist analyses. In analogy to statistical mechanics, we classify samples as either energy-dominated, i.e., characterized by identifiable parameters in constrained (ground state) regions of parameter space, or entropy-dominated, i.e., characterized by unidentifiable (evaporated) parameters. We complement these two pictures with information geometry to illuminate the underlying cause of this phenomenon. In this approach, a parameterized model is interpreted as a manifold embedded in the space of possible data with parameters as coordinates. We calculate geodesics on the model manifold and find that IPs, like other sloppy models, have bounded manifolds with a hierarchy of widths, leading to low effective dimensionality in the model. We show how information geometry can motivate new, natural parameterizations that improve the stability and interpretation of UQ analysis and further suggest simplified, less-sloppy models.

Modeling Continental-Scale Outdoor Environmental Sound Levels with Limited Data (by Katrina Lynn Pedersen)

Modeling outdoor acoustic environments is a challenging problem because outdoor acoustic environments are the combination of diverse sources and propagation effects, including barriers to propagation such as buildings or vegetation. Outdoor acoustic environments are most commonly modeled on small geographic scales (e.g., within a single city). Extending modeling efforts to continental scales is particularly challenging due to an increase in the variety of geographic environments. Furthermore, acoustic data on which to train and validate models are expensive to collect and therefore relatively limited. It is unclear how models trained on this limited acoustic data will perform across continental-scales, which likely contain unique geographic regions which are not represented in the training data. In this dissertation, we consider the problem of continental-scale outdoor environmental sound level modeling using the contiguous United States for our area of study. We use supervised machine learning methods to produce models of various acoustic metrics and unsupervised learning methods to study the natural structures in geospatial data. We present a validation study of two continental-scale models which demonstrates that there is a need for better uncertainty quantification and tools to guide data collection. Using ensemble models, we investigate methods for quantifying uncertainty in continental-scale models. We also study methods of improving model accuracy, including dimensionality reduction, and explore the feasibility of predicting hourly spectral levels.

Modeling defects in Nb3Sn Superconductor Resonance Cavities with Ginzburg-Landau Theory (by Braedon Jones)

Superconducting resonance cavities are used in particle accelerators to accelerate beams of charged particles to near light speed. The fundamental limit to performance in these cavities is the maximum induced magnetic field that the superconductors can expel due to the Meissner effect. Traditionally, cavities have been made from Niobium; however, current technology has nearly reached the theoretical limit of performance for Niobium-based cavities. To overcome these limitations, Nb3Sn is being explored as a potential next-generation material. In actual development of Nb3Sn cavities, material defects arise that may limit performance. Time-dependent Ginzburg-Landau simulate deficiencies to explore if they cause detrimental effects to cavity performance. This research focuses on small ‘island’ regions containing deficits of Sn. These islands have been observed below the surface in real Nb3Sn cavities after fabrication. This research shows that these islands may affect performance if they are near the surface but become irrelevant when they are located more than a penetration depth below the interface.