A. Trudeau, Anthony H. Gonzalez, K. Thongkham, Kyoung-Soo Lee, Stacey Alberts, M. Brodwin, Thomas Connor, Peter R. M. Eisenhardt et al.
The evolution of galaxies depends on their masses and local environments;
understanding when and how environmental quenching starts to operate remains a
challenge. Furthermore, studies of the high-redshift regime have been limited
to massive cluster members, owing to sensitivity limits or small fields of
views when the sensitivity is sufficient, intrinsically biasing the picture of
cluster evolution. In this work, we use stacking to investigate the average
star formation history of more than 10,000 groups and clusters drawn from the
Massive and Distant Clusters of WISE Survey 2 (MaDCoWS2). Our analysis covers
near ultraviolet to far infrared wavelengths, for galaxy overdensities at $0.5
\lesssim z \lesssim 2.54$. We employ SED fitting to measure the specific star
formation rates (sSFR) in four annular apertures with radii between 0 and 1000
kpc. At $z \gtrsim 1.6$, the average sSFR evolves similarly to the field in
both the core and the cluster outskirts. Between $\overline{z} = 1.60$ and
$\overline{z} = 1.35$, the sSFR in the core drops sharply, and continues to
fall relative to the field sSFR at lower redshifts. We interpret this change as
evidence that the impact of environmental quenching dramatically increases at
$z \sim 1.5$, with the short time span of the transition suggesting that the
environmental quenching mechanism dominant at this redshift operates on a rapid
timescale. We find indications that the sSFR may decrease with increasing host
halo mass, but lower-scatter mass tracers than the signal-to-noise ratio (S/N)
are needed to confirm this relationship.
Authors' comments: 22 pages, 14 figures, accepted for publication in ApJ
Pulkit Khandelwal, Michael Tran Duong, Lisa Levorse, Constanza Fuentes, Amanda Denning, Winifred Trotman, Ranjit Ittyerah, Alejandra Bahena et al.
Magnetic resonance imaging (MRI) is the standard modality to understand human brain structure and function in vivo (antemortem). Decades of research in human neuroimaging has led to the widespread development of methods and tools to provide automated volume-based segmentations and surface-based parcellations which help localize brain functions to specialized anatomical regions. Recently ex vivo (postmortem) imaging of the brain has opened-up avenues to study brain structure at sub-millimeter ultra high-resolution revealing details not possible to observe with in vivo MRI. Unfortunately, there has been limited methodological development in ex vivo MRI primarily due to lack of datasets and limited centers with such imaging resources. Therefore, in this work, we present one-of-its-kind dataset of 82 ex vivo T2w whole brain hemispheres MRI at 0.3 mm isotropic resolution spanning Alzheimer's disease and related dementias. We adapted and developed a fast and easy-to-use automated surface-based pipeline to parcellate, for the first time, ultra high-resolution ex vivo brain tissue at the native subject space resolution using the Desikan-Killiany-Tourville (DKT) brain atlas. This allows us to perform vertex-wise analysis in the template space and thereby link morphometry measures with pathology measurements derived from histology. We will open-source our dataset docker container, Jupyter notebooks for ready-to-use out-of-the-box set of tools and command line options to advance ex vivo MRI clinical brain imaging research on the project webpage.
Hidekazu Yoshioka, Motoh Tsujimura
Logit dynamics are evolution equations that describe transitions to equilibria of actions among many players. We formulate a pair-wise logit dynamic in a continuous action space with a generalized exponential function, which we call a generalized pair-wise logit dynamic, depicted by a new evolution equation nonlocal in space. We prove the well-posedness and approximability of the generalized pair-wise logit dynamic to show that it is computationally implementable. We also show that this dynamic has an explicit connection to a mean field game of a controlled pure-jump process, with which the two different mathematical models can be understood in a unified way. Particularly, we show that the generalized pair-wise logit dynamic is derived as a myopic version of the corresponding mean field game, and that the conditions to guarantee the existence of unique solutions are different from each other. The key in this procedure is to find the objective function to be optimized in the mean field game based on the logit function. The monotonicity of the utility is unnecessary for the generalized pair-wise logit dynamic but crucial for the mean field game. Finally, we present applications of the two approaches to fisheries management problems with collected data.
Rezoanoor Rahman, Fariha Taskin
Rainfall is an essential hydrological component, and most of the economic
activities of an agrarian country like Bangladesh depend on rainfall. An
accurate rainfall forecast can help make necessary decisions and reduce the
damages caused by heavy or low to no rainfall. The monthly average rainfall is
a time series data, and recently, long short-term memory (LSTM) neural networks
are being used heavily for time series forecasting problems. One major
challenge of forecasting using LSTMs is to select the appropriate number of lag
values. In this research, we considered the number of lag values selected as a
hyperparameter of LSTM; it, with the other hyperparameters determining LSTMs
structure, has been optimized using Bayesian optimization. We used our proposed
method to forecast rainfall for nine different weather stations of Bangladesh.
Finally, the performance of the proposed model has been compared with some
other LSTM with different lag-selection methods and some several popular
machine learning and statistical forecasting models.
Authors' comments: 19 pages in total
Kunxing Lu, Xianrui Wang, Tetsuya Ueda, Shoji Makino, Jingdong Chen
While the semi-blind source separation-based acoustic echo cancellation (SBSS-AEC) has received much research attention due to its promising performance during double-talk compared to the traditional adaptive algorithms, it suffers from system latency and nonlinear distortions. To circumvent these drawbacks, the recently developed ideas on convolutive transfer function (CTF) approximation and nonlinear expansion have been used in the iterative projection (IP)-based semi-blind source separation (SBSS) algorithm. However, because of the introduction of CTF approximation and nonlinear expansion, this algorithm becomes computationally very expensive, which makes it difficult to implement in embedded systems. Thus, we attempt in this paper to improve this IP-based algorithm, thereby developing an element-wise iterative source steering (EISS) algorithm. In comparison with the IP-based SBSS algorithm, the proposed algorithm is computationally much more efficient, especially when the nonlinear expansion order is high and the length of the CTF filter is long. Meanwhile, its AEC performance is as good as that of IP-based SBSS.
Roberto Goya-Maldonado, Tracy Erwin-Grabner, Ling-Li Zeng, Christopher R. K. Ching, Andre Aleman, Alyssa R. Amod, Zeynep Basgoze, Francesco Benedetti et al.
Major depressive disorder (MDD) is a complex psychiatric disorder that
affects the lives of hundreds of millions of individuals around the globe. Even
today, researchers debate if morphological alterations in the brain are linked
to MDD, likely due to the heterogeneity of this disorder. The application of
deep learning tools to neuroimaging data, capable of capturing complex
non-linear patterns, has the potential to provide diagnostic and predictive
biomarkers for MDD. However, previous attempts to demarcate MDD patients and
healthy controls (HC) based on segmented cortical features via linear machine
learning approaches have reported low accuracies. Here, we used globally
representative data from the ENIGMA-MDD working group containing 7,012
participants from 30 sites (N=2,772 MDD and N=4,240 HC), which allows a
comprehensive analysis with generalizable results. Based on the hypothesis that
integration of vertex-wise cortical features can improve classification
performance, we evaluated the classification of a DenseNet and a Support Vector
Machine (SVM), with the expectation that the former would outperform the
latter. We found that both classifiers exhibited close to chance performance
(balanced accuracy DenseNet: 51%; SVM: 53%), when estimated on unseen sites.
Slightly higher classification performance (balanced accuracy DenseNet: 58%;
SVM: 55%) was found when the cross-validation folds contained subjects from all
sites, indicating site effect. In conclusion, the integration of vertex-wise
morphometric features and the use of the non-linear classifier did not lead to
the differentiability between MDD and HC. Our results support the notion that
MDD classification on this combination of such features and classifiers is
unfeasible. Perhaps more sophisticated integration of multimodal information
may lead to a higher performance in this diagnostic task.
Authors' comments: arXiv admin note: text overlap with arXiv:2206.08122
Hiroto Harada, Michihiro Mikamo, Ryo Furukawa, Ryushuke Sagawa, Hiroshi Kawasaki
Active stereo technique using single pattern projection, a.k.a. one-shot 3D
scan, have drawn a wide attention from industry, medical purposes, etc. One
severe drawback of one-shot 3D scan is sparse reconstruction. In addition,
since spatial pattern becomes complicated for the purpose of efficient
embedding, it is easily affected by noise, which results in unstable decoding.
To solve the problems, we propose a pixel-wise interpolation technique for
one-shot scan, which is applicable to any types of static pattern if the
pattern is regular and periodic. This is achieved by U-net which is pre-trained
by CG with efficient data augmentation algorithm. In the paper, to further
overcome the decoding instability, we propose a robust correspondence finding
algorithm based on Markov random field (MRF) optimization. We also propose a
shape refinement algorithm based on b-spline and Gaussian kernel interpolation
using explicitly detected laser curves. Experiments are conducted to show the
effectiveness of the proposed method using real data with strong noises and
textures.
Authors' comments: MVA2023
Ling Li, Shu Wang, Xiaodian Chen, Qingquan Jiang
Interstellar dust extinction law is essential for interpreting observations.
In this work, we investigate the ultraviolet (UV)--mid-infrared (IR) extinction
law of the Taurus molecular cloud and its possible variations. We select
504,988 dwarf stars (4200 K < Teff < 8000 K) and 4,757 giant stars (4200 K <
Teff < 5200 K) based on the stellar parameters of Gaia DR3 as tracers. We
establish the Teff--intrinsic color relations and determine the intrinsic color
indices and color excesses for different types of stars. In the determination
of color excess ratios (CERs), we analyze and correct the curvature of CERs and
derive the UV--mid-IR CERs of 16 bands. We consider different effective
wavelengths for different types of stars when converting CERs to relative
extinction, and obtain the extinction law with a better wavelength resolution.
In addition, we analyze the possible regional variation of extinction law and
derive the average extinction law of Rv=3.13+-0.32 for the Taurus molecular
cloud. Only 0.9% of subregions have deviations >3sigma, indicating limited
regional variation in the extinction law. We also discuss the effect of Gaia
Teff overestimation on the determination of the Taurus extinction law and find
that the effect is negligible.
Authors' comments: 26 pages, 11 figures, 2 tables, Accepted for publication in ApJ
R. Scott Barrows, Julia M. Comerford, Daniel Stern, Roberto J. Assef
Pairs of galaxies hosting active galactic nuclei (AGN) are powerful probes of
merger-driven supermassive black hole (SMBH) growth as they can resolve
individual AGN and trace mergers over a large range of physical separations. To
exploit this on a large scale for the first time for both obscured and
unobscured AGN, we use photometric redshifts of AGN selected by the Wide-field
Infrared Survey Explorer (WISE) to find probabilistic pairs (<100 kpc
separations) across the sky, along with a comparison sample of inactive galaxy
pairs. Our final sample of integrated pair probabilities yields 198 AGN-AGN
pairs (dual AGN) and 2767 AGN-galaxy pairs (offset AGN) with uniformly measured
AGN and host galaxy physical properties. We find the fraction of galaxy pairs
hosting WISE AGN is dominated by offset AGN and significantly elevated above
that of inactive galaxies for large host stellar masses. We show how the AGN
merger fraction directly increases with AGN extinction for both offset and dual
AGN, with up to ~40% of heavily obscured AGN found in galaxy pairs. Elevated
AGN merger fractions coincide with increased host specific star formation rates
that suggest merger-driven co-evolution of galaxies and SMBHs. Among dual AGN,
the most rapid SMBH growth may occur within the less massive galaxy. Relative
to stochastic mechanisms, mergers produce an excess of AGN at increasingly
smaller separations, especially for obscured AGN (up to a factor of ~5), and
augmented by correlated triggering. Finally, this excess is stronger than for
lower luminosity optically-selected AGN, regardless of AGN obscuration level.
Authors' comments: 19 pages, 16 figures. Accepted for publication in the Astrophysical
Journal
Xiaoyu Feng, Huangxin Chen, Bo Yu, Shuyu Sun
The solidification and macro-segregation problem involving unsteady
multi-physics and multi-phase fields is typically a complex process with mass,
momentum, heat, and species transfers among solid, mushy, and liquid phase
regions. The quantitative prediction of phase change, chemical heterogeneities,
and multi-phase and multi-component flows plays critical roles in many natural
scenarios and industrial applications that involve many disciplines, like
material, energy, and even planet science. In view of this, some scholars and
research institutions have called for more contributors to join the benchmark
analysis of solidification and segregation problems. Our work proposes an
operator-splitting and matrix-based method to avoid non-linear systems. Also,
the combination of vectorization and forward equation-based matrix assembly
techniques enhances the implementability of extensions of 3D applications.
Lastly, the novel scheme is well validated through a bunch of 2D and 3D
benchmark cases. The numerical results also illustrate that this method can
ensure accurate prediction and adequately capture the physical details of
phenomena caused by the solutally and thermally driven flow, which include
channel segregation, the formation of freckles, edge effect, aspect ratio
effect, and 3D effect.
Authors' comments: keyword: Solidification, Macro-segregation, Multi-phase,
Operator-splitting, matrix-based, matrix assembly techniques, Benchmark
modeling
Haoxuan Jiang, Jianghui Ji, Liangliang Yu, Bin Yang, Shoucun Hu, Yuhui Zhao
(704) Interamnia is one of the largest asteroids that locates in the outer
main-belt region, which may contain a large amount of water ice underneath its
surface. We observe this asteroid using 8.2 m Subaru telescope at mid-infrared
wavebands, and utilize thermophysical model for realistic surface layers
(RSTPM) to analyze mid-infrared data from Subaru along with those of IRAS,
AKARI and WISE/NEOWISE. We optimize the method to convert the WISE magnitude to
thermal infrared flux with temperature dependent color corrections, which can
provide significant references for main-belt asteroids at a large heliocentric
distance with low surface temperature. We derive best-fitting thermal
parameters of Interamnia - a mean regolith grain size of $190_{-180}^{+460}~\rm
\mu m$, with a roughness of $0.30_{-0.17}^{+0.35}$ and RMS slope of
$27_{-9}^{+13}$ degrees, thereby producing thermal inertia ranging from 9 to
$92~\rm Jm^{-2}s^{-1/2}K^{-1}$ due to seasonal temperature variation. The
geometric albedo and effective diameter are evaluated to be
$0.0472_{-0.0031}^{+0.0033}$ and $339_{-11}^{+12}~\rm km$, respectively, being
indicative of a bulk density of $1.86\pm0.63~\rm g/cm^3$. The low thermal
inertia is consistent with typical B/C-type asteroids with $D\geq100$ km. The
tiny regolith grain size suggests the presence of a fine regolith on the
surface of Interamnia. Moreover, the seasonal and diurnal temperature
distribution indicates that thermal features between southern and northern
hemisphere appear to be very different. Finally, we present an estimation of
volume fraction of water ice of $9\%\sim66\%$ from the published grain density
and porosity of carbonaceous chondrites.
Authors' comments: 17 pages, 10 figures, accepted for publication in ApJ
Yang Gao, Qing-Hua Tan, Yu Gao, Min Fang, Ryan Chown, Qian Jiao, Chun-Sheng Luo
We complement the MALATANG sample of dense gas in nearby galaxies with
archival observations of $^{12}\rm CO$ and its isotopologues to determine
scaling relations between Wide-field Infrared Survey Explorer (WISE) 12 $\mu$m
emission and molecular gas tracers at sub-kiloparsec scales. We find that 12
$\mu$m luminosity is more tightly correlated with $^{12}\rm CO$ than it is with
$^{13}\rm CO$ or dense gas tracers. Residuals between predicted and observed
$^{12}\rm CO$ are only weakly correlated with molecular gas mass surface
density ($\Sigma_{\rm mol}$) in regions where $\Sigma_{\rm mol}$ is very low
($\sim 10~{\rm M_{\odot}~pc^{-2}}$). Above this limit, the $^{12}\rm CO$
residuals show no correlations with physical conditions of molecular gas, while
$^{13}\rm CO$ residuals depend on the gas optical depth and temperature. By
analyzing differences from galaxy to galaxy, we confirm that the $^{12}\rm
CO$-12 $\mu$m relation is strong and statistically robust with respect to star
forming galaxies and AGN hosts. These results suggest that WISE 12 $\mu$m
emission can be used to trace total molecular gas instead of dense molecular
gas, likely because polycyclic aromatic hydrocarbons (PAHs, a major contributor
to WISE 12 $\mu$m~emission) may be well-mixed with the gas that is traced by
$^{12}\rm CO$. We propose that WISE 12 $\mu$m luminosity can be used to
estimate molecular gas surface density for statistical analyses of the star
formation process in galaxies.
Authors' comments: 16 pages, 7 figures, accepted for publication in ApJ
Rianna Bell, Khaled Said, Tamara Davis, T. H. Jarrett
In this paper, we present our calibrations of the TF relation in the
mid-infrared W1 ($3.4\mu$m) and W2 ($4.6\mu$m) bands, using large samples 848
galaxies and 857 galaxies in the W1 and W2 bands respectively. In this
calibration we performed a correction for the cluster population incompleteness
bias, and a morphological type correction. The calibration was performed using
a new, iterative bivariate fitting procedure. For these calibrations we used
the total absolute magnitudes, and HI linewidths $W_{F50}$ derived from the HI
global profiles as a measure of the rotational velocities. We then performed
two additional calibrations on the same sample using (i) the isophotal
magnitudes and (ii) the average rotational velocities measured along the flat
sections of the spatially resolved rotation curves of the galaxies, which were
obtained from the empirical conversion between rotational velocity definitions.
We compared these three calibrations to determine whether the use of isophotal
magnitudes, or spatially resolved rotational velocities have a significant
impact on the scatter around the TF relations in the W1 and W2 bands. We found
that the original calibrations using total magnitudes and \hi linewidths had
the smallest total scatters. These calibrations are given by $M_{\rm Tot, W1} =
(2.02 \pm 0.44) - (10.08 \pm 0.17)\log_{10}(W_{F50})$ and $M_{\rm Tot, W2} =
(2.00 \pm 0.44) - (10.11 \pm 0.17)\log_{10}(W_{F50})$, with associated total
scatters of $\sigma_{W1} = 0.68$ and $\sigma_{W2} = 0.69$. Finally, we compared
our calibrations in the mid-infrared bands with previous calibrations in the
near-infrared J, H and K bands and the long-wavelength optical I band, which
used the same two corrections. The differences between these relations can be
explained by considering the different regions and components of spiral
galaxies that are traced by the different wavelengths.
Authors' comments: 16 pages, 17 figures, 1 table, submitted to MNRAS, comments are
welcome
Dario Sitnik, Ivica Kopriva
Application of artificial intelligence in medicine brings in highly accurate
predictions achieved by complex models, the reasoning of which is hard to
interpret. Their generalization ability can be reduced because of the lack of
pixel wise annotated images that occurs in frozen section tissue analysis. To
partially overcome this gap, this paper explores the approximate explicit
feature map (aEFM) transform of low-dimensional data into a low-dimensional
subspace in Hilbert space. There, with a modest increase in computational
complexity, linear algorithms yield improved performance and keep
interpretability. They remain amenable to incremental learning that is not a
trivial issue for some nonlinear algorithms. We demonstrate proposed
methodology on a very large-scale problem related to intraoperative pixel-wise
semantic segmentation and clustering of adenocarcinoma of a colon in a liver.
Compared to the results in the input space, logistic classifier achieved
statistically significant performance improvements in micro balanced accuracy
and F1 score in the amounts of 12.04% and 12.58%, respectively. Support vector
machine classifier yielded the increase of 8.04% and 9.41%. For clustering,
increases of 0.79% and 0.85% are obtained with ultra large-scale spectral
clustering algorithm. Results are supported by a discussion of interpretability
using Shapely additive explanation values for predictions of linear classifier
in input space and aEFM induced space.
Authors' comments: 18 pages, 4 figures, 6 tables, appendix
Bandon Decker, Mark Brodwin, Ripon Saha, Thomas Connor, Peter R. M. Eisenhardt, Anthony H. Gonzalez, Emily Moravec, Mustafa Muhibullah et al.
We present stellar mass fractions and composite luminosity functions (LFs)
for a sample of \Ncl\ clusters from the Massive and Distant Clusters of WISE
Survey (MaDCoWS) at a redshift range of $0.951 \leq z \leq 1.43$. Using SED
fitting of optical and deep mid-infrared photometry, we establish the
membership of objects along the lines-of-sight to these clusters and calculate
the stellar masses of member galaxies. We find stellar mass fractions for these
clusters largely consistent with previous works, including appearing to display
a negative correlation with total cluster mass. We measure a composite
$3.6~\mathrm{\mu m}$ LF down to $m^*+2.5$ for all 12 clusters. Fitting a
Schechter function to the LF, we find a characteristic $3.6~\mathrm{\mu m}$
magnitude of $m^*=19.83\pm0.12$ and faint-end slope of $\alpha=-0.81\pm0.10$
for the full sample at a mean redshift of $\bar{z} = 1.18$. We also divide the
clusters into high- and low-redshift bins at $\bar{z}=1.29$ and $\bar{z}=1.06$
respectively and measure a composite LF for each bin. We see a small, but
statistically significant evolution in $m^*$ and $\alpha$ -- consistent with
passive evolution -- when we study the joint fit to the two parameters, which
is probing the evolution of faint cluster galaxies at $z\sim1$. This highlights
the importance of deep IR data in studying the evolution of cluster galaxy
populations at high-redshift.
Authors' comments: 13 pages, 11 figures, Submitted to ApJ
Peter Greenstreet, Thomas Jaki, Alun Bedding, Chris Harbron, Pavel Mozgunov
There is growing interest in platform trials that allow for adding of new
treatment arms as the trial progresses as well as being able to stop treatments
part way through the trial for either lack of benefit/futility or for
superiority. In some situations, platform trials need to guarantee that error
rates are controlled. This paper presents a multi-stage design that allows
additional arms to be added in a platform trial in a pre-planned fashion, while
still controlling the family wise error rate. A method is given to compute the
sample size required to achieve a desired level of power and we show how the
distribution of the sample size and the expected sample size can be found. A
motivating trial is presented which focuses on two settings, with the first
being a set number of stages per active treatment arm and the second being a
set total number of stages, with treatments that are added later getting fewer
stages. Through this example we show that the proposed method results in a
smaller sample size while still controlling the errors compared to running
multiple separate trials.
Authors' comments: 30 Pages, 7 figures
Aliaksei Petsiuk, Joshua M. Pearce
This study presents an open source method for detecting 3D printing anomalies by comparing images of printed layers from a stationary monocular camera with G-code-based reference images of an ideal process generated with Blender, a physics rendering engine. Recognition of visual deviations was accomplished by analyzing the similarity of histograms of oriented gradients (HOG) of local image areas. The developed technique requires preliminary modeling of the working environment to achieve the best match for orientation, color rendering, lighting, and other parameters of the printed part. The output of the algorithm is a level of mismatch between printed and synthetic reference layers. Twelve similarity and distance measures were implemented and compared for their effectiveness at detecting 3D printing errors on six different representative failure types and their control error-free print images. The results show that although Kendall tau, Jaccard, and Sorensen similarities are the most sensitive, Pearson r, Spearman rho, cosine, and Dice similarities produce the more reliable results. This open source method allows the program to notice critical errors in the early stages of their occurrence and either pause manufacturing processes for further investigation by an operator or in the future AI-controlled automatic error correction. The implementation of this novel method does not require preliminary data for training, and the greatest efficiency can be achieved with the mass production of parts by either additive or subtractive manufacturing of the same geometric shape. It can be concluded this open source method is a promising means of enabling smart distributed recycling for additive manufacturing using complex feedstocks as well as other challenging manufacturing environments.
Yu Li, Muhammad Monjurul Karim, Ruwen Qin
Reducing traffic fatalities and serious injuries is a top priority of the US
Department of Transportation. The computer vision (CV)-based crash anticipation
in the near-crash phase is receiving growing attention. The ability to perceive
fatal crash risks earlier is also critical because it will improve the
reliability of crash anticipation. Yet, annotated image data for training a
reliable AI model for the early visual perception of crash risks are not
abundant. The Fatality Analysis Reporting System contains big data of fatal
crashes. It is a reliable data source for learning the relationship between
driving scene characteristics and fatal crashes to compensate for the
limitation of CV. Therefore, this paper develops a data analytics model, named
scenario-wise, Spatio-temporal attention guidance, from fatal crash report
data, which can estimate the relevance of detected objects to fatal crashes
from their environment and context information. First, the paper identifies
five sparse variables that allow for decomposing the 5-year fatal crash dataset
to develop scenario-wise attention guidance. Then, exploratory analysis of
location- and time-related variables of the crash report data suggests reducing
fatal crashes to spatially defined groups. The group's temporal pattern is an
indicator of the similarity of fatal crashes in the group. Hierarchical
clustering and K-means clustering merge the spatially defined groups into six
clusters according to the similarity of their temporal patterns. After that,
association rule mining discovers the statistical relationship between the
temporal information of driving scenes with crash features, for each cluster.
The paper shows how the developed attention guidance supports the design and
implementation of a preliminary CV model that can identify objects of a
possibility to involve in fatal crashes from their environment and context
information.
Authors' comments: 20 pages, 14 figures, submitted and accepted by Accident Analysis &
Prevention
Shammur Absar Chowdhury, Nadir Durrani, Ahmed Ali
Deep neural networks are inherently opaque and challenging to interpret.
Unlike hand-crafted feature-based models, we struggle to comprehend the
concepts learned and how they interact within these models. This understanding
is crucial not only for debugging purposes but also for ensuring fairness in
ethical decision-making. In our study, we conduct a post-hoc functional
interpretability analysis of pretrained speech models using the probing
framework [1]. Specifically, we analyze utterance-level representations of
speech models trained for various tasks such as speaker recognition and dialect
identification. We conduct layer and neuron-wise analyses, probing for speaker,
language, and channel properties. Our study aims to answer the following
questions: i) what information is captured within the representations? ii) how
is it represented and distributed? and iii) can we identify a minimal subset of
the network that possesses this information?
Our results reveal several novel findings, including: i) channel and gender
information are distributed across the network, ii) the information is
redundantly available in neurons with respect to a task, iii) complex
properties such as dialectal information are encoded only in the task-oriented
pretrained network, iv) and is localised in the upper layers, v) we can extract
a minimal subset of neurons encoding the pre-defined property, vi) salient
neurons are sometimes shared between properties, vii) our analysis highlights
the presence of biases (for example gender) in the network. Our
cross-architectural comparison indicates that: i) the pretrained models capture
speaker-invariant information, and ii) CNN models are competitive with
Transformer models in encoding various understudied properties.
Authors' comments: Accepted in CSL journal. Keywords: Speech, Neuron Analysis,
Interpretibility, Diagnostic Classifier, AI explainability, End-to-End
Architecture
John Orlowski-Scherer, Luca Di Mascolo, Tanay Bhandarkar, Alex Manduca, Tony Mroczkowski, Stefania Amodeo, Nick Battaglia, Mark Brodwin et al.
Galaxy clusters are an important tool for cosmology, and their detection and
characterization are key goals for current and future surveys. Using data from
the Wide-field Infrared Survey Explorer (WISE), the Massive and Distant
Clusters of WISE Survey (MaDCoWS) located 2,839 significant galaxy
overdensities at redshifts $0.7\lesssim z\lesssim 1.5$, which included
extensive follow-up imaging from the Spitzer Space Telescope to determine
cluster richnesses. Concurrently, the Atacama Cosmology Telescope (ACT) has
produced large area mm-wave maps in three frequency bands along with a large
catalog of Sunyaev-Zeldovich (SZ) selected clusters, as part of its Data
Release 5 (DR5). Using the maps and cluster catalog from DR5, we explore the
scaling between SZ mass and cluster richness. We use complementary radio survey
data from the Very Large Array, submillimeter data from Herschel, and ACT
224~GHz data to assess the impact of contaminating sources on the SZ signals.
We then use a hierarchical Bayesian model to fit the mass-richness scaling
relation. We find that MaDCoWS clusters have submillimeter contamination which
is consistent with a gray-body spectrum, while the ACT clusters are consistent
with no submillimeter emission on average. We find the best fit ACT SZ mass vs.
MaDCoWS richness scaling relation has a slope of $\kappa =
1.84^{+0.15}_{-0.14}$, where the slope is defined as $M\propto
\lambda_{15}^{\kappa}$ where $\lambda_{15}$ is the richness. Additionally, we
find that the approximate level of in-fill of the ACT and MaDCoWS cluster SZ
signals to be at the percent level
Authors' comments: 25 pages, 17 Figures; accepted for publication in A&A