Haoxuan Jiang, Jianghui Ji, Liangliang Yu
In this work, we investigate the size, thermal inertia, surface roughness and
geometric albedo of 10 Vesta family asteroids by using the Advanced
Thermophysical Model (ATPM), based on the thermal infrared data acquired by
mainly NASA's Wide-field Infrared Survey Explorer (WISE). Here we show that the
average thermal inertia and geometric albedo of the investigated Vesta family
members are 42 $\rm J m^{-2} s^{-1/2} K^{-1}$ and 0.314, respectively, where
the derived effective diameters are less than 10 km. Moreover, the family
members have a relatively low roughness fraction on their surfaces. The
similarity in thermal inertia and geometric albedo among the V-type Vesta
family member may reveal their close connection in the origin and evolution. As
the fragments of the cratering event of Vesta, the family members may have
undergone similar evolution process, thereby leading to very close thermal
properties. Finally, we estimate their regolith grain sizes with different
volume filling factors.
Authors' comments: 29 pages, 40 figures, accepted for publication in AJ
Yuning You, Tianlong Chen, Zhangyang Wang, Yang Shen
Graph convolution networks (GCN) are increasingly popular in many
applications, yet remain notoriously hard to train over large graph datasets.
They need to compute node representations recursively from their neighbors.
Current GCN training algorithms suffer from either high computational costs
that grow exponentially with the number of layers, or high memory usage for
loading the entire graph and node embeddings. In this paper, we propose a novel
efficient layer-wise training framework for GCN (L-GCN), that disentangles
feature aggregation and feature transformation during training, hence greatly
reducing time and memory complexities. We present theoretical analysis for
L-GCN under the graph isomorphism framework, that L-GCN leads to as powerful
GCNs as the more costly conventional training algorithm does, under mild
conditions. We further propose L$^2$-GCN, which learns a controller for each
layer that can automatically adjust the training epochs per layer in L-GCN.
Experiments show that L-GCN is faster than state-of-the-arts by at least an
order of magnitude, with a consistent of memory usage not dependent on dataset
size, while maintaining comparable prediction performance. With the learned
controller, L$^2$-GCN can further cut the training time in half. Our codes are
available at https://github.com/Shen-Lab/L2-GCN.
Authors' comments: Supplementary materials are available at
https://yyou1996.github.io/files/cvpr2020_l2gcn_supplement.pdf. CVPR 2020
Alireza M. Javid, Arun Venkitaraman, Mikael Skoglund, Saikat Chatterjee
We design a ReLU-based multilayer neural network by mapping the feature
vectors to a higher dimensional space in every layer. We design the weight
matrices in every layer to ensure a reduction of the training cost as the
number of layers increases. Linear projection to the target in the higher
dimensional space leads to a lower training cost if a convex cost is minimized.
An $\ell_2$-norm convex constraint is used in the minimization to reduce the
generalization error and avoid overfitting. The regularization hyperparameters
of the network are derived analytically to guarantee a monotonic decrement of
the training cost, and therefore, it eliminates the need for cross-validation
to find the regularization hyperparameter in each layer. We show that the
proposed architecture is norm-preserving and provides an invertible feature
vector, and therefore, can be used to reduce the training cost of any other
learning method which employs linear projection to estimate the target.
Authors' comments: 2020 EURASIP Journal on Advances in Signal Processing
Jie Chen, Zhiheng Li, Jiebo Luo, Chenliang Xu
We address weakly-supervised video actor-action segmentation (VAAS), which
extends general video object segmentation (VOS) to additionally consider action
labels of the actors. The most successful methods on VOS synthesize a pool of
pseudo-annotations (PAs) and then refine them iteratively. However, they face
challenges as to how to select from a massive amount of PAs high-quality ones,
how to set an appropriate stop condition for weakly-supervised training, and
how to initialize PAs pertaining to VAAS. To overcome these challenges, we
propose a general Weakly-Supervised framework with a Wise Selection of training
samples and model evaluation criterion (WS^2). Instead of blindly trusting
quality-inconsistent PAs, WS^2 employs a learning-based selection to select
effective PAs and a novel region integrity criterion as a stopping condition
for weakly-supervised training. In addition, a 3D-Conv GCAM is devised to adapt
to the VAAS task. Extensive experiments show that WS^2 achieves
state-of-the-art performance on both weakly-supervised VOS and VAAS tasks and
is on par with the best fully-supervised method on VAAS.
Authors' comments: 11 pages, 8 figures, cvpr-2020 supplementary video:
https://youtu.be/CX1hEOV9tlo
Mohammad Etemad, Zahra Etemad, Amilcar Soares, Vania Bogorny, Stan Matwin, Luis Torgo
Large amounts of mobility data are being generated from many different sources, and several data mining methods have been proposed for this data. One of the most critical steps for trajectory data mining is segmentation. This task can be seen as a pre-processing step in which a trajectory is divided into several meaningful consecutive sub-sequences. This process is necessary because trajectory patterns may not hold in the entire trajectory but on trajectory parts. In this work, we propose a supervised trajectory segmentation algorithm, called Wise Sliding Window Segmentation (WS-II). It processes the trajectory coordinates to find behavioral changes in space and time, generating an error signal that is further used to train a binary classifier for segmenting trajectory data. This algorithm is flexible and can be used in different domains. We evaluate our method over three real datasets from different domains (meteorology, fishing, and individuals movements), and compare it with four other trajectory segmentation algorithms: OWS, GRASP-UTS, CB-SMoT, and SPD. We observed that the proposed algorithm achieves the highest performance for all datasets with statistically significant differences in terms of the harmonic mean of purity and coverage.
Giulia Bassignana, Jennifer Fransson, Vincent Henry, Olivier Colliot, Violetta Zujovic, Fabrizio De Vico Fallani
Identifying the nodes that have the potential to influence the state of a network is a relevant question for many complex systems. In many applications it is often essential to test the ability of an individual node to control a specific target subset of the network. In biological networks, this might provide precious information on how single genes regulate the expression of specific groups of molecules in the cell. Taking into account these constraints, we propose an optimized heuristic based on the Kalman rank condition to quantify the centrality of a node as the number of target nodes it can control. By introducing a hierarchy among the nodes in the target set, and performing a step-wise research, we ensure for sparse and directed networks the identification of a controllable driver-target configuration in a significantly reduced space and time complexity. We show how the method works for simple network configurations, then we use it to characterize the inflammatory pathways in molecular gene networks associated with macrophage dysfunction in patients with multiple sclerosis. Results indicate that the targeted secreted molecules can in general be controlled by a large number of driver nodes (51%) involved in different cell functions, i.e. sensing, signaling and transcription. However, during the inflammatory response only a moderate fraction of all the possible driver-target pairs are significantly coactivated, as measured by gene expression data obtained from human blood samples. Notably, they differ between multiple sclerosis patients and healthy controls, and we find that this is related to the presence of dysregulated genes along the controllable walks. Our method, that we name step-wise target controllability, represents a practical solution to identify controllable driver-target configurations in directed complex networks and test their relevance from a functional perspective.
Shuxin Wang, Shilei Cao, Dong Wei, Renzhen Wang, Kai Ma, Liansheng Wang, Deyu Meng, Yefeng Zheng
We introduce a one-shot segmentation method to alleviate the burden of manual
annotation for medical images. The main idea is to treat one-shot segmentation
as a classical atlas-based segmentation problem, where voxel-wise
correspondence from the atlas to the unlabelled data is learned. Subsequently,
segmentation label of the atlas can be transferred to the unlabelled data with
the learned correspondence. However, since ground truth correspondence between
images is usually unavailable, the learning system must be well-supervised to
avoid mode collapse and convergence failure. To overcome this difficulty, we
resort to the forward-backward consistency, which is widely used in
correspondence problems, and additionally learn the backward correspondences
from the warped atlases back to the original atlas. This cycle-correspondence
learning design enables a variety of extra, cycle-consistency-based supervision
signals to make the training process stable, while also boost the performance.
We demonstrate the superiority of our method over both deep learning-based
one-shot segmentation methods and a classical multi-atlas segmentation method
via thorough experiments.
Authors' comments: Accepted to Proc. IEEE Conf. Computer Vision and Pattern Recognition
2020
Krzysztof Debicki, Lanpeng Ji, Tomasz Rolski
We derive the exact asymptotics of \[ P\left( \sup_{t\ge 0} \Bigl( X_1(t) - \mu_1 t\Bigr)> u, \ \sup_{s\ge 0} \Bigl( X_2(s) - \mu_2 s\Bigr)> u \right), \ \ u\to\infty, \] where $(X_1(t),X_2(s))_{t,s\ge0}$ is a correlated two-dimensional Brownian motion with correlation $\rho\in[-1,1]$ and $\mu_1,\mu_2>0$. It appears that the play between $\rho$ and $\mu_1,\mu_2$ leads to several types of asymptotics. Although the exponent in the asymptotics as a function of $\rho$ is continuous, one can observe different types of prefactor functions depending on the range of $\rho$, which constitute a phase-type transition phenomena.
Sana Tonekaboni, Shalmali Joshi, Kieran Campbell, David Duvenaud, Anna Goldenberg
Explanations of time series models are useful for high stakes applications like healthcare but have received little attention in machine learning literature. We propose FIT, a framework that evaluates the importance of observations for a multivariate time-series black-box model by quantifying the shift in the predictive distribution over time. FIT defines the importance of an observation based on its contribution to the distributional shift under a KL-divergence that contrasts the predictive distribution against a counterfactual where the rest of the features are unobserved. We also demonstrate the need to control for time-dependent distribution shifts. We compare with state-of-the-art baselines on simulated and real-world clinical data and demonstrate that our approach is superior in identifying important time points and observations throughout the time series.
Marcus C. Christiansen, Christian Furrer
In the presence of monotone information, the stochastic Thiele equation describing the dynamics of state-wise prospective reserves is closely related to the classic martingale representation theorem. When the information utilized by the insurer is non-monotone, the classic martingale theory does not apply. By taking an infinitesimal approach, we derive a generalized stochastic Thiele equation that allows for information discarding. En passant, we solve some open problems for the classic case of monotone information. The results and their implication in practice are illustrated via examples where information is discarded upon and after stochastic retirement.
C. Lazzoni, R. Gratton, J. M. Alcalà, S. Desidera, A. Frasca, C. F. Manara, D. Mesa, E. Rigliaco et al.
Very recently, a second companion on wider orbit has been discovered around
GQ Lup. This is a low-mass accreting star partially obscured by a disk seen at
high inclination. If detected, this disk may be compared to the known disk
around the primary. We detected this disk on archive HST and WISE data. The
extended spectral energy distribution provided by these data confirms the
presence of accretion from Halpha emission and UV excess, and shows an IR
excess attributable to a warm disk. In addition, we resolved the disk on the
HST images. This is found to be roughly aligned with the disk of the primary.
Both of them are roughly aligned with the Lupus I dust filament containing GQ
Lup.
Authors' comments: 5 pages, 4 figures
Hang Zhang, Jinwei Zhang, Qihao Zhang, Jeremy Kim, Shun Zhang, Susan A. Gauthier, Pascal Spincemaille, Thanh D. Nguyen et al.
Brain lesion volume measured on T2 weighted MRI images is a clinically
important disease marker in multiple sclerosis (MS). Manual delineation of MS
lesions is a time-consuming and highly operator-dependent task, which is
influenced by lesion size, shape and conspicuity. Recently, automated lesion
segmentation algorithms based on deep neural networks have been developed with
promising results. In this paper, we propose a novel recurrent slice-wise
attention network (RSANet), which models 3D MRI images as sequences of slices
and captures long-range dependencies through a recurrent manner to utilize
contextual information of MS lesions. Experiments on a dataset with 43 patients
show that the proposed method outperforms the state-of-the-art approaches. Our
implementation is available online at https://github.com/tinymilky/RSANet.
Authors' comments: Accepted for publication in MICCAI 2019
Lei Huang, Jie Qin, Li Liu, Fan Zhu, Ling Shao
Conditioning analysis uncovers the landscape of an optimization objective by
exploring the spectrum of its curvature matrix. This has been well explored
theoretically for linear models. We extend this analysis to deep neural
networks (DNNs) in order to investigate their learning dynamics. To this end,
we propose layer-wise conditioning analysis, which explores the optimization
landscape with respect to each layer independently. Such an analysis is
theoretically supported under mild assumptions that approximately hold in
practice. Based on our analysis, we show that batch normalization (BN) can
stabilize the training, but sometimes result in the false impression of a local
minimum, which has detrimental effects on the learning. Besides, we
experimentally observe that BN can improve the layer-wise conditioning of the
optimization problem. Finally, we find that the last linear layer of a very
deep residual network displays ill-conditioned behavior. We solve this problem
by only adding one BN layer before the last linear layer, which achieves
improved performance over the original and pre-activation residual networks.
Authors' comments: Accepted to ECCV 2020. The code is available at:
https://github.com/huangleiBuaa/LayerwiseCA
Mathilde Guillemot, Catherine Heusele, Rodolphe Korichi, Sylvianne Schnebert, Liming Chen
The lack of transparency of neural networks stays a major break for their use. The Layerwise Relevance Propagation technique builds heat-maps representing the relevance of each input in the model s decision. The relevance spreads backward from the last to the first layer of the Deep Neural Network. Layer-wise Relevance Propagation does not manage normalization layers, in this work we suggest a method to include normalization layers. Specifically, we build an equivalent network fusing normalization layers and convolutional or fully connected layers. Heatmaps obtained with our method on MNIST and CIFAR 10 datasets are more accurate for convolutional layers. Our study also prevents from using Layerwise Relevance Propagation with networks including a combination of connected layers and normalization layer.
L. R. Bedin, C. Fontanive
In the second paper of this series we perfected our method of linking high
precision Hubble Space Telescope astrometry to the high-accuracy Gaia DR2
absolute reference system to overcome the limitations of relative astrometry
with narrow-field cameras. Our test case here is the Y brown dwarf WISE
J163940.83-684738.6, observed at different epochs spread over a 6-yr time
baseline with the Infra-Red channel of the Wide Field Camera 3. We derived
significantly improved astrometric parameters compared to previous
determinations, finding: (mu_RAcosDc,mu_DC,parallax) =
(577.21+/-0.24mas/yr,-3108.39+/-0.27mas/yr,210.4+/-1.8mas). In particular, our
derived absolute parallax corresponds to a distance of 4.75+/-0.05pc for the
faint ultracool dwarf.
Authors' comments: 10 pages, 3 tables, 3 figures (fig.1 at low resolution). Accepted for
publication in MNRAS on 2020 February 21
Youngmin Ro, Jin Young Choi
Existing fine-tuning methods use a single learning rate over all layers. In
this paper, first, we discuss that trends of layer-wise weight variations by
fine-tuning using a single learning rate do not match the well-known notion
that lower-level layers extract general features and higher-level layers
extract specific features. Based on our discussion, we propose an algorithm
that improves fine-tuning performance and reduces network complexity through
layer-wise pruning and auto-tuning of layer-wise learning rates. The proposed
algorithm has verified the effectiveness by achieving state-of-the-art
performance on the image retrieval benchmark datasets (CUB-200, Cars-196,
Stanford online product, and Inshop). Code is available at
https://github.com/youngminPIL/AutoLR.
Authors' comments: Accepted to AAAI 2021
Quanyu Liao, Xin Wang, Bin Kong, Siwei Lyu, Youbing Yin, Qi Song, Xi Wu
Deep neural networks have been demonstrated to be vulnerable to adversarial attacks: subtle perturbations can completely change the classification results. Their vulnerability has led to a surge of research in this direction. However, most works dedicated to attacking anchor-based object detection models. In this work, we aim to present an effective and efficient algorithm to generate adversarial examples to attack anchor-free object models based on two approaches. First, we conduct category-wise instead of instance-wise attacks on the object detectors. Second, we leverage the high-level semantic information to generate the adversarial examples. Surprisingly, the generated adversarial examples it not only able to effectively attack the targeted anchor-free object detector but also to be transferred to attack other object detectors, even anchor-based detectors such as Faster R-CNN.
Qian Liu, Dongyang Cai, Jie Liu, Nan Ding, Tao Wang
In this report, the method for the iqiyi submission to the task of
ActivityNet 2019 Kinetics-700 challenge is described. Three models are involved
in the model ensemble stage: TSN, HG-NL and StNet. We propose the hierarchical
group-wise non-local (HG-NL) module for frame-level features aggregation for
video classification. The standard non-local (NL) module is effective in
aggregating frame-level features on the task of video classification but
presents low parameters efficiency and high computational cost. The HG-NL
method involves a hierarchical group-wise structure and generates multiple
attention maps to enhance performance. Basing on this hierarchical group-wise
structure, the proposed method has competitive accuracy, fewer parameters and
smaller computational cost than the standard NL. For the task of ActivityNet
2019 Kinetics-700 challenge, after model ensemble, we finally obtain an
averaged top-1 and top-5 error percentage 28.444% on the test set.
Authors' comments: Tech report
Cyprien Ruffino, Romain Hérault, Eric Laloy, Gilles Gasso
Generative Adversarial Networks (GANs) have proven successful for unsupervised image generation. Several works have extended GANs to image inpainting by conditioning the generation with parts of the image to be reconstructed. Despite their success, these methods have limitations in settings where only a small subset of the image pixels is known beforehand. In this paper we investigate the effectiveness of conditioning GANs when very few pixel values are provided. We propose a modelling framework which results in adding an explicit cost term to the GAN objective function to enforce pixel-wise conditioning. We investigate the influence of this regularization term on the quality of the generated images and the fulfillment of the given pixel constraints. Using the recent PacGAN technique, we ensure that we keep diversity in the generated samples. Conducted experiments on FashionMNIST show that the regularization term effectively controls the trade-off between quality of the generated images and the conditioning. Experimental evaluation on the CIFAR-10 and CelebA datasets evidences that our method achieves accurate results both visually and quantitatively in term of Fr\'echet Inception Distance, while still enforcing the pixel conditioning. We also evaluate our method on a texture image generation task using fully-convolutional networks. As a final contribution, we apply the method to a classical geological simulation application.
Jin Jin, Lin Zhang, Ethan Leng, Gregory J. Metzger, Joseph S. Koopmeiners
Multi-parametric magnetic resonance imaging (mpMRI) plays an increasingly
important role in the diagnosis of prostate cancer. Various computer-aided
detection algorithms have been proposed for automated prostate cancer detection
by combining information from various mpMRI data components. However, there
exist other features of mpMRI, including the spatial correlation between voxels
and between-patient heterogeneity in the mpMRI parameters, that have not been
fully explored in the literature but could potentially improve cancer detection
if leveraged appropriately. This paper proposes novel voxel-wise Bayesian
classifiers for prostate cancer that account for the spatial correlation and
between-patient heterogeneity in mpMRI. Modeling the spatial correlation is
challenging due to the extreme high dimensionality of the data, and we consider
three computationally efficient approaches using Nearest Neighbor Gaussian
Process (NNGP), knot-based reduced-rank approximation, and a conditional
autoregressive (CAR) model, respectively. The between-patient heterogeneity is
accounted for by adding a subject-specific random intercept on the mpMRI
parameter model. Simulation results show that properly modeling the spatial
correlation and between-patient heterogeneity improves classification accuracy.
Application to in vivo data illustrates that classification is improved by
spatial modeling using NNGP and reduced-rank approximation but not the CAR
model, while modeling the between-patient heterogeneity does not further
improve our classifier. Among our proposed models, the NNGP-based model is
recommended considering its robust classification accuracy and high
computational efficiency.
Authors' comments: 21 pages, 4 figures