Yun Liu, Yujun Shi, JiaWang Bian, Le Zhang, Ming-Ming Cheng, Jiashi Feng
Deep learning stands at the forefront in many computer vision tasks. However, deep neural networks are usually data-hungry and require a huge amount of well-annotated training samples. Collecting sufficient annotated data is very expensive in many applications, especially for pixel-level prediction tasks such as semantic segmentation. To solve this fundamental issue, we consider a new challenging vision task, Internetly supervised semantic segmentation, which only uses Internet data with noisy image-level supervision of corresponding query keywords for segmentation model training. We address this task by proposing the following solution. A class-specific attention model unifying multiscale forward and backward convolutional features is proposed to provide initial segmentation "ground truth". The model trained with such noisy annotations is then improved by an online fine-tuning procedure. It achieves state-of-the-art performance under the weakly-supervised setting on PASCAL VOC2012 dataset. The proposed framework also paves a new way towards learning from the Internet without human interaction and could serve as a strong baseline therein. Code and data will be released upon the paper acceptance.
Shu Kong, Charless Fowlkes
To achieve parsimonious inference in per-pixel labeling tasks with a limited
computational budget, we propose a \emph{Pixel-wise Attentional Gating} unit
(\emph{PAG}) that learns to selectively process a subset of spatial locations
at each layer of a deep convolutional network. PAG is a generic,
architecture-independent, problem-agnostic mechanism that can be readily
"plugged in" to an existing model with fine-tuning. We utilize PAG in two ways:
1) learning spatially varying pooling fields that improve model performance
without the extra computation cost associated with multi-scale pooling, and 2)
learning a dynamic computation policy for each pixel to decrease total
computation while maintaining accuracy.
We extensively evaluate PAG on a variety of per-pixel labeling tasks,
including semantic segmentation, boundary detection, monocular depth and
surface normal estimation. We demonstrate that PAG allows competitive or
state-of-the-art performance on these tasks. Our experiments show that PAG
learns dynamic spatial allocation of computation over the input image which
provides better performance trade-offs compared to related approaches (e.g.,
truncating deep models or dynamically skipping whole layers). Generally, we
observe PAG can reduce computation by $10\%$ without noticeable loss in
accuracy and performance degrades gracefully when imposing stronger
computational constraints.
Authors' comments: https://www.ics.uci.edu/~skong2/PAG.html
Chen Liu, Jimei Yang, Duygu Ceylan, Ersin Yumer, Yasutaka Furukawa
This paper proposes a deep neural network (DNN) for piece-wise planar
depthmap reconstruction from a single RGB image. While DNNs have brought
remarkable progress to single-image depth prediction, piece-wise planar
depthmap reconstruction requires a structured geometry representation, and has
been a difficult task to master even for DNNs. The proposed end-to-end DNN
learns to directly infer a set of plane parameters and corresponding plane
segmentation masks from a single RGB image. We have generated more than 50,000
piece-wise planar depthmaps for training and testing from ScanNet, a
large-scale RGBD video database. Our qualitative and quantitative evaluations
demonstrate that the proposed approach outperforms baseline methods in terms of
both plane segmentation and depth estimation accuracy. To the best of our
knowledge, this paper presents the first end-to-end neural architecture for
piece-wise planar reconstruction from a single RGB image. Code and data are
available at https://github.com/art-programmer/PlaneNet.
Authors' comments: CVPR 2018
Zuxuan Wu, Xintong Han, Yen-Liang Lin, Mustafa Gkhan Uzunbas, Tom Goldstein, Ser Nam Lim, Larry S. Davis
Harvesting dense pixel-level annotations to train deep neural networks for semantic segmentation is extremely expensive and unwieldy at scale. While learning from synthetic data where labels are readily available sounds promising, performance degrades significantly when testing on novel realistic data due to domain discrepancies. We present Dual Channel-wise Alignment Networks (DCAN), a simple yet effective approach to reduce domain shift at both pixel-level and feature-level. Exploring statistics in each channel of CNN feature maps, our framework performs channel-wise feature alignment, which preserves spatial structures and semantic information, in both an image generator and a segmentation network. In particular, given an image from the source domain and unlabeled samples from the target domain, the generator synthesizes new images on-the-fly to resemble samples from the target domain in appearance and the segmentation network further refines high-level features before predicting semantic maps, both of which leverage feature statistics of sampled images from the target domain. Unlike much recent and concurrent work relying on adversarial training, our framework is lightweight and easy to train. Extensive experiments on adapting models trained on synthetic segmentation benchmarks to real urban scenes demonstrate the effectiveness of the proposed framework.
Yuhua Chen, Jordi Pont-Tuset, Alberto Montes, Luc Van Gool
This paper tackles the problem of video object segmentation, given some user
annotation which indicates the object of interest. The problem is formulated as
pixel-wise retrieval in a learned embedding space: we embed pixels of the same
object instance into the vicinity of each other, using a fully convolutional
network trained by a modified triplet loss as the embedding model. Then the
annotated pixels are set as reference and the rest of the pixels are classified
using a nearest-neighbor approach. The proposed method supports different kinds
of user input such as segmentation mask in the first frame (semi-supervised
scenario), or a sparse set of clicked points (interactive scenario). In the
semi-supervised scenario, we achieve results competitive with the state of the
art but at a fraction of computation cost (275 milliseconds per frame). In the
interactive scenario where the user is able to refine their input iteratively,
the proposed method provides instant response to each input, and reaches
comparable quality to competing methods with much less interaction.
Authors' comments: Accepted to CVPR 2018
Josef Hanus, Marco Delbo, Josef Durech, Victor Ali-Lagoa
By means of a varied-shape thermophysical model (VS-TPM) of Hanus et al.
(2015) that takes into account asteroid shape and pole uncertainties, we
analyze the thermal IR data acquired by the NASA's WISE satellite of about 300
asteroids with derived convex shape models. We utilize publicly available
convex shape models and rotation states as input for the TPM. For more than one
hundred asteroids, the TPM gives us an acceptable fit to the thermal IR data
allowing us to report their size, thermal inertia, surface roughness or visible
geometric albedo. This work more than doubles the number of asteroids with
determined thermophysical properties. In the remaining cases, the shape model
and pole orientation uncertainties, specific rotation or thermophysical
properties, poor thermal IR data or their coverage prevent the determination of
reliable thermophysical properties. Finally, we present the main results of the
statistical study of derived thermophysical parameters within the whole
population of main-belt asteroids and within few asteroid families. Our sizes
are, in average, consistent with the radiometric sizes reported by Mainzer et
al. (2016). The thermal inertia increases with decreasing size, but a large
range of thermal inertia values is observed within the similar size ranges
between D~10-100 km. We derived unexpectedly low thermal inertias (<20 SI) for
several asteroids with sizes 10<D<50 km, indicating a very fine and mature
regolith on their surface. The thermal inertia values seem to be consistent
within several collisional families. The fast rotators with P<4 h tend to have
slightly larger thermal inertia values, so probably are not covered by a fine
regolith. This could be explained, for example, by the loss of the fine
regolith due to the centrifugal force, or by the ineffectiveness of the
regolith production (e.g., by the thermal cracking mechanism of Delbo' et al.
2014).
Authors' comments: Published in Icarus
Sven Zimmermann, Wassilij Kopylov, Gernot Schaller
We apply a measurement-based closed-loop control scheme to the dissipative Lipkin-Meshkov-Glick model. Specifically, we use the Wiseman-Milburn feedback master equation to control its quantum phase transition.For the steady state properties of the Lipkin-Meshkov-Glick system under feedback we show that the considered control scheme changes the critical point of the phase transition. Finite-size corrections blur these signatures in operator expectation values but entanglement measures such as concurrence can be used to locate the transition point more precisely. We find that with feedback, the position of the critical point can be shifted to smaller spin-spin interactions, which is potentially useful for setups with limited control on these.
Yu Shi, Jian Li, Zhize Li
Gradient Boosted Decision Trees (GBDT) is a very successful ensemble learning algorithm widely used across a variety of applications. Recently, several variants of GBDT training algorithms and implementations have been designed and heavily optimized in some very popular open sourced toolkits including XGBoost, LightGBM and CatBoost. In this paper, we show that both the accuracy and efficiency of GBDT can be further enhanced by using more complex base learners. Specifically, we extend gradient boosting to use piecewise linear regression trees (PL Trees), instead of piecewise constant regression trees, as base learners. We show that PL Trees can accelerate convergence of GBDT and improve the accuracy. We also propose some optimization tricks to substantially reduce the training time of PL Trees, with little sacrifice of accuracy. Moreover, we propose several implementation techniques to speedup our algorithm on modern computer architectures with powerful Single Instruction Multiple Data (SIMD) parallelism. The experimental results show that GBDT with PL Trees can provide very competitive testing accuracy with comparable or less training time.
Shota Katayama, Hironori Fujisawa, Mathias Drton
Graphical modeling explores dependences among a collection of variables by inferring a graph that encodes pairwise conditional independences. For jointly Gaussian variables, this translates into detecting the support of the precision matrix. Many modern applications feature high-dimensional and contaminated data that complicate this task. In particular, traditional robust methods that down-weight entire observation vectors are often inappropriate as high-dimensional data may feature partial contamination in many observations. We tackle this problem by giving a robust method for sparse precision matrix estimation based on the $\gamma$-divergence under a cell-wise contamination model. Simulation studies demonstrate that our procedure outperforms existing methods especially for highly contaminated data.
Anh Quach, Aravind Prakash, Lok Kwong Yan
Programs are bloated. Our study shows that only 5% of libc is used on average
across the Ubuntu Desktop environment (2016 programs); the heaviest user, vlc
media player, only needed 18%.
In this paper: (1) We present a debloating framework built on a compiler
toolchain that can successfully debloat programs (shared/static libraries and
executables). Our solution can successfully compile and load most libraries on
Ubuntu Desktop 16.04. (2) We demonstrate the elimination of over 79% of code
from coreutils and 86% of code from SPEC CPU 2006 benchmark programs without
affecting functionality. We show that even complex programs such as Firefox and
curl can be debloated without a need to recompile. (3) We demonstrate the
security impact of debloating by eliminating over 71% of reusable code gadgets
from the coreutils suite and show that unused code that contains real-world
vulnerabilities can also be successfully eliminated without adverse effects on
the program. (4) We incur a low load time overhead.
Authors' comments: Usenix Security 2018
Avi Ben-Cohen, Eyal Klang, Michal Marianne Amitai, Jacob Goldberger, Hayit Greenspan
In this work we propose a method for anatomical data augmentation that is
based on using slices of computed tomography (CT) examinations that are
adjacent to labeled slices as another resource of labeled data for training the
network. The extended labeled data is used to train a U-net network for a
pixel-wise classification into different hepatic lesions and normal liver
tissues. Our dataset contains CT examinations from 140 patients with 333 CT
images annotated by an expert radiologist. We tested our approach and compared
it to the conventional training process. Results indicate superiority of our
method. Using the anatomical data augmentation we achieved an improvement of 3%
in the success rate, 5% in the classification accuracy, and 4% in Dice.
Authors' comments: To be presented at IEEE ISBI 2018
James Schombert
Multi-color photometry is presented for a sample of 60 dwarf ellipticals (dE)
selected by morphology. The sample uses data from GALEX, SDSS and WISE to
investigate the colors in filters NUV, ugri and W1 (3.4mum). We confirm the
blueward shift in the color-magnitude relation for dwarf ellipticals, compared
to CMR for bright ellipticals, as seen in previous studies. However, we find
the deviation in color across the UV to near-IR for dE's is a strong signal of
a younger age for dwarf ellipticals, one that indicates decreasing mean age
with lower stellar mass. Lower mass dE's are found to have mean ages of 4 Gyrs
and mean [Fe/H] values of -1.2. Age and metallicity increase to the most
massive dE's with mean ages similar to normal ellipticals (12 Gyrs) and their
lowest metallicities ([Fe/H] = -0.3). Deduced initial star formation rates for
dE's, combined with their current metallicities and central stellar densities,
suggests a connection between field LSB dwarfs and cluster dE's, where the
cluster environment halts star formation for dE's triggering a separate
evolutionary path.
Authors' comments: 15 pages, 9 figure, accepted for AJ. arXiv admin note: text overlap
with arXiv:1609.07500
A. Solarz, M. Bilicki, A. Pollo
Automatic source detection and classification tools based on machine learning
(ML) algorithms are growing in popularity due to their efficiency when dealing
with large amounts of data simultaneously and their ability to work in
multidimensional parameter spaces. In this work, we present a new, automated
method of outlier selection based on support vector machine (SVM) algorithm
called one-class SVM (OCSVM), which uses the training data as one class to
construct a model of 'normality' in order to recognize novel points. We test
the performance of OCSVM algorithm on \textit{Wide-field Infrared Survey
Explorer (WISE)} data trained on the Sloan Digital Sky Survey (SDSS) sources.
Among others, we find $\sim 40,000$ sources with abnormal patterns which can be
associated with obscured and unobscured active galactic nuclei (AGN) source
candidates. We present the preliminary estimation of the clustering properties
of these objects and find that the unobscured AGN candidates are preferentially
found in less massive dark matter haloes ($M_{DMH}\sim10^{12.4}$) than the
obscured candidates ($M_{DMH}\sim 10^{13.2}$). This result contradicts the
unification theory of AGN sources and indicates that the obscured and
unobscured phases of AGN activity take place in different evolutionary paths
defined by different environments.
Authors' comments: 4 figures, 6 pages
Agata Mosinska, Pablo Marquez-Neila, Mateusz Kozinski, Pascal Fua
Delineation of curvilinear structures is an important problem in Computer Vision with multiple practical applications. With the advent of Deep Learning, many current approaches on automatic delineation have focused on finding more powerful deep architectures, but have continued using the habitual pixel-wise losses such as binary cross-entropy. In this paper we claim that pixel-wise losses alone are unsuitable for this problem because of their inability to reflect the topological impact of mistakes in the final prediction. We propose a new loss term that is aware of the higher-order topological features of linear structures. We also introduce a refinement pipeline that iteratively applies the same model over the previous delineation to refine the predictions at each step while keeping the number of parameters and the complexity of the model constant. When combined with the standard pixel-wise loss, both our new loss term and our iterative refinement boost the quality of the predicted delineations, in some cases almost doubling the accuracy as compared to the same classifier trained with the binary cross-entropy alone. We show that our approach outperforms state-of-the-art methods on a wide range of data, from microscopy to aerial images.
Thanh T. Nguyen, Jaesik Choi
Information Bottleneck (IB) is a generalization of rate-distortion theory
that naturally incorporates compression and relevance trade-offs for learning.
Though the original IB has been extensively studied, there has not been much
understanding of multiple bottlenecks which better fit in the context of neural
networks. In this work, we propose Information Multi-Bottlenecks (IMBs) as an
extension of IB to multiple bottlenecks which has a direct application to
training neural networks by considering layers as multiple bottlenecks and
weights as parameterized encoders and decoders. We show that the multiple
optimality of IMB is not simultaneously achievable for stochastic encoders. We
thus propose a simple compromised scheme of IMB which in turn generalizes
maximum likelihood estimate (MLE) principle in the context of stochastic neural
networks. We demonstrate the effectiveness of IMB on classification tasks and
adversarial robustness in MNIST and CIFAR10.
Authors' comments: published in Entropy journal
Ganzhao Yuan, Haoxian Tan, Wei-Shi Zheng
Sparse inverse covariance selection is a fundamental problem for analyzing dependencies in high dimensional data. However, such a problem is difficult to solve since it is NP-hard. Existing solutions are primarily based on convex approximation and iterative hard thresholding, which only lead to sub-optimal solutions. In this work, we propose a coordinate-wise optimization algorithm to solve this problem which is guaranteed to converge to a coordinate-wise minimum point. The algorithm iteratively and greedily selects one variable or swaps two variables to identify the support set, and then solves a reduced convex optimization problem over the support set to achieve the greatest descent. As a side contribution of this paper, we propose a Newton-like algorithm to solve the reduced convex sub-problem, which is proven to always converge to the optimal solution with global linear convergence rate and local quadratic convergence rate. Finally, we demonstrate the efficacy of our method on synthetic data and real-world data sets. As a result, the proposed method consistently outperforms existing solutions in terms of accuracy.
M. Holler, J. Chevalier, J. -P. Lenain, M. de Naurois, D. Sanchez
We present a new paradigm for the simulation of arrays of Imaging Atmospheric
Cherenkov Telescopes (IACTs) which overcomes limitations of current approaches.
Up to now, all major IACT experiments rely on the same Monte-Carlo simulation
strategy, using predefined observation and instrument settings. Simulations
with varying parameters are generated to provide better estimates of the
Instrument Response Functions (IRFs) of different observations. However, a
large fraction of the simulation configuration remains preserved, leading to
complete negligence of all related influences. Additionally, the simulation
scheme relies on interpolations between different array configurations, which
are never fully reproducing the actual configuration for a given observation.
Interpolations are usually performed on zenith angles, off-axis angles, array
multiplicity, and the optical response of the instrument. With the advent of
hybrid systems consisting of a large number of IACTs with different sizes,
types, and camera configurations, the complexity of the interpolation and the
size of the phase space becomes increasingly prohibitive. Going beyond the
existing approaches, we introduce a new simulation and analysis concept which
takes into account the actual observation conditions as well as individual
telescope configurations of each observation run of a given data set. These
run-wise simulations (RWS) thus exhibit considerably reduced systematic
uncertainties compared to the existing approach, and are also more
computationally efficient and simple. The RWS framework has been implemented in
the H.E.S.S. software and tested, and is already being exploited in science
analysis.
Authors' comments: Proceedings of the 35th International Cosmic Ray Conference; Busan,
Korea
Christopher A. Theissen
This paper uses the multi-epoch astrometry from the Wide-field Infrared
Survey Explorer (WISE) to demonstrate a method to measure proper motions and
trigonometric parallaxes with precisions of $\sim$4 mas yr$^{-1}$ and $\sim$7
mas, respectively, for low-mass stars and brown dwarfs. This method relies on
WISE single exposures (Level 1b frames) and a Markov Chain Monte Carlo method.
The limitations of Gaia in observing very low-mass stars and brown dwarfs are
discussed, and it is shown that WISE will be able to measure astrometry past
the 95% completeness limit and magnitude limit of Gaia (L, T, and Y dwarfs
fainter than $G\approx19$ and $G=21$, respectively). This method is applied to
WISE data of 20 nearby ($\lesssim17$ pc) dwarfs with spectral types between
M6-Y2 and previously measured trigonometric parallaxes. Also provided are WISE
astrometric measurements for 23 additional low-mass dwarfs with spectral types
between M6-T7 and estimated photometric distances $<17$ pc. Only nine of these
objects contain parallaxes within Gaia Data Release 2.
Authors' comments: Accepted to ApJ
Angelos-Christos G. Anadiotis, Laura Galluccio, Sebastiano Milardo, Giacomo Morabito, Sergio Palazzo
SD-WISE is a complete software-defined solution for wireless sensor (and actuator) networks (WSNs). SD-WISE has several unique features making it a flexible and expandable solution that can be applied in heterogeneous application domains. Its fundamental feature is that it provides software abstractions of the nodes' resources on both the controller and the nodes sides. By leveraging these abstractions, SD-WISE (i) extends the Software Defined Networking (SDN) approach to WSNs, introducing a more flexible way to define flows as well as the possibility to control the duty cycles of the node radios to increase energy efficiency; (ii) enables network function virtualization (NFV) in WSNs; (iii) leverages the tight interplay between trusted hardware and software to support regulation compliant behavior of sensor nodes. In this paper SD-WISE is introduced, its major operations are described, and its features are demonstrated in three relevant scenarios, thus assessing the effectiveness of the approach.
M. E. Cluver, T. H. Jarrett, D. A. Dale, J. -D. T. Smith, T. August, M. J. I. Brown
We present accurate resolved $WISE$ photometry of galaxies in the combined
SINGS and KINGFISH sample. The luminosities in the W3 12$\mu$m and W4 23$\mu$m
bands are calibrated to star formation rates (SFRs) derived using the total
infrared luminosity, avoiding UV/optical uncertainties due to dust extinction
corrections. The W3 relation has a 1-$\sigma$ scatter of 0.15 dex over nearly 5
orders of magnitude in SFR and 12$\mu$m luminosity, and a range in host stellar
mass from dwarf (10$^7$ M$_\odot$) to $\sim3\times$M$_\star$ (10$^{11.5}$
M$_\odot$) galaxies. In the absence of deep silicate absorption features and
powerful active galactic nuclei, we expect this to be a reliable SFR indicator
chiefly due to the broad nature of the W3 band. By contrast the W4 SFR relation
shows more scatter (1-$\sigma =$ 0.18 dex). Both relations show reasonable
agreement with radio continuum-derived SFRs and excellent accordance with
so-called "hybrid" H$\alpha + 24 \mu$m and FUV$+$24$\mu$m indicators. Moreover,
the $WISE$ SFR relations appear to be insensitive to the metallicity range in
the sample. We also compare our results with IRAS-selected luminous infrared
galaxies, showing that the $WISE$ relations maintain concordance, but
systematically deviate for the most extreme galaxies. Given the all-sky
coverage of $WISE$ and the performance of the W3 band as a SFR indicator, the
$L_{12\mu \rm m}$ SFR relation could be of great use to studies of nearby
galaxies and forthcoming large area surveys at optical and radio wavelengths.
Authors' comments: 34 pages, 6 tables include complete WISE photometry of the combined
SINGS+KINGFISH sample. Accepted for publication in The Astrophysical Journal