Gilles Blanchard, Pierre Neuvial, Etienne Roquain
We introduce a general methodology for post hoc inference in a large-scale multiple testing framework. The approach is called "user-agnostic" in the sense that the statistical guarantee on the number of correct rejections holds for any set of candidate items selected by the user (after having seen the data). This task is investigated by defining a suitable criterion, named the joint-family-wise-error rate (JER for short). We propose several procedures for controlling the JER, with a special focus on incorporating dependencies while adapting to the unknown quantity of signal (via a step-down approach). We show that our proposed setting incorporates as particular cases a version of the higher criticism as well as the closed testing based approach of Goeman and Solari (2011). Our theoretical statements are supported by numerical experiments.
Bo Yang, Hui Liu, He Zhong, Zhangxin Chen
This research investigates the implementation mechanism of block-wise ILU(k)
preconditioner on GPU. The block-wise ILU(k) algorithm requires both the level
k and the block size to be designed as variables. A decoupled ILU(k) algorithm
consists of a symbolic phase and a factorization phase. In the symbolic phase,
a ILU(k) nonzero pattern is established from the point-wise structure extracted
from a block-wise matrix. In the factorization phase, the block-wise matrix
with a variable block size is factorized into a block lower triangular matrix
and a block upper triangular matrix. And a further diagonal factorization is
required to perform on the block upper triangular matrix for adapting a
parallel triangular solver on GPU.We also present the numerical experiments to
study the preconditioner actions on different k levels and block sizes.
Authors' comments: 14 pages
Anatol Odzijewicz, Grzegorz Jakimowicz, Aneta Sliżewska
In this paper we investigate fiber-wise linear complex Banach sub-Poisson
structures defined canonically by the structure of a W*-algebra M. In
particular we show that these structures are arranged in the short exact
sequence of complex Banach sub-Poisson VB-groupoids with the groupoid of
partially invertible elements of M as the side groupoid.
Authors' comments: 52 pages
Vitaly Feldman, Badih Ghazi
Several well-studied models of access to data samples, including statistical
queries, local differential privacy and low-communication algorithms rely on
queries that provide information about a function of a single sample. (For
example, a statistical query (SQ) gives an estimate of $Ex_{x \sim D}[q(x)]$
for any choice of the query function $q$ mapping $X$ to the reals, where $D$ is
an unknown data distribution over $X$.) Yet some data analysis algorithms rely
on properties of functions that depend on multiple samples. Such algorithms
would be naturally implemented using $k$-wise queries each of which is
specified by a function $q$ mapping $X^k$ to the reals. Hence it is natural to
ask whether algorithms using $k$-wise queries can solve learning problems more
efficiently and by how much.
Blum, Kalai and Wasserman (2003) showed that for any weak PAC learning
problem over a fixed distribution, the complexity of learning with $k$-wise SQs
is smaller than the (unary) SQ complexity by a factor of at most $2^k$. We show
that for more general problems over distributions the picture is substantially
richer. For every $k$, the complexity of distribution-independent PAC learning
with $k$-wise queries can be exponentially larger than learning with
$(k+1)$-wise queries. We then give two approaches for simulating a $k$-wise
query using unary queries. The first approach exploits the structure of the
problem that needs to be solved. It generalizes and strengthens (exponentially)
the results of Blum et al.. It allows us to derive strong lower bounds for
learning DNF formulas and stochastic constraint satisfaction problems that hold
against algorithms using $k$-wise queries. The second approach exploits the
$k$-party communication complexity of the $k$-wise query function.
Authors' comments: 32 pages, Appeared in Innovations in Theoretical Computer Science
(ITCS) 2017
Jiwoong Kim
Application of the minimum distance method to the linear regression model for estimating regression parameters is a difficult and time-consuming process due to the complexity of its distance function, and hence, it is computationally expensive. To deal with the computational cost, this paper proposes a fast algorithm which mainly uses technique of coordinate-wise minimization in order to estimate the regression parameters. R package based on the proposed algorithm and written in Rcpp is available online.
Žiga Emeršič, Luka Lan Gabriel, Vitomir Štruc, Peter Peer
Object detection and segmentation represents the basis for many tasks in
computer and machine vision. In biometric recognition systems the detection of
the region-of-interest (ROI) is one of the most crucial steps in the overall
processing pipeline, significantly impacting the performance of the entire
recognition system. Existing approaches to ear detection, for example, are
commonly susceptible to the presence of severe occlusions, ear accessories or
variable illumination conditions and often deteriorate in their performance if
applied on ear images captured in unconstrained settings. To address these
shortcomings, we present in this paper a novel ear detection technique based on
convolutional encoder-decoder networks (CEDs). For our technique, we formulate
the problem of ear detection as a two-class segmentation problem and train a
convolutional encoder-decoder network based on the SegNet architecture to
distinguish between image-pixels belonging to either the ear or the non-ear
class. The output of the network is then post-processed to further refine the
segmentation result and return the final locations of the ears in the input
image. Different from competing techniques from the literature, our approach
does not simply return a bounding box around the detected ear, but provides
detailed, pixel-wise information about the location of the ears in the image.
Our experiments on a dataset gathered from the web (a.k.a. in the wild) show
that the proposed technique ensures good detection results in the presence of
various covariate factors and significantly outperforms the existing
state-of-the-art.
Authors' comments: 12 pages
Vijaya Krishna Yalavarthi, Xiangyu Ke, Arijit Khan
Crowdsourcing is becoming increasingly important in entity resolution tasks
due to their inherent complexity such as clustering of images and natural
language processing. Humans can provide more insightful information for these
difficult problems compared to machine-based automatic techniques.
Nevertheless, human workers can make mistakes due to lack of domain expertise
or seriousness, ambiguity, or even due to malicious intents. The
state-of-the-art literature usually deals with human errors via majority voting
or by assigning a universal error rate over crowd workers. However, such
approaches are incomplete, and often inconsistent, because the expertise of
crowd workers are diverse with possible biases, thereby making it largely
inappropriate to assume a universal error rate for all workers over all
crowdsourcing tasks.
To this end, we mitigate the above challenges by considering an uncertain
graph model, where the edge probability between two records A and B denotes the
ratio of crowd workers who voted Yes on the question if A and B are same
entity. In order to reflect independence across different crowdsourcing tasks,
we apply the well-established notion of possible worlds, and develop
parameter-free algorithms both for next crowdsourcing, as well as for entity
resolution problems. In particular, using our framework, the problem of entity
resolution becomes equivalent to finding the maximum-likelihood clustering;
whereas for the next crowdsourcing, we identify the record pair that maximally
increases the reliability of the maximum-likelihood clustering. Based on
detailed empirical analysis over real-world datasets, we find that our proposed
solution, PERC (probabilistic entity resolution with imperfect crowd) improves
the quality by 15% and reduces the overall cost by 50% for the
crowdsourcing-based entity resolution problem.
Authors' comments: 10 Pages, 11 Figures
Aditya A. Shastri, Deepti Tamrakar, Kapil Ahuja
Breast cancer is becoming pervasive with each passing day. Hence, its early
detection is a big step in saving the life of any patient. Mammography is a
common tool in breast cancer diagnosis. The most important step here is
classification of mammogram patches as normal-abnormal and benign-malignant.
Texture of a breast in a mammogram patch plays a significant role in these
classifications. We propose a variation of Histogram of Gradients (HOG) and
Gabor filter combination called Histogram of Oriented Texture (HOT) that
exploits this fact. We also revisit the Pass Band - Discrete Cosine Transform
(PB-DCT) descriptor that captures texture information well. All features of a
mammogram patch may not be useful. Hence, we apply a feature selection
technique called Discrimination Potentiality (DP). Our resulting descriptors,
DP-HOT and DP-PB-DCT, are compared with the standard descriptors.
Density of a mammogram patch is important for classification, and has not
been studied exhaustively. The Image Retrieval in Medical Application (IRMA)
database from RWTH Aachen, Germany is a standard database that provides
mammogram patches, and most researchers have tested their frameworks only on a
subset of patches from this database. We apply our two new descriptors on all
images of the IRMA database for density wise classification, and compare with
the standard descriptors. We achieve higher accuracy than all of the existing
standard descriptors (more than 92%).
Authors' comments: 28 Pages, 8 Figures, and 7 Tables
Yanting Ma, Yue M. Lu, Dror Baron
Solving a large-scale regularized linear inverse problem using multiple
processors is important in various real-world applications due to the
limitations of individual processors and constraints on data sharing policies.
This paper focuses on the setting where the matrix is partitioned column-wise.
We extend the algorithmic framework and the theoretical analysis of approximate
message passing (AMP), an iterative algorithm for solving linear inverse
problems, whose asymptotic dynamics are characterized by state evolution (SE).
In particular, we show that column-wise multiprocessor AMP (C-MP-AMP) obeys an
SE under the same assumptions when the SE for AMP holds. The SE results imply
that (i) the SE of C-MP-AMP converges to a state that is no worse than that of
AMP and (ii) the asymptotic dynamics of C-MP-AMP and AMP can be identical.
Moreover, for a setting that is not covered by SE, numerical results show that
damping can improve the convergence performance of C-MP-AMP.
Authors' comments: This document contains complete details of the previous version
(i.e., arXiv:1701.02578v1), which was accepted for publication in ICASSP 2017
E. A. Valentijn, K. Begeman, A. Belikov, D. R. Boxhoorn, J. Brinchmann, J. McFarland, H. Holties, K. H. Kuijken et al.
After its first implementation in 2003 the Astro-WISE technology has been
rolled out in several European countries and is used for the production of the
KiDS survey data. In the multi-disciplinary Target initiative this technology,
nicknamed WISE technology, has been further applied to a large number of
projects. Here, we highlight the data handling of other astronomical
applications, such as VLT-MUSE and LOFAR, together with some non-astronomical
applications such as the medical projects Lifelines and GLIMPS, the MONK
handwritten text recognition system, and business applications, by amongst
others, the Target Holding. We describe some of the most important lessons
learned and describe the application of the data-centric WISE type of approach
to the Science Ground Segment of the Euclid satellite.
Authors' comments: 9 pages, 5 figures, Proceedngs IAU Symposium No 325 Astroinformatics
2017
Spyros Gidaris, Nikos Komodakis
Pixel wise image labeling is an interesting and challenging problem with great significance in the computer vision community. In order for a dense labeling algorithm to be able to achieve accurate and precise results, it has to consider the dependencies that exist in the joint space of both the input and the output variables. An implicit approach for modeling those dependencies is by training a deep neural network that, given as input an initial estimate of the output labels and the input image, it will be able to predict a new refined estimate for the labels. In this context, our work is concerned with what is the optimal architecture for performing the label improvement task. We argue that the prior approaches of either directly predicting new label estimates or predicting residual corrections w.r.t. the initial labels with feed-forward deep network architectures are sub-optimal. Instead, we propose a generic architecture that decomposes the label improvement task to three steps: 1) detecting the initial label estimates that are incorrect, 2) replacing the incorrect labels with new ones, and finally 3) refining the renewed labels by predicting residual corrections w.r.t. them. Furthermore, we explore and compare various other alternative architectures that consist of the aforementioned Detection, Replace, and Refine components. We extensively evaluate the examined architectures in the challenging task of dense disparity estimation (stereo matching) and we report both quantitative and qualitative results on three different datasets. Finally, our dense disparity estimation network that implements the proposed generic architecture, achieves state-of-the-art results in the KITTI 2015 test surpassing prior approaches by a significant margin.
Raúl Díaz, Charless C. Fowlkes
Feature point matching for camera localization suffers from scalability problems. Even when feature descriptors associated with 3D scene points are locally unique, as coverage grows, similar or repeated features become increasingly common. As a result, the standard distance ratio-test used to identify reliable image feature points is overly restrictive and rejects many good candidate matches. We propose a simple coarse-to-fine strategy that uses conservative approximations to robust local ratio-tests that can be computed efficiently using global approximate k-nearest neighbor search. We treat these forward matches as votes in camera pose space and use them to prioritize back-matching within candidate camera pose clusters, exploiting feature co-visibility captured by clustering the 3D model camera pose graph. This approach achieves state-of-the-art camera localization results on a variety of popular benchmarks, outperforming several methods that use more complicated data structures and that make more restrictive assumptions on camera pose. We also carry out diagnostic analyses on a difficult test dataset containing globally repetitive structure that suggest our approach successfully adapts to the challenges of large-scale image localization.
Janek Thomas, Andreas Mayr, Bernd Bischl, Matthias Schmid, Adam Smith, Benjamin Hofner
We present a new algorithm for boosting generalized additive models for
location, scale and shape (GAMLSS) that allows to incorporate stability
selection, an increasingly popular way to obtain stable sets of covariates
while controlling the per-family error rate (PFER). The model is fitted
repeatedly to subsampled data and variables with high selection frequencies are
extracted. To apply stability selection to boosted GAMLSS, we develop a new
"noncyclical" fitting algorithm that incorporates an additional selection step
of the best-fitting distribution parameter in each iteration. This new
algorithms has the additional advantage that optimizing the tuning parameters
of boosting is reduced from a multi-dimensional to a one-dimensional problem
with vastly decreased complexity. The performance of the novel algorithm is
evaluated in an extensive simulation study. We apply this new algorithm to a
study to estimate abundance of common eider in Massachusetts, USA, featuring
excess zeros, overdispersion, non-linearity and spatio-temporal structures.
Eider abundance is estimated via boosted GAMLSS, allowing both mean and
overdispersion to be regressed on covariates. Stability selection is used to
obtain a sparse set of stable predictors.
Authors' comments: 16 pages
Nathan D. Cahill, Harmeet Singh, Chao Zhang, Daryl A. Corcoran, Alison M. Prengaman, Paul S. Wenger, John F. Hamilton, Peter Bajorski et al.
Functional connectivity analysis yields powerful insights into our
understanding of the human brain. Group-wise functional community detection
aims to partition the brain into clusters, or communities, in which functional
activity is inter-regionally correlated in a common manner across a group of
subjects. In this article, we show how to use multiple-view spectral clustering
to perform group-wise functional community detection. In a series of
experiments on 291 subjects from the Human Connectome Project, we compare three
versions of multiple-view spectral clustering: MVSC (uniform weights), MVSCW
(weights based on subject-specific embedding quality), and AASC (weights
optimized along with the embedding) with the competing technique of Joint
Diagonalization of Laplacians (JDL). Results show that multiple-view spectral
clustering not only yields group-wise functional communities that are more
consistent than JDL when using randomly selected subsets of individual brains,
but it is several orders of magnitude faster than JDL.
Authors' comments: Presented at The MICCAI-BACON 16 Workshop
(https://arxiv.org/abs/1611.03363)
Yu-An Chung, Shao-Wen Yang, Hsuan-Tien Lin
While deep neural networks have succeeded in several visual applications, such as object recognition, detection, and localization, by reaching very high classification accuracies, it is important to note that many real-world applications demand varying costs for different types of misclassification errors, thus requiring cost-sensitive classification algorithms. Current models of deep neural networks for cost-sensitive classification are restricted to some specific network structures and limited depth. In this paper, we propose a novel framework that can be applied to deep neural networks with any structure to facilitate their learning of meaningful representations for cost-sensitive classification problems. Furthermore, the framework allows end-to-end training of deeper networks directly. The framework is designed by augmenting auxiliary neurons to the output of each hidden layer for layer-wise cost estimation, and including the total estimation loss within the optimization objective. Experimental results on public benchmark visual data sets with two cost information settings demonstrate that the proposed framework outperforms state-of-the-art cost-sensitive deep learning models.
Edward J. Gillis
Wiseman has claimed that Bell was wrong in stating that determinism was
inferred rather than assumed in the summary of the EPR argument in his 1964
paper. The reply of Wiseman and his co-authors to my comment misstates my
reasons for disputing this point, and fails to address the central criticism
that their claim is based on a seriously flawed formalization of Bell's
argument deriving from an unreasonably strong interpretation of the the terms,
'influence', 'affect', and 'depend on'.
Authors' comments: 6 pages
Xiaojie Jin, Yunpeng Chen, Jian Dong, Jiashi Feng, Shuicheng Yan
Intermediate features at different layers of a deep neural network are known
to be discriminative for visual patterns of different complexities. However,
most existing works ignore such cross-layer heterogeneities when classifying
samples of different complexities. For example, if a training sample has
already been correctly classified at a specific layer with high confidence, we
argue that it is unnecessary to enforce rest layers to classify this sample
correctly and a better strategy is to encourage those layers to focus on other
samples.
In this paper, we propose a layer-wise discriminative learning method to
enhance the discriminative capability of a deep network by allowing its layers
to work collaboratively for classification. Towards this target, we introduce
multiple classifiers on top of multiple layers. Each classifier not only tries
to correctly classify the features from its input layer, but also coordinates
with other classifiers to jointly maximize the final classification
performance. Guided by the other companion classifiers, each classifier learns
to concentrate on certain training examples and boosts the overall performance.
Allowing for end-to-end training, our method can be conveniently embedded into
state-of-the-art deep networks. Experiments with multiple popular deep
networks, including Network in Network, GoogLeNet and VGGNet, on scale-various
object classification benchmarks, including CIFAR100, MNIST and ImageNet, and
scene classification benchmarks, including MIT67, SUN397 and Places205,
demonstrate the effectiveness of our method. In addition, we also analyze the
relationship between the proposed method and classical conditional random
fields models.
Authors' comments: To appear in ECCV 2016. Maybe subject to minor changes before
camera-ready version
F. R. Herpich, A. Mateus, G. Stasińska, R. Cid Fernandes, N. Vale Asari
We use the SDSS and WISE surveys to investigate the real nature of galaxies
defined as LINERs in the BPT diagram. After establishing a mid-infrared colour
W2-W3 = 2.5 as the optimal separator between galaxies with and without star
formation, we investigate the loci of different galaxy classes in the W_{Ha}
versus W2-W3 space. We find that: (1) A large fraction of LINER-like galaxies
are emission-line retired galaxies, i.e galaxies which have stopped forming
stars and are powered by hot low-mass evolved stars (HOLMES). Their W2-W3
colours show no sign of star formation and their Ha equivalent widths, W_{Ha},
are consistent with ionization by their old stellar populations. (2) Another
important fraction have W2-W3 indicative of star formation. This includes
objects located in the supposedly `pure AGN' zone of the BPT diagram. (3) A
smaller fraction of LINER-like galaxies have no trace of star formation from
W2-W3 and a high W_{Ha}, pointing to the presence of an AGN. (4) Finally, a few
LINERs tagged as retired by their W_{Ha} but with W2-W3 values indicative of
star formation are late-type galaxies whose SDSS spectra cover only the old
`retired' bulge. This reinforces the view that LINER-like galaxies are a mixed
bag of objects involving different physical phenomena and observational effects
thrusted into the same locus of the BPT diagram.
Authors' comments: Accepted for publication in MNRAS; 9 pages, 6 figures
Aronee Dasgupta, Sahil Chakraborty, Astha Nachrani, Pritam Gajkumar Shah
Wireless Sensor Networks have emerged as one of the leading technologies.
These networks are designed to monitor crucial environmental parameters of
humidity, temperature, wind speed, soil moisture content, UV index, sound, etc.
and then transfer the required information to the base station. However,
security remains the key challenge of such networks as critical data is being
transferred. Most sensor nodes currently deployed have constraints on memory
and processing power and hence operate without an efficient security protocol.
Hereby a protocol which is lightweight and is secure for wireless sensor
applications is proposed.
Authors' comments: 5 pages, 4 figures, 2 tables. Published with International Journal of
Computer Applications (IJCA)
Staša Milojević, Filippo Radicchi, Judit Bar-Ilan
In this paper we present "citation success index", a metric for comparing the citation capacity of pairs of journals. Citation success index is the probability that a random paper in one journal has more citations than a random paper in another journal (50% means the two journals do equally well). Unlike the journal impact factor (IF), the citation success index depends on the broadness and the shape of citation distributions. Also, it is insensitive to sporadic highly-cited papers that skew the IF. Nevertheless, we show, based on 16,000 journals containing ~2.4 million articles, that the citation success index is a relatively tight function of the ratio of IFs of journals being compared, due to the fact that journals with same IF have quite similar citation distributions. The citation success index grows slowly as a function of IF ratio. It is substantial (>90%) only when the ratio of IFs exceeds ~6, whereas a factor of two difference in IF values translates into a modest advantage for the journal with higher IF (index of ~70%). We facilitate the wider adoption of this metric by providing an online calculator that takes as input parameters only the IFs of the pair of journals.