Sahand Negahban, Sewoong Oh, Devavrat Shah
The question of aggregating pair-wise comparisons to obtain a global ranking
over a collection of objects has been of interest for a very long time: be it
ranking of online gamers (e.g. MSR's TrueSkill system) and chess players,
aggregating social opinions, or deciding which product to sell based on
transactions. In most settings, in addition to obtaining a ranking, finding
`scores' for each object (e.g. player's rating) is of interest for
understanding the intensity of the preferences.
In this paper, we propose Rank Centrality, an iterative rank aggregation
algorithm for discovering scores for objects (or items) from pair-wise
comparisons. The algorithm has a natural random walk interpretation over the
graph of objects with an edge present between a pair of objects if they are
compared; the score, which we call Rank Centrality, of an object turns out to
be its stationary probability under this random walk. To study the efficacy of
the algorithm, we consider the popular Bradley-Terry-Luce (BTL) model
(equivalent to the Multinomial Logit (MNL) for pair-wise comparisons) in which
each object has an associated score which determines the probabilistic outcomes
of pair-wise comparisons between objects. In terms of the pair-wise marginal
probabilities, which is the main subject of this paper, the MNL model and the
BTL model are identical. We bound the finite sample error rates between the
scores assumed by the BTL model and those estimated by our algorithm. In
particular, the number of samples required to learn the score well with high
probability depends on the structure of the comparison graph. When the
Laplacian of the comparison graph has a strictly positive spectral gap, e.g.
each item is compared to a subset of randomly chosen items, this leads to
dependence on the number of samples that is nearly order-optimal.
Authors' comments: 45 pages, 3 figures
Simone Antoniucci, Teresa Giannini, Dario Lorenzetti
We present a collection of WISE photometry of EXor sources and candidates
(more recently identified). This represents the first complete survey of such
objects in the mid-IR (3.4 - 22 um) that was carried out with the same
instrumentation. Two-color diagrams constructed with WISE data evidence a clear
segregation between classical and newly identified sources, being these latter
characterized by colder (and less evolved) circumstellar disks. By combining
2MASS and WISE data, we obtain spectral energy distributions (SED's) that are
compatible with the existence of an inner hole in the circumstellar disk. A
compilation of all EXor observations given in the literature at wavelengths
very similar to those of WISE is also provided. This allows us to study their
mid-IR variability, which has been poorly investigated so far and without any
coordination with shorter wavelengths surveys. The presented it WISE photometry
and the compilation of the literature data are intended as a first step toward
the construction of a significant database in this spectral regime. Preliminary
indications on the mechanisms responsible for the luminosity fluctuations are
provided.
Authors' comments: Accepted by New Astronomy
Gijs A. Verdoes Kleijn, Andrey N. Belikov, John P. McFarland
In this paper we describe the way the Astro-WISE information system (or
simply Astro-WISE) supports the data from a wide range of in- struments and
combines multiple surveys and their catalogues. Astro-WISE allows ingesting of
data from any optical instrument, survey or catalogue, pro- cessing of this
data to create new catalogues and bringing in data from di?erent surveys into a
single catalogue, keeping all dependencies back to the original data. Full data
lineage is kept on each step of compiling a new catalogue with an ability to
add a new data source recursively. With these features, Astro- WISE allows not
only combining and retrieving data from multiple surveys, but performing
scientific data reduction and data mining down to the rawest data in the data
processing chain within a single environment.
Authors' comments: Accepted for publication in topical issue of Experimental Astronomy
on Astro-WISE information system
K. Begeman, A. N. Belikov, D. R. Boxhoorn, E. A. Valentijn
In this paper we present the various concepts behind the Astro-WISE
Information System. The concepts form a blueprint for general scientific
information systems (WISE) which can satisfy a wide and challenging range of
requirements for the data dissemination, storage and processing for various
fields in science. We review the main features of the information system and
its practical implementation.
Authors' comments: 21 pages, 6 figures, accepted by Experimental Astronomy for topical
issue on Astro-WISE
T. H. Jarrett, F. Masci, C. W. Tsai, S. Petty, M. Cluver, Roberto J. Assef, D. Benford, A. Blain et al.
After eight months of continuous observations, the Wide-field Infrared Survey
Explorer (WISE) mapped the entire sky at 3.4 {\mu}m, 4.6 {\mu}m, 12 {\mu}m and
22 {\mu}m. We have begun a dedicated WISE High Resolution Galaxy Atlas (WHRGA)
project to fully characterize large, nearby galaxies and produce a legacy image
atlas and source catalogue. Here we summarize the deconvolution technique used
to significantly improve the spatial resolution of WISE imaging, specifically
designed to study the internal anatomy of nearby galaxies. As a case study, we
present results for the galaxy NGC 1566, comparing the WISE super-resolution
image processing to that of Spitzer, GALEX and ground-based imaging. The is the
first paper in a two part series; results for a much larger sample of nearby
galaxies is presented in the second paper.
Authors' comments: Published in the AJ (2012, AJ, 144, 68)
Alexey Ostrovsky
A subset $B \subset Y$ is constructible if it is an element of the smallest family that contains all open sets and is stable under finite intersections and complements. A function $f : X \to Y$ is said to be piece-wise closed if $X$ can be written as a countable union of closed sets $Z_n$ such that $f$ is closed on every $Z_n.$ We prove that if a continuous function $f$ takes each closed set into a constructible subset of $Y$, then $f$ is piece-wise closed.
Ofer Yaron, Avishay Gal-Yam
We have entered an era of massive data sets in astronomy. In particular, the
number of supernova (SN) discoveries and classifications has substantially
increased over the years from few tens to thousands per year. It is no longer
the case that observations of a few prototypical events encapsulate most
spectroscopic information about SNe, motivating the development of modern tools
to collect, archive, organize and distribute spectra in general, and SN spectra
in particular. For this reason we have developed the Weizmann Interactive
Supernova data REPository - WISeREP - an SQL-based database (DB) with an
interactive web-based graphical interface. The system serves as an archive of
high quality SN spectra, including both historical (legacy) data as well as
data that is accumulated by ongoing modern programs. The archive provides
information about objects, their spectra, and related meta-data. Utilizing
interactive plots, we provide a graphical interface to visualize data, perform
line identification of the major relevant species, determine object redshifts,
classify SNe and measure expansion velocities. Guest users may view and
download spectra or other data that have been placed in the public domain.
Registered users may also view and download data that are proprietary to
specific programs with which they are associated. The DB currently holds >8000
spectra, of which >5000 are public; the latter include published spectra from
the Palomar Transient Factory, all of the SUSPECT archive, the
Caltech-Core-Collapse Program, the CfA SN spectra archive and published spectra
from the UC Berkeley SNDB repository. It offers an efficient and convenient way
to archive data and share it with colleagues, and we expect that data stored in
this way will be easy to access, increasing its visibility, usefulness and
scientific impact.
Authors' comments: To be published in PASP. WISeREP:
http://www.weizmann.ac.il/astrophysics/wiserep/
Kenneth L. Clarkson, Wolfgang Mulzer, C. Seshadhri
Computing the coordinate-wise maxima of a planar point set is a classic and
well-studied problem in computational geometry. We give an algorithm for this
problem in the \emph{self-improving setting}. We have $n$ (unknown) independent
distributions $\cD_1, \cD_2, ..., \cD_n$ of planar points. An input pointset
$(p_1, p_2, ..., p_n)$ is generated by taking an independent sample $p_i$ from
each $\cD_i$, so the input distribution $\cD$ is the product $\prod_i \cD_i$. A
self-improving algorithm repeatedly gets input sets from the distribution $\cD$
(which is \emph{a priori} unknown) and tries to optimize its running time for
$\cD$. Our algorithm uses the first few inputs to learn salient features of the
distribution, and then becomes an optimal algorithm for distribution $\cD$. Let
$\OPT_\cD$ denote the expected depth of an \emph{optimal} linear comparison
tree computing the maxima for distribution $\cD$. Our algorithm eventually has
an expected running time of $O(\text{OPT}_\cD + n)$, even though it did not
know $\cD$ to begin with.
Our result requires new tools to understand linear comparison trees for
computing maxima. We show how to convert general linear comparison trees to
very restricted versions, which can then be related to the running time of our
algorithm. An interesting feature of our algorithm is an interleaved search,
where the algorithm tries to determine the likeliest point to be maximal with
minimal computation. This allows the running time to be truly optimal for the
distribution $\cD$.
Authors' comments: To appear in Symposium of Computational Geometry 2012 (17 pages, 2
figures)
Gilles Celeux, Stéphane Chrétien, Florence Forbes
In some situations, EM algorithm shows slow convergence problems. One possible reason is that standard procedures update the parameters simultaneously. In this paper we focus on finite mixture estimation. In this framework, we propose a component-wise EM, which updates the parameters sequentially. We give an interpretation of this procedure as a proximal point algorithm and use it to prove the convergence. Illustrative numerical experiments show how our algorithm compares to EM and a version of the SAGE algorithm.
Vikram Kamat
We consider the following generalization of the seminal Erd\H{o}s-Ko-Rado
theorem, due to Frankl. For some k>=2, let F be a k-wise intersecting family of
r-subsets of an n element set X, i.e. for any k sets F1,...,Fk in F, their
intersection is nonempty. If r <= ((k-1)n)/k, then |F|<= {n-1 \choose r-1}. We
prove a stability version of this theorem, analogous to similar results of
Dinur-Friedgut, Keevash-Mubayi and others for the Erd\H{o}s-Ko-Rado theorem.
The technique we use is a generalization of Katona's circle method, initially
employed by Keevash, which uses expansion properties of a particular Cayley
graph of the symmetric group.
Authors' comments: There are 10 pages. This is the second version, consistent with the
version published in Elec. J. Combinatorics, so Theorem 1.4 and Section 3
(containing its proof) from the older version have been removed. The
appropriate journal reference has also been added
Peter Albers, Al Momin
We prove that on a restricted contact type hypersurface the number of
leaf-wise intersections is bounded from below by a certain cup-length.
Authors' comments: 13 pages, 4 figures; v2: minor modifications
Frank J. Masci, John W. Fowler
We describe a new image co-addition tool, AWAIC, to support the creation of a
digital Image Atlas from the multiple frame exposures acquired with the
Wide-field Infrared Survey Explorer (WISE). AWAIC includes preparatory steps
such as frame background matching and outlier detection using robust
frame-stack statistics. Frame co-addition is based on using the detector's
Point Response Function (PRF) as an interpolation kernel. This kernel reduces
the impact of prior-masked pixels; enables the creation of an optimal matched
filtered product for point source detection; and most important, it allows for
resolution enhancement (HiRes) to yield a model of the sky that is consistent
with the observations to within measurement error. The HiRes functionality
allows for non-isoplanatic PRFs, prior noise-variance weighting, uncertainty
estimation, and includes a ringing-suppression algorithm. AWAIC also supports
the popular overlap-area weighted interpolation method, and is generic enough
for use on any astronomical image data that supports the FITS and WCS
standards.
Authors' comments: 16 pages, 6 figures. Invited paper to appear in Proceedings of ADASS
XVIII Conference
Peter Albers, Urs Frauenfelder
In this article we explain how critical points of a particular perturbation
of the Rabinowitz action functional give rise to leaf-wise intersection points
in hypersurfaces of restricted contact type. This is used to derive existence
and multiplicity results for leaf-wise intersection points in hypersurfaces of
restricted contact type in general exact symplectic manifolds. The notion of
leaf-wise intersection points was introduced by Moser.
Authors' comments: 18 pages, 1 figure; v3: completely rewritten, improved results
Ronen Gradwohl, Amir Yehudayoff
In this note we prove a large deviation bound on the sum of random variables
with the following dependency structure: there is a dependency graph $G$ with a
bounded chromatic number, in which each vertex represents a random variable.
Variables that are represented by neighboring vertices may be arbitrarily
dependent, but collections of variables that form an independent set in $G$ are
$t$-wise independent.
Authors' comments: 5 pages
Edwin A. Valentijn, John P. McFarland, Jan Snigula, Kor G. Begeman, Danny R. Boxhoorn, Roeland Rengelink, Ewout Helmich, Philippe Heraudeau et al.
The recent explosion of recorded digital data and its processed derivatives
threatens to overwhelm researchers when analysing their experimental data or
when looking up data items in archives and file systems. While current hardware
developments allow to acquire, process and store 100s of terabytes of data at
the cost of a modern sports car, the software systems to handle these data are
lagging behind. This general problem is recognized and addressed by various
scientific communities, e.g., DATAGRID/EGEE federates compute and storage power
over the high-energy physical community, while the astronomical community is
building an Internet geared Virtual Observatory, connecting archival data.
These large projects either focus on a specific distribution aspect or aim to
connect many sub-communities and have a relatively long trajectory for setting
standards and a common layer. Here, we report "first light" of a very different
solution to the problem initiated by a smaller astronomical IT community. It
provides the abstract "scientific information layer" which integrates
distributed scientific analysis with distributed processing and federated
archiving and publishing. By designing new abstractions and mixing in old ones,
a Science Information System with fully scalable cornerstones has been
achieved, transforming data systems into knowledge systems. This break-through
is facilitated by the full end-to-end linking of all dependent data items,
which allows full backward chaining from the observer/researcher to the
experiment. Key is the notion that information is intrinsic in nature and thus
is the data acquired by a scientific experiment. The new abstraction is that
software systems guide the user to that intrinsic information by forcing full
backward and forward chaining in the data modelling.
Authors' comments: To be published in ADASS XVI ASP Conference Series, 2006, R. Shaw, F.
Hill and D. Bell, eds
Jana Rodriguez Hertz
On a Peano continuum, all local stable and unstable components of a continuum-wise expansive homeomorphism are non trivial. In particular, there is sensitive dependence on initial conditions. This generalizes results in \cite{h,l} about lack of Lyapunov stable points (weak sinks) and existence of non trivial stable and unstable components for expansive homeomorphisms on Peano continua. We also use this fact to generalize a result in \cite{ktt}: a Peano curve $X$ admitting a continuum-wise expansive homeomorphism is nowhere rim-countable. However, it is not resolved the question of whether such a dynamics could be possible in a locally planar Peano curve. Some other questions are posed.
Itai Benjamini, Gady Kozma, Dan Romik
We construct examples of a random walk with pairwise-independent steps which is almost-surely bounded, and for any $m$ and $k$ a random walk with $k$-wise independent steps which has no stationary distribution modulo $m$.
Alonso Botero, Benni Reznik
We address the decomposition of a multi-mode pure Gaussian state with respect to a bi-partite division of the modes. For any such division the state can always be expressed as a product state involving entangled two-mode squeezed states and single mode local states at each side. The character of entanglement of the state can therefore be understood modewise; that is, a given mode on one side is entangled with only one corresponding mode of the other, and therefore the total bi-partite entanglement is the sum of the modewise entanglement. This decomposition is generally not applicable to all mixed Gaussian states. However, the result can be extended to a special family of "isotropic" states, characterized by a phase space covariance matrix with a completely degenerate symplectic spectrum.
UKQCD collaboration, G. N. Lacagnina
The h+, h_A1 form factors for the semi-leptonic B->D and B->D* decays are
evaluated in quenched lattice QCD with beta=6.2. The action and the operators
are fully O(a) non-perturbatively improved. The Isgur-Wise function is
evaluated and fitted to extract its slope; the latter is found to be
rho^2=1.1(2)(3) from the B->D* decay and rho^2=1.0(2)(3) from the B->D decay.
The form factors ratios R_1, R_2 are evaluated and found to be in agreement
with experimental determinations.
Authors' comments: Lattice 2001 proc., 3 pages, 3 figures
[UKQCD Collaboration], G. Douglas
We derive the form factors relevant for decays of pseudo-scalar mesons
corresponding to the semi-leptonic decay B->D l nu. The simulations are
performed in the quenched approximation at beta=6.0 and beta=6.2 using a
non-perturbatively improved clover action. The slope of the Isgur Wise function
and |Vcb| are extracted from the form factors.
Authors' comments: LaTeX. 6 PostScript figures. Contribution to LATTICE99(Heavy Quarks)