Young Woong Park, Yan Jiang, Diego Klabjan, Loren Williams
Cluster-wise linear regression (CLR), a clustering problem intertwined with regression, is to find clusters of entities such that the overall sum of squared errors from regressions performed over these clusters is minimized, where each cluster may have different variances. We generalize the CLR problem by allowing each entity to have more than one observation, and refer to it as generalized CLR. We propose an exact mathematical programming based approach relying on column generation, a column generation based heuristic algorithm that clusters predefined groups of entities, a metaheuristic genetic algorithm with adapted Lloyd's algorithm for K-means clustering, a two-stage approach, and a modified algorithm of Sp{\"a}th \cite{Spath1979} for solving generalized CLR. We examine the performance of our algorithms on a stock keeping unit (SKU) clustering problem employed in forecasting halo and cannibalization effects in promotions using real-world retail data from a large supermarket chain. In the SKU clustering problem, the retailer needs to cluster SKUs based on their seasonal effects in response to promotions. The seasonal effects are the results of regressions with predictors being promotion mechanisms and seasonal dummies performed over clusters generated. We compare the performance of all proposed algorithms for the SKU problem with real-world and synthetic data.
Ming Xu
A Finsler space $(M,F)$ is called flag-wise positively curved, if for any
$x\in M$ and any tangent plane $\mathbf{P}\subset T_xM$, we can find a nonzero
vector $y\in \mathbf{P}$, such that the flag curvature $K^F(x,y,
\mathbf{P})>0$. Though compact positively curved spaces are very rare in both
Riemannian and Finsler geometry, flag-wise positively curved metrics should be
easy to be found. A generic Finslerian perturbation for a non-negatively curved
homogeneous metric may have a big chance to produce flag-wise positively curved
metrics. This observation leads our discovery of these metrics on many compact
manifolds. First we prove any Lie group $G$ such that its Lie algebra
$\mathfrak{g}$ is compact non-Abelian and $\dim\mathfrak{c}(\mathfrak{g})\leq
1$ admits flag-wise positively curved left invariant Finsler metrics. Similar
techniques can be applied to our exploration for more general compact coset
spaces. We will prove, whenever $G/H$ is a compact simply connected coset
space, $G/H$ and $S^1\times G/H$ admit flag-wise positively curved Finsler
metrics. This provides abundant examples for this type of metrics, which are
not homogeneous in general.
Authors' comments: 9 pages. In the newest version, Theorem 1.3 is strenghened to provide
many more examples
Jimi Sanchez
In software testing, the large size of the input domain makes exhaustively testing the inputs a daunting and often impossible task. Pair-wise testing is a popular approach to combinatorial testing problems. This paper reviews Pair-wise testing and its history, strengths, weaknesses, and tools for generating test cases.
M. R. Zapatero Osorio, N. Lodieu, V. J. S. Béjar, E. L. Martín, V. D. Ivanov, A. Bayo, H. M. J. Boffin, K. Mužić et al.
(Abridged) We aim at measuring the near-infrared photometry, and deriving the
mass, age, temperature, and surface gravity of WISE J085510.74-071442.5
(J0855-0714), which is the coolest known object beyond the Solar System as of
today. We use publicly available data from the archives of the HST and the VLT
to determine the emission of this source at 1.153 micron (F110W) and 1.575
micron (CH_4). J0855-0714 is detected at both wavelengths with signal-to-noise
ratio of ~10 (F110W) and ~4 (CH_4-off) at the peak of the corresponding PSFs.
This is the first detection of J0855-0714 in the H-band. We measure 26.31 +/-
0.10 and 23.22 +/- 0.35 mag in F110W and CH_4 (Vega system). J0855-0714 remains
unresolved in the HST images that have a spatial resolution of 0.22".
Companions at separations of 0.5 AU (similar brightness) and at ~1 AU (~1 mag
fainter in the F110W filter) are discarded. By combining the new data with
published photometry, we build the spectral energy distribution of J0855-0714
from 0.89 to 22.09 micron, and contrast it against state-of-the-art
solar-metallicity models of planetary atmospheres. We determine a temperature
of 225-250 K, a bolometric luminosity of log L/Lsol = -8.57, and a high surface
gravity of log g = 5.0 (cm/s2), which suggests an old age although such a high
gravity is not fully compatible with evolutionary models. After comparison with
the cooling theory for brown dwarfs and planets, we infer a mass in the
interval 2-10 Mjup for ages of 1-12 Gyr and log g > 3.5 (cm/s2). At the age of
the Sun, J0855-0714 would be a ~5-Mjup free-floating planetary-mass object.
J0855-0714 may represent the old image of the free-floating planetary-mass
objects of similar mass discovered in star-forming regions and young stellar
clusters. As many J0855-0714-like objects as M5-L2 stars may be expected to
populate the solar neighborhood.
Authors' comments: Accepted for publication in A&A
Agnieszka Kurcz, Maciej Bilicki, Aleksandra Solarz, Magdalena Krupa, Agnieszka Pollo, Katarzyna Małek
The WISE satellite has detected hundreds of millions sources over the entire
sky. Classifying them reliably is however a challenging task due to
degeneracies in WISE multicolour space and low levels of detection in its two
longest-wavelength bandpasses. Here we aim at obtaining comprehensive and
reliable star, galaxy and quasar catalogues based on automatic source
classification in full-sky WISE data. This means that the final classification
will employ only parameters available from WISE itself, in particular those
reliably measured for a majority of sources. For the automatic classification
we applied the support vector machines (SVM) algorithm, which requires a
training sample with relevant classes already identified, and we chose to use
the SDSS spectroscopic dataset for that purpose. By calibrating the classifier
on the test data drawn from SDSS, we first established that a polynomial kernel
is preferred over a radial one for this particular dataset. Next, using three
classification parameters (W1 magnitude, W1-W2 colour, and a differential
aperture magnitude) we obtained very good classification efficiency in all the
tests. At the bright end, the completeness for stars and galaxies reaches ~95%,
deteriorating to ~80% at W1=16 mag, while for quasars it stays at a level of
~95% independently of magnitude. Similar numbers are obtained for purity.
Application of the classifier to full-sky WISE data, flux-limited to 16 mag
(Vega) in the 3.4 {\mu}m channel, and appropriate a posteriori cleaning allowed
us to obtain reliably-looking catalogues of star and galaxy candidates.
However, the sources flagged by the classifier as `quasars' are in fact
dominated by dusty galaxies but also exhibit contamination from sources located
mainly at low ecliptic latitudes, consistent with Solar System objects.
[abridged]
Authors' comments: 18 pages, 17 figures, 4 tables
Ayana, Shiqi Shen, Yu Zhao, Zhiyuan Liu, Maosong Sun
Recently, neural models have been proposed for headline generation by learning to map documents to headlines with recurrent neural networks. Nevertheless, as traditional neural network utilizes maximum likelihood estimation for parameter optimization, it essentially constrains the expected training objective within word level rather than sentence level. Moreover, the performance of model prediction significantly relies on training data distribution. To overcome these drawbacks, we employ minimum risk training strategy in this paper, which directly optimizes model parameters in sentence level with respect to evaluation metrics and leads to significant improvements for headline generation. Experiment results show that our models outperforms state-of-the-art systems on both English and Chinese headline generation tasks.
Jinyoung Yang, Evgeny Levi, Radu V. Craiu, Jeffrey S. Rosenthal
One of the most widely used samplers in practice is the component-wise Metropolis-Hastings (CMH) sampler that updates in turn the components of a vector valued Markov chain using accept-reject moves generated from a proposal distribution. When the target distribution of a Markov chain is irregularly shaped, a `good' proposal distribution for one part of the state space might be a `poor' one for another part of the state space. We consider a component-wise multiple-try Metropolis (CMTM) algorithm that can automatically choose from a set of candidate moves sampled from different distributions. The computational efficiency is increased using an adaptation rule for the CMTM algorithm that dynamically builds a better set of proposal distributions as the Markov chain runs. The ergodicity of the adaptive chain is demonstrated theoretically. The performance is studied via simulations and real data examples.
Manseob Lee
Let $M$ be a closed smooth manifold and let $f:M\to M$ be a diffeomorphism.
$C^1$-generically, a continuum-wise expansive satisfies Axiom A without cycles.
Moreover, there is a partially hyperbolic diffeomorphism $f$ such that it is
not continuum-wise expansive.
Authors' comments: 10pages
Yingwei Li
Using pointwise semigroup techniques, we establish sharp rates of decay in
space and time of a perturbed reaction diffusion front to its time-asymptotic
limit. This recovers results of Sattinger, Henry and others of time-exponential
convergence in weighted $L^p$ and Sobolev norms, while capturing the new
feature of spatial diffusion at Gaussian rate. Novel features of the argument
are a point-wise Green function decomposition reconciling spectral
decomposition and short-time Nash-Aronson estimates and an instantaneous
tracking scheme similar to that used in the study of stability of viscous shock
waves.
Authors' comments: arXiv admin note: substantial text overlap with arXiv:1004.0909 by
other authors
Zequn Jie, Xiaodan Liang, Jiashi Feng, Wen Feng Lu, Eng Hock Francis Tay, Shuicheng Yan
Object proposal is essential for current state-of-the-art object detection
pipelines. However, the existing proposal methods generally fail in producing
results with satisfying localization accuracy. The case is even worse for small
objects which however are quite common in practice. In this paper we propose a
novel Scale-aware Pixel-wise Object Proposal (SPOP) network to tackle the
challenges. The SPOP network can generate proposals with high recall rate and
average best overlap (ABO), even for small objects. In particular, in order to
improve the localization accuracy, a fully convolutional network is employed
which predicts locations of object proposals for each pixel. The produced
ensemble of pixel-wise object proposals enhances the chance of hitting the
object significantly without incurring heavy extra computational cost. To solve
the challenge of localizing objects at small scale, two localization networks
which are specialized for localizing objects with different scales are
introduced, following the divide-and-conquer philosophy. Location outputs of
these two networks are then adaptively combined to generate the final proposals
by a large-/small-size weighting network. Extensive evaluations on PASCAL VOC
2007 show the SPOP network is superior over the state-of-the-art models. The
high-quality proposals from SPOP network also significantly improve the mean
average precision (mAP) of object detection with Fast-RCNN framework. Finally,
the SPOP network (trained on PASCAL VOC) shows great generalization performance
when testing it on ILSVRC 2013 validation set.
Authors' comments: accepted by IEEE Transactions on Image Processing
Alan D. Logan
Motivated by a question of Bumagin and Wise, we construct a continuum of
finitely generated, residually finite groups whose outer automorphism groups
are pairwise non-isomorphic finitely generated, non-recursively-presentable
groups. These are the first examples of such residually finite groups.
Authors' comments: 8 pages
I. Gezer, H. Van Winckel, Z. Bozkurt, K. De Smedt, D. Kamath, M. Hillen, R. Manick
We present a detailed study based on infrared photometry of all Galactic RV
Tauri stars from the General Catalogue of Variable Stars (GCVS). RV Tauri stars
are the brightest among the population II Cepheids. They are thought to evolve
away from the asymptotic giant branch (AGB) towards the white dwarf domain.
IRAS detected several RV Tauri stars because of their large IR excesses and it
was found that they occupy a specific region in the [12] - [25], [25] - [60]
IRAS two-colour diagram. We used the all sky survey of WISE to extend these
studies and compare the infrared properties of all RV Tauri stars in the GCVS
with a selected sample of post-AGB objects with the goal to place the RV Tauri
pulsators in the context of post-AGB evolution. Moreover, we correlated the IR
properties of both the RV Tauri stars and the comparison sample with other
observables like binarity and the presence of a photospheric chemical anomaly
called depletion. We find that Galactic RV Tauri stars display a range of
infrared properties and we differentiate between disc sources, objects with no
IR excess and objects for which the spectral energy distribution (SED) is
uncertain. We obtain a clear correlation between disc sources and binarity. RV
Tauri stars with a variable mean magnitude are exclusively found among the disc
sources. We also find evidence for disc evolution among the binaries.
Furthermore our studies show that the presence of a disc seems to be a
necessary but not sufficient condition for the depletion process to become
efficient.
Authors' comments: 14 pages, 13 figures, 6 tables. Accepted for publication in Monthly
Notices of the Royal Astronomical Society (MNRAS)
Vadim Lebedev, Victor Lempitsky
We revisit the idea of brain damage, i.e. the pruning of the coefficients of a neural network, and suggest how brain damage can be modified and used to speedup convolutional layers. The approach uses the fact that many efficient implementations reduce generalized convolutions to matrix multiplications. The suggested brain damage process prunes the convolutional kernel tensor in a group-wise fashion by adding group-sparsity regularization to the standard training process. After such group-wise pruning, convolutions can be reduced to multiplications of thinned dense matrices, which leads to speedup. In the comparison on AlexNet, the method achieves very competitive performance.
Jyh-Jing Hwang, Tyng-Luh Liu
We address the problem of contour detection via per-pixel classifications of
edge point. To facilitate the process, the proposed approach leverages with
DenseNet, an efficient implementation of multiscale convolutional neural
networks (CNNs), to extract an informative feature vector for each pixel and
uses an SVM classifier to accomplish contour detection. In the experiment of
contour detection, we look into the effectiveness of combining per-pixel
features from different CNN layers and verify their performance on BSDS500.
Authors' comments: 2 pages. arXiv admin note: substantial text overlap with
arXiv:1412.6857
J. A. Toalá, M. A. Guerrero, G. Ramos-Larios, V. Guzmán
We present a morphological study of nebulae around Wolf-Rayet (WR) stars
using archival narrow-band optical and Wide-field Infrared Survey Explorer
(WISE) infrared images. The comparison among WISE images in different bands and
optical images proves to be a very efficient procedure to identify the nebular
emission from WR nebulae, and to disentangle it from that of the ISM material
along the line of sight. In particular, WR nebulae are clearly detected in the
WISE W4 band at 22 $\mu$m. Analysis of available mid-IR Spitzer spectra shows
that the emission in this band is dominated by thermal emission from dust
spatially coincident with the thin nebular shell or most likely with the
leading edge of the nebula. The WR nebulae in our sample present different
morphologies that we classified into well defined WR bubbles (bubble ${\cal
B}$-type nebulae), clumpy and/or disrupted shells (clumpy/disrupted ${\cal
C}$-type nebulae), and material mixed with the diffuse medium (mixed ${\cal
M}$-type nebulae). The variety of morphologies presented by WR nebulae shows a
loose correlation with the central star spectral type, implying that the
nebular and stellar evolutions are not simple and may proceed according to
different sequences and time-lapses. We report the discovery of an obscured
shell around WR35 only detected in the infrared.
Authors' comments: 11 pages, 6 figures, plus 23 appendix figures; to appear in Astronomy
and Astrophysics
Dustin Lang, David W. Hogg, David J. Schlegel
We present photometry of images from the Wide-Field Infrared Survey Explorer (WISE; Wright et al. 2010) of over 400 million sources detected by the Sloan Digital Sky Survey (SDSS; York et al. 2000). We use a "forced photometry" technique, using measured SDSS source positions, star-galaxy separation and galaxy profiles to define the sources whose fluxes are to be measured in the WISE images. We perform photometry with The Tractor image modeling code, working on our "unWISE" coaddds and taking account of the WISE point-spread function and a noise model. The result is a measurement of the flux of each SDSS source in each WISE band. Many sources have little flux in the WISE bands, so often the measurements we report are consistent with zero. However, for many sources we get three- or four-sigma measurements; these sources would not be reported by the WISE pipeline and will not appear in the WISE catalog, yet they can be highly informative for some scientific questions. In addition, these small-signal measurements can be used in stacking analyses at catalog level. The forced photometry approach has the advantage that we measure a consistent set of sources between SDSS and WISE, taking advantage of the resolution and depth of the SDSS images to interpret the WISE images; objects that are resolved in SDSS but blended together in WISE still have accurate measurements in our photometry. Our results, and the code used to produce them, are publicly available at http://unwise.me.
Chao-Wei Tsai, Peter Eisenhardt, Jingwen Wu, Daniel Stern, Roberto Assef, Andrew Blain, Carrie Bridge, Dominic Benford et al.
We present 20 WISE-selected galaxies with bolometric luminosities L_bol >
10^14 L_sun, including five with infrared luminosities L_IR = L(rest 8-1000
micron) > 10^14 L_sun. These "extremely luminous infrared galaxies," or ELIRGs,
were discovered using the "W1W2-dropout" selection criteria which requires
marginal or non-detections at 3.4 and 4.6 micron (W1 and W2, respectively) but
strong detections at 12 and 22 micron in the WISE survey. Their spectral energy
distributions are dominated by emission at rest-frame 4-10 micron, suggesting
that hot dust with T_d ~ 450K is responsible for the high luminosities. These
galaxies are likely powered by highly obscured AGNs, and there is no evidence
suggesting these systems are beamed or lensed. We compare this WISE-selected
sample with 116 optically selected quasars that reach the same L_bol level,
corresponding to the most luminous unobscured quasars in the literature. We
find that the rest-frame 5.8 and 7.8 micron luminosities of the WISE-selected
ELIRGs can be 30-80% higher than that of the unobscured quasars. The existence
of AGNs with L_bol > 10^14 L_sun at z > 3 suggests that these supermassive
black holes are born with large mass, or have very rapid mass assembly. For
black hole seed masses ~ 10^3 M_sun, either sustained super-Eddington accretion
is needed, or the radiative efficiency must be <15%, implying a black hole with
slow spin, possibly due to chaotic accretion.
Authors' comments: 17 pages in emulateapj format, including 11 figures and 5 tables. ApJ
in press
Dustin Lang
The Wide-Field Infrared Survey Explorer (WISE; Wright et al. 2010) satellite observed the full sky in four mid-infrared bands in the 2.8 to 28 micron range. The primary mission was completed in 2010. The WISE team have done a superb job of producing a series of high-quality, well-documented, complete Data Releases in a timely manner. However, the "Atlas Image" coadds that are part of the recent AllWISE and previous data releases were intentionally blurred. Convolving the images by the point-spread function while coadding results in "matched-filtered" images that are close to optimal for detecting isolated point sources. But these matched-filtered images are sub-optimal or inappropriate for other purposes. For example, we are photometering the WISE images at the locations of sources detected in the Sloan Digital Sky Survey (York et al. 2000) through forward modeling, and this blurring decreases the available signal-to-noise by effectively broadening the point-spread function. This paper presents a new set of coadds of the WISE images that have not been blurred. These images retain the intrinsic resolution of the data and are appropriate for photometry preserving the available signal-to-noise. Users should be cautioned, however, that the W3- and W4-band coadds contain artifacts around large, bright structures (large galaxies, dusty nebulae, etc); eliminating these artifacts is the subject of ongoing work. These new coadds, and the code used to produce them, are publicly available at http://unwise.me .
Simone Ferraro, Blake D. Sherwin, David N. Spergel
The Integrated Sachs-Wolfe effect (ISW) measures the decay of the
gravitational potential due to cosmic acceleration and is thus a direct probe
of Dark Energy. In some of the earlier studies, the amplitude of the ISW effect
was found to be in tension with the predictions of the standard $\Lambda$CDM
model. We measure the cross-power of galaxies and AGN from the WISE mission
with CMB temperature data from WMAP9 in order to provide an independent
measurement of the ISW amplitude. Cross-correlations with the recently released
Planck lensing potential maps are used to calibrate the bias and contamination
fraction of the sources, thus avoiding systematic effects that could be present
when using auto-spectra to measure bias. We find an amplitude of the
cross-power of $\mathcal{A} = 1.24\pm 0.47$ from the galaxies and $\mathcal{A}
= 0.88 \pm 0.74$ from the AGN, fully consistent with the $\Lambda$CDM
prediction of $\mathcal{A} =1$. The ISW measurement signal-to-noise ratio is
2.7 and 1.2 respectively, giving a combined significance close to $3 \sigma$.
Comparing the amplitudes of the galaxy and AGN cross-correlations, which arise
from different redshifts, we find no evidence for redshift evolution in Dark
Energy properties, consistent with a Cosmological Constant.
Authors' comments: 9 pages, 8 figures
L. D. Anderson, T. M. Bania, Dana S. Balser, V. Cunningham, T. V. Wenger, B. M. Johnstone, W. P. Armentrout
Using data from the all-sky Wide-Field Infrared Survey Explorer (WISE)
satellite, we made a catalog of over 8000 Galactic HII regions and HII region
candidates by searching for their characteristic mid-infrared (MIR) morphology.
WISE has sufficient sensitivity to detect the MIR emission from HII regions
located anywhere in the Galactic disk. We believe this is the most complete
catalog yet of regions forming massive stars in the Milky Way. Of the ~8000
cataloged sources, ~1500 have measured radio recombination line (RRL) or
H$\alpha$ emission, and are thus known to be HII regions. This sample improves
on previous efforts by resolving HII region complexes into multiple sources and
by removing duplicate entries. There are ~2500 candidate HII regions in the
catalog that are spatially coincident with radio continuum emission. Our
group's previous RRL studies show that ~95% of such targets are HII regions. We
find that ~500 of these candidates are also positionally associated with known
HII region complexes, so the probability of their being bona fide HII regions
is even higher. At the sensitivity limits of existing surveys, ~4000 catalog
sources show no radio continuum emission. Using data from the literature, we
find distances for ~1500 catalog sources, and molecular velocities for ~1500
HII region candidates.
Authors' comments: Accepted to ApJS. Play with catalog contents here:
http://astro.phys.wvu.edu/wise/