Rina Foygel, Mathias Drton
The group lasso is a penalized regression method, used in regression problems
where the covariates are partitioned into groups to promote sparsity at the
group level. Existing methods for finding the group lasso estimator either use
gradient projection methods to update the entire coefficient vector
simultaneously at each step, or update one group of coefficients at a time
using an inexact line search to approximate the optimal value for the group of
coefficients when all other groups' coefficients are fixed. We present a new
method of computation for the group lasso in the linear regression case, the
Single Line Search (SLS) algorithm, which operates by computing the exact
optimal value for each group (when all other coefficients are fixed) with one
univariate line search. We perform simulations demonstrating that the SLS
algorithm is often more efficient than existing computational methods. We also
extend the SLS algorithm to the sparse group lasso problem via the Signed
Single Line Search (SSLS) algorithm, and give theoretical results to support
both algorithms.
Authors' comments: We have been made aware of the earlier work by Puig et al. (2009)
which derives the same result for the (non-sparse) group lasso setting. We
leave this manuscript available as a technical report, to serve as a
reference for the previously untreated sparse group lasso case, and for
timing comparisons of various methods in the group lasso setting. The
manuscript is updated to include this reference
María Gómez Rocha, Wolfgang Schweiger
We investigate electromagnetic and weak form factors of heavy-light mesons in
the context of point-form relativistic quantum mechanics. To this aim we treat
the physical processes from which such electroweak form factors are extracted
by means of a coupled channel approach which accounts for the dynamics of the
intermediate gauge bosons. It is shown that heavy-quark symmetry is respected
by this formulation. A simple analytical expression is obtained for the
Isgur-Wise function in the heavy-quark limit. Breaking of heavy-quark symmetry
due to realistic values of the heavy-quark mass are studied numerically.
Authors' comments: Contribution based on a talk by Maria Gomez Rocha at the
Mini-Workshop in Bled, July 4-11, 2010
Edward L. Wright, Peter R. M. Eisenhardt, Amy Mainzer, Michael E. Ressler, Roc M. Cutri, Thomas Jarrett, J. Davy Kirkpatrick, Deborah Padgett et al.
The all sky surveys done by the Palomar Observatory Schmidt, the European
Southern Observatory Schmidt, and the United Kingdom Schmidt, the InfraRed
Astronomical Satellite and the 2 Micron All Sky Survey have proven to be
extremely useful tools for astronomy with value that lasts for decades. The
Wide-field Infrared Survey Explorer is mapping the whole sky following its
launch on 14 December 2009. WISE began surveying the sky on 14 Jan 2010 and
completed its first full coverage of the sky on July 17. The survey will
continue to cover the sky a second time until the cryogen is exhausted
(anticipated in November 2010). WISE is achieving 5 sigma point source
sensitivities better than 0.08, 0.11, 1 and 6 mJy in unconfused regions on the
ecliptic in bands centered at wavelengths of 3.4, 4.6, 12 and 22 microns.
Sensitivity improves toward the ecliptic poles due to denser coverage and lower
zodiacal background. The angular resolution is 6.1, 6.4, 6.5 and 12.0
arc-seconds at 3.4, 4.6, 12 and 22 microns, and the astrometric precision for
high SNR sources is better than 0.15 arc-seconds.
Authors' comments: 22 pages with 19 included figures. Updated to better match the
accepted version in the AJ
Petros Drineas, Anastasios Zouzias
Given an n x n matrix A, we present a simple, element-wise sparsification
algorithm that zeroes out all sufficiently small elements of A and then retains
some of the remaining elements with probabilities proportional to the square of
their magnitudes. We analyze the approximation accuracy of the proposed
algorithm using a recent, elegant non-commutative Bernstein inequality, and
compare our bounds with all existing (to the best of our knowledge)
element-wise matrix sparsification algorithms.
Authors' comments: 8 pages
Yu-Min Yen
In this short report, we discuss how coordinate-wise descent algorithms can
be used to solve minimum variance portfolio (MVP) problems in which the
portfolio weights are constrained by $l_{q}$ norms, where $1\leq q \leq 2$. A
portfolio which weights are regularised by such norms is called a sparse
portfolio (Brodie et al.), since these constraints facilitate sparsity (zero
components) of the weight vector. We first consider a case when the portfolio
weights are regularised by a weighted $l_{1}$ and squared $l_{2}$ norm. Then
two benchmark data sets (Fama and French 48 industries and 100 size and BM
ratio portfolios) are used to examine performances of the sparse portfolios.
When the sample size is not relatively large to the number of assets, sparse
portfolios tend to have lower out-of-sample portfolio variances, turnover
rates, active assets, short-sale positions, but higher Sharpe ratios than the
unregularised MVP. We then show some possible extensions; particularly we
derive an efficient algorithm for solving an MVP problem in which assets are
allowed to be chosen grouply.
Authors' comments: This paper has been withdrawn by the author due to a crucial sign
error in equation 1
Alberto Lovison
We propose a strategy for approximating Pareto optimal sets based on the
global analysis framework proposed by Smale (Dynamical systems, New York, 1973,
pp. 531-544). The method highlights and exploits the underlying manifold
structure of the Pareto sets, approximating Pareto optima by means of
simplicial complexes. The method distinguishes the hierarchy between singular
set, Pareto critical set and stable Pareto critical set, and can handle the
problem of superposition of local Pareto fronts, occurring in the general
nonconvex case. Furthermore, a quadratic convergence result in a suitable
set-wise sense is proven and tested in a number of numerical examples.
Authors' comments: 29 pages, 12 figures
Jop Briet, Harry Buhrman, Troy Lee, Thomas Vidick
XOR games are a simple computational model with connections to many areas of
complexity theory. Perhaps the earliest use of XOR games was in the study of
quantum correlations. XOR games also have an interesting connection to
Grothendieck's inequality, a fundamental theorem of analysis, which shows that
two players sharing entanglement can achieve at most a constant factor
advantage over players following classical strategies in an XOR game.
Perez-Garcia et al. show that when the players share GHZ states, this
advantage is bounded by a constant. We use a multilinear generalization of
Grothendieck's inequality due to Blei and Tonge to simplify the proof of the
second result and extend it to the case of so-called Schmidt states, answering
an open problem of Perez-Garcia et al. Via a reduction given in that paper,
this answers a 35-year-old problem in operator algebras due to Varopoulos,
showing that the space of compact operators on a Hilbert space is a Q-algebra
under Schur product.
A further generalization of Grothendieck's inequality due to Carne lets us
show that the gap between the entangled and classical value is at most a
constant in any multiplayer XOR game in which the players are allowed to share
combinations of GHZ states and EPR pairs of any dimension.
As an application of our results, we show that the discrepancy method in
communication complexity remains a lower bound in the multiparty model where
the players have quantum communication and the kinds of entanglement discussed
above. This answers an open question of Lee, Schechtman, and Shraibman.
Authors' comments: 26 pages
Fabian Ziltener
Let $(M,\omega)$ be a symplectic manifold, $N\subseteq M$ a coisotropic
submanifold, and $\Sigma$ a compact oriented (real) surface. I define a natural
Maslov index for each continuous map $u:\Sigma\to M$ that sends every connected
component of $\partial\Sigma$ to some isotropic leaf of $N$. This index is real
valued and generalizes the usual Lagrangian Maslov index. The idea is to use
the linear holonomy of the isotropic foliation of $N$ to compensate for the
loss of boundary data in the case codimension $N<\dim M/2$. The definition is
based on the Salamon-Zehnder (mean) Maslov index of a path of linear symplectic
automorphisms. I prove a lower bound on the number of leafwise fixed points of
a Hamiltonian diffeomorphism, if $(M,\omega)$ is geometrically bounded and $N$
is closed, regular (i.e. "fibering"), and monotone. As an application, we
obtain a presymplectic non-embedding result. I also prove a coisotropic version
of the Audin conjecture.
Authors' comments: 47 pages
Jan O. Eeg, Kresimir Kumericki
We consider the Isgur-Wise function xi(omega) within a new modified version
of a heavy-light chiral quark model. While early versions of such models gave
too small absolute value of the slope, namely xi'(1) of about -0.4 to -0.3, we
show how extended version(s) may lead to values around -1, in better agreement
with recent measurements. This is obtained by introducing a new mass parameter
in the heavy quark propagator. We also shortly comment on the consequences for
the decay modes B --> D D-bar.
Authors' comments: 20 pages, 7 PS figure, LaTeX
[ETM Collaboration], Benoit Blossier, Marc Wagner, Olivier Pene
We perform a two-flavor dynamical lattice computation of the Isgur-Wise
functions tau_{1/2} and tau_{3/2} at zero recoil in the static limit. We find
tau_{1/2}(1) = 0.297(26) and tau_{3/2}(1) = 0.528(23) fulfilling Uraltsev's sum
rule by around 80%. We also comment on a persistent conflict between theory and
experiment regarding semileptonic decays of B mesons into orbitally excited P
wave D mesons, the so-called "1/2 versus 3/2 puzzle", and we discuss the
relevance of lattice results in this context.
Authors' comments: 7 pages, 2 figures, talk given at the XXVII International Symposium
on Lattice Field Theory, July 26 - 31 2009, Peking University, Beijing, China
Edsel A. Peña, Joshua D. Habiger, Wensong Wu
Improved procedures, in terms of smaller missed discovery rates (MDR), for
performing multiple hypotheses testing with weak and strong control of the
family-wise error rate (FWER) or the false discovery rate (FDR) are developed
and studied. The improvement over existing procedures such as the \v{S}id\'ak
procedure for FWER control and the Benjamini--Hochberg (BH) procedure for FDR
control is achieved by exploiting possible differences in the powers of the
individual tests. Results signal the need to take into account the powers of
the individual tests and to have multiple hypotheses decision functions which
are not limited to simply using the individual $p$-values, as is the case, for
example, with the \v{S}id\'ak, Bonferroni, or BH procedures. They also enhance
understanding of the role of the powers of individual tests, or more precisely
the receiver operating characteristic (ROC) functions of decision processes, in
the search for better multiple hypotheses testing procedures. A
decision-theoretic framework is utilized, and through auxiliary randomizers the
procedures could be used with discrete or mixed-type data or with rank-based
nonparametric tests. This is in contrast to existing $p$-value based procedures
whose theoretical validity is contingent on each of these $p$-value statistics
being stochastically equal to or greater than a standard uniform variable under
the null hypothesis. Proposed procedures are relevant in the analysis of
high-dimensional "large $M$, small $n$" data sets arising in the natural,
physical, medical, economic and social sciences, whose generation and creation
is accelerated by advances in high-throughput technology, notably, but not
limited to, microarray technology.
Authors' comments: Published in at http://dx.doi.org/10.1214/10-AOS844 the Annals of
Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical
Statistics (http://www.imstat.org)
Elena F. Sheka
The reactions of fullerene C60 with atomic fluorine have been studied by
unrestricted broken spin-symmetry Hartree-Fock (UBS HF) approach implemented in
semiempirical codes based on AM1 technique. The calculations were focused on a
sequential addition of fluorine atom to the fullerene cage following indication
of the cage atom highest chemical susceptibility that is calculated at each
step. The effectively-non-paired-electron concept of the fullerene atoms
chemical susceptibility lays the foundation of the suggested computational
synthesis. The obtained results are analyzed from energetic, symmetry, and the
composition abundance viewpoints. A good fitting of the data to experimental
findings proves a creative role of the suggested synthesis methodology.
Authors' comments: 33 pages, 11 figures, 2 tables, 2 charts
A. Le Yaouanc, L. Oliver, J. -C. Raynal
We propose a group theoretical method to study Isgur-Wise functions. A
current matrix element splits into a heavy quark matrix element and an overlap
of the initial and final clouds, related to the IW functions, that contain the
long distance physics. The light cloud belongs to the Hilbert space of a
unitary representation of the Lorentz group. Decomposing into irreducible
representations one obtains the IW function as an integral formula,
superposition of irreducible IW functions with positive measures, providing
positivity bounds on its derivatives. Our method is equivalent to the sum rule
approach, but sheds another light on the physics and summarizes and gives all
its possible constraints. We expose the general formalism, thoroughly applying
it to the case j = 0 for the light cloud, relevant to the semileptonic decay
Lambda_b -> Lambda_c + l + nu. In this case, the principal series of the
representations contribute, and also the supplementary series. We recover the
bound for the curvature of the j = 0 IW function xi_Lambda (w) that we did
obtain from the sum rule method, and we get new bounds for higher derivatives.
We demonstrate also that if the lower bound for the curvature is saturated,
then xi_Lambda (w) is completely determined, given by an explicit elementary
function. We give criteria to decide if any ansatz for the Isgur-Wise function
is compatible or not with the sum rules. We apply the method to some simple
model forms proposed in the literature. Dealing with a Hilbert space, the sum
rules are convergent, but this feature does not survive hard gluon radiative
corrections.
Authors' comments: 70 pages
Ina Taralova, D. Fournier-Prunaret
This paper analyses the behaviour of a second order DPCM (Differential Pulse
Code Modulation) transmission system when the nonlinear characteristic of the
quantizer is taken into consideration. In this way, qualitatively new
properties of the DPCM system have been unravelled, which cannot be observed
and explained if the nonlinearity of the quantizer is neglected. For the
purposes of this study, a piece-wise linear nondifferentiable quantizer
characteristic is considered. The resulting model of the DPCM is of the form of
iteration equations (i.e. map), where the inverse iterate is not unique (i.e.
noninvertible map). Therefore the mathematical theory of noninvertible maps is
particularly suitable for this analysis, together with the more classic tools
of Non Linear Dynamics. This study allowed us in addition to show from a
theoretical point of view some new properties of nondifferentiable maps, in
comparison with differentiable ones. After a short review of noninvertible
maps, the presented methods and tools for noninvertible maps are applied to the
DPCM system. An original algorithm for calculation of bifurcation curves for
the DPCM map is proposed. Via the studies in the parameter and phase plane,
different nonlinear phenomena such as the overlapping of bifurcation curves
causing multistability, chaotic behaviour, or multiple basins with fractal
boundary are pointed out. All observed phenomena show a very complex dynamical
behaviour even in the constant input signal case, discussed here.
Authors' comments: 17 pages
Alicia A. Johnson, Galin L. Jones, Ronald C. Neath
It is common practice in Markov chain Monte Carlo to update the simulation
one variable (or sub-block of variables) at a time, rather than conduct a
single full-dimensional update. When it is possible to draw from each
full-conditional distribution associated with the target this is just a Gibbs
sampler. Often at least one of the Gibbs updates is replaced with a
Metropolis-Hastings step, yielding a Metropolis-Hastings-within-Gibbs
algorithm. Strategies for combining component-wise updates include composition,
random sequence and random scans. While these strategies can ease MCMC
implementation and produce superior empirical performance compared to
full-dimensional updates, the theoretical convergence properties of the
associated Markov chains have received limited attention. We present conditions
under which some component-wise Markov chains converge to the stationary
distribution at a geometric rate. We pay particular attention to the
connections between the convergence rates of the various component-wise
strategies. This is important since it ensures the existence of tools that an
MCMC practitioner can use to be as confident in the simulation results as if
they were based on independent and identically distributed samples. We
illustrate our results in two examples including a hierarchical linear mixed
model and one involving maximum likelihood estimation for mixed models.
Authors' comments: Published in at http://dx.doi.org/10.1214/13-STS423 the Statistical
Science (http://www.imstat.org/sts/) by the Institute of Mathematical
Statistics (http://www.imstat.org)
Ron Peled, Ariel Yadin, Amir Yehudayoff
A k-wise independent distribution on n bits is a joint distribution of the
bits such that each k of them are independent. In this paper we consider k-wise
independent distributions with identical marginals, each bit has probability p
to be 1. We address the following question: how high can the probability that
all the bits are 1 be, for such a distribution? For a wide range of the
parameters n,k and p we find an explicit lower bound for this probability which
matches an upper bound given by Benjamini et al., up to multiplicative factors
of lower order. The question we investigate can be seen as a relaxation of a
major open problem in error-correcting codes theory, namely, how large can a
linear error correcting code with given parameters be?
The question is a type of discrete moment problem, and our approach is based
on showing that bounds obtained from the theory of the classical moment problem
provide good approximations for it. The main tool we use is a bound controlling
the change in the expectation of a polynomial after small perturbation of its
zeros.
Authors' comments: 30 pages, 4 figures. This version adds an appendix with short proofs
of some of the cited results
Avishay Gal-Yam, Dan Maoz, Puragra Guhathakurta, Alexei V. Filippenko
We describe the Wise Observatory Optical Transient Search (WOOTS), a survey
for supernovae (SNe) and other variable and transient objects in the fields of
redshift 0.06-0.2 Abell galaxy clusters. We present the survey design and
data-analysis procedures, and our object detection and follow-up strategies. We
have obtained follow-up spectroscopy for all viable SN candidates, and present
the resulting SN sample here. Out of the 12 SNe we have discovered, seven are
associated with our target clusters while five are foreground or background
field events. All but one of the SNe (a foreground field event) are Type Ia
SNe. Our non-cluster SN sample is uniquely complete, since all SN candidates
have been either spectroscopically confirmed or ruled out. This allows us to
estimate that flux-limited surveys similar to WOOTS would be dominated (~80%)
by SNe Ia. Our spectroscopic follow-up observations also elucidate the
difficulty in distinguishing active galactic nuclei from SNe. In separate
papers we use the WOOTS sample to derive the SN rate in clusters for this
redshift range, and to measure the fraction of intergalactic cluster SNe. We
also briefly report here on some quasars and asteroids discovered by WOOTS.
Authors' comments: Submitted to ApJ. Comments welcome
Bharat Sukhwani, Uday Padmanabhan, Janet M. Wang
New nanotechnology based devices are replacing CMOS devices to overcome CMOS
technology's scaling limitations. However, many such devices exhibit
non-monotonic I-V characteristics and uncertain properties which lead to the
negative differential resistance (NDR) problem and the chaotic performance.
This paper proposes a new circuit simulation approach that can effectively
simulate nanotechnology devices with uncertain input sources and negative
differential resistance (NDR) problem. The experimental results show a 20-30
times speedup comparing with existing simulators.
Authors' comments: Submitted on behalf of EDAA (http://www.edaa.com/)
Satoshi Aoki, Takayuki Hibi, Hidefumi Ohsugi, Akimichi Takemura
We consider testing independence in group-wise selections with some
restrictions on combinations of choices. We present models for frequency data
of selections for which it is easy to perform conditional tests by Markov chain
Monte Carlo (MCMC) methods. When the restrictions on the combinations can be
described in terms of a Segre-Veronese configuration, an explicit form of a
Gr\"obner basis consisting of moves of degree two is readily available for
performing a Markov chain. We illustrate our setting with the National Center
Test for university entrance examinations in Japan. We also apply our method to
testing independence hypotheses involving genotypes at more than one locus or
haplotypes of alleles on the same chromosome.
Authors' comments: 25 pages, 5 figures
Anindya Dey, Debaprasad Maity, Soumitra SenGupta
The Goldberger-Wise mechanism of stabilizing modulus in the Randall-Sundrum
braneworld,by introducing a bulk scalar field with quartic interaction terms
localized at the 3-branes has been extremely popular as a stabilizing mechanism
when the back-reaction of the scalar field on the geometry is negligibly small.
In this note we re-examine the mechanism by an exact analysis without resorting
to the approximations adopted by Goldberger and Wise. An exact calculation of
the stabilization condition indicates the existence of closely spaced minimum
and a maximum for the potential and also brings out some new features involved
in the context of stabilization of such braneworld models.
Authors' comments: 5 pages, Revtex, two figures