Jana Rodriguez Hertz
On a Peano continuum, all local stable and unstable components of a continuum-wise expansive homeomorphism are nontrivial. In particular, there is sensitive dependence on initial conditions. This generalizes results in \cite{h,l} on the absence of Lyapunov stable points (weak sinks) and the existence of nontrivial stable and unstable components for expansive homeomorphisms on Peano continua. We also use this fact to generalize a result in \cite{ktt}: a Peano curve $X$ admitting a continuum-wise expansive homeomorphism is nowhere rim-countable. However, the question of whether such dynamics are possible on a locally planar Peano curve remains unresolved. Some other questions are posed.
Itai Benjamini, Gady Kozma, Dan Romik
We construct examples of a random walk with pairwise-independent steps which is almost-surely bounded, and for any $m$ and $k$ a random walk with $k$-wise independent steps which has no stationary distribution modulo $m$.
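For readers unfamiliar with pairwise independence, here is a minimal sketch of a standard textbook construction, which is not the construction of this paper. Steps of $\pm 1$ built from parities of a few independent fair bits are pairwise independent without being mutually independent. The function names are illustrative only.

```python
from itertools import product


def parity_steps(bits):
    """Return one +/-1 step per nonempty subset of the seed bits.

    Subsets are encoded as bitmasks 1 .. 2**n - 1. The step for a subset is
    +1 if the XOR of the selected bits is 0, and -1 otherwise.
    """
    n = len(bits)
    steps = []
    for mask in range(1, 2 ** n):
        parity = 0
        for i in range(n):
            if (mask >> i) & 1:
                parity ^= bits[i]
        steps.append(1 - 2 * parity)
    return steps


def is_pairwise_independent(n):
    """Exhaustively check pairwise independence over all 2**n equally likely seeds."""
    outcomes = [parity_steps(seed) for seed in product((0, 1), repeat=n)]
    m = 2 ** n - 1
    for a in range(m):
        for b in range(a + 1, m):
            counts = {}
            for steps in outcomes:
                key = (steps[a], steps[b])
                counts[key] = counts.get(key, 0) + 1
            # Independence of two fair signs: all four sign pairs equally likely.
            if len(counts) != 4 or len(set(counts.values())) != 1:
                return False
    return True
```

In this toy construction the step for the subset {0, 1} is always the product of the steps for {0} and {1}, so the steps are not mutually independent even though every pair is.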
Alonso Botero, Benni Reznik
We address the decomposition of a multi-mode pure Gaussian state with respect to a bi-partite division of the modes. For any such division the state can always be expressed as a product state involving entangled two-mode squeezed states and single mode local states at each side. The character of entanglement of the state can therefore be understood modewise; that is, a given mode on one side is entangled with only one corresponding mode of the other, and therefore the total bi-partite entanglement is the sum of the modewise entanglement. This decomposition is generally not applicable to all mixed Gaussian states. However, the result can be extended to a special family of "isotropic" states, characterized by a phase space covariance matrix with a completely degenerate symplectic spectrum.
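As a hedged illustration of the modewise structure described above, the decomposition can be written schematically as follows. The labels $\tilde A_k$, $\tilde B_k$ for the paired local modes, $s$ for the number of entangled pairs, $\lambda_k$ for the squeezing parameters, and $|\phi\rangle_{A'}$, $|\phi'\rangle_{B'}$ for the products of the remaining single-mode local states are notation assumed here, not taken from the abstract.

$$
|\psi\rangle_{AB} = \Big(\bigotimes_{k=1}^{s} |\psi_k\rangle_{\tilde A_k \tilde B_k}\Big) \otimes |\phi\rangle_{A'} \otimes |\phi'\rangle_{B'},
\qquad
|\psi_k\rangle = \sqrt{1-\lambda_k^2}\,\sum_{n=0}^{\infty} \lambda_k^{n}\, |n\rangle_{\tilde A_k} |n\rangle_{\tilde B_k}
$$

$$
E(A:B) = \sum_{k=1}^{s} S\big(\rho_{\tilde A_k}\big)
$$

Here $S$ denotes the von Neumann entropy of the reduced state of a single paired mode, so the total bipartite entanglement is the sum of the modewise contributions.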
UKQCD collaboration, G. N. Lacagnina
The h+, h_A1 form factors for the semi-leptonic B->D and B->D* decays are
evaluated in quenched lattice QCD with beta=6.2. The action and the operators
are fully O(a) non-perturbatively improved. The Isgur-Wise function is
evaluated and fitted to extract its slope; the latter is found to be
rho^2=1.1(2)(3) from the B->D* decay and rho^2=1.0(2)(3) from the B->D decay.
The form factor ratios R_1, R_2 are evaluated and found to be in agreement
with experimental determinations.
Authors' comments: Lattice 2001 proc., 3 pages, 3 figures
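For reference, the slope quoted above is conventionally defined through the expansion of the Isgur-Wise function about zero recoil. The symbols $\xi$ for the function and $w = v \cdot v'$ for the product of the meson four-velocities are the usual conventions and are assumed here rather than stated in the abstract.

$$
\xi(w) = 1 - \rho^2\,(w-1) + O\big((w-1)^2\big), \qquad \xi(1) = 1, \qquad \rho^2 = -\,\xi'(1)
$$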
[UKQCD Collaboration], G. Douglas
We derive the form factors relevant for decays of pseudo-scalar mesons
corresponding to the semi-leptonic decay B->D l nu. The simulations are
performed in the quenched approximation at beta=6.0 and beta=6.2 using a
non-perturbatively improved clover action. The slope of the Isgur-Wise function
and |Vcb| are extracted from the form factors.
Authors' comments: LaTeX. 6 PostScript figures. Contribution to LATTICE99(Heavy Quarks)
N. G. Stefanis, A. I. Karanikas, C. N. Ktorides
An effective field theoretic description of the Isgur-Wise function in
heavy-meson transitions is presented which emulates soft interactions by a
fermion worldline with an infinitesimal self-intersecting loop. A
point-splitting regularization technique is used, which replaces pointlike
worldlines by ``ribbons'' in the sense of Witten. The calculated vertex
function is correctly normalized, does not depend on the heavy-quark mass, and
complies with different sets of recent experimental data of the ARGUS and CLEO
collaborations.
Authors' comments: LaTeX, using Worldstyle and comprising three eps files (5 pages in
total). Invited talk presented by the first author at the International
Workshop Quark Confinement and the Hadron Spectrum II, June 26-29, 1996,
Villa Olmo, Como, Italy. To appear in the Proceedings (World Scientific)
David E. Brahm, James Walden
We investigate the relations that must hold among baryonic Isgur-Wise
functions $\eta_i$ in the large-N_c limit from unitarity constraints, and
compare to those found by Chow using the Skyrme model [or SU(4)]. Given the
exponential dropoff of the $\eta_i$ away from threshold, unitarity requires
only that the usual normalization conditions hold at w=1, and that
$\eta=\eta_1$ near threshold. Our results are consistent with, but less
powerful than, the Skyrme model relations.
Authors' comments: 6 pages, LaTeX, 3 figures using epsf.tex. More detailed analysis, but
the result is unchanged
B. Holdom, M. Sutherland
The original de Rafael-Taron bound on the slope of the Isgur-Wise function at
zero recoil is known to be violated in QCD by singularities appearing in an
unphysical region. To be consistent, quark models must have corresponding
singularity structures. In an existing relativistic quark-loop model, the
meson-quark-antiquark vertex is such that the required singularity is an
anomalous threshold. We also discuss the implications of another anomalous
threshold, whose location is determined by quark masses alone.
Authors' comments: 8 pages, LaTeX, 4 LaTeX figures in separate uufile, UTPT-94-07
Taichiro Kugo, Mark G. Mitchard, Yuhsuke Yoshida
We develop the improved ladder approximation to QCD in order to apply it to
the heavy quark mesons. The resulting Bethe-Salpeter equation is expanded in
powers of the inverse heavy quark mass 1/M, and is shown to be consistent with
the heavy quark spin symmetry. We calculate numerically the universal leading
order BS amplitude for heavy pseudoscalar and vector mesons, and use this to
evaluate the Isgur-Wise function and the decay constant F_B. The resulting
Isgur-Wise function predicts a large charge radius, rho^2 = 1.8 - 2.0, which
when fitted to the ARGUS data corresponds to the value Vcb = .044 - .050 for
the Kobayashi-Maskawa matrix element.
Authors' comments: 26 pages, Plain TeX, 1 epsf and 6 PostScript files are included, KUNS
1234
Claude W. Bernard, Yue Shen, Amarjit Soni
We review our method and numerical results for calculation of the Isgur-Wise
function on the lattice. We present a discussion of the systematic errors.
Using recent experimental results, we find $V_{cb} = 0.044\pm 0.005\pm .007$.
Contribution to Lattice '93 proceedings.
Authors' comments: 3 pages, 1 postscript figure attached. Preprint BUHEP-93-27, Wash. U.
HEP/93-36
Laurent Lellouch, [UKQCD Collaboration]
We calculate the Isgur-Wise function by measuring the elastic scattering
amplitude of a $D$ meson in the quenched approximation on a $24^3\times48$
lattice at $\beta=6.2$, using an $O(a)$-improved fermion action. We use this
result, in conjunction with heavy-quark symmetry, to extract $|V_{cb}|$ from
the experimentally measured $\bar B\to D^*l\bar\nu\,$\ differential decay
width.
Authors' comments: 3 pages, uuencoded compressed postscript file, to appear in the
Proceedings of the International Europhysics Conference on High Energy
Physics, Marseille, July 22-28, 1993. Southampton Preprint 93/94-06
[the UKQCD Collaboration]
We calculate the Isgur-Wise function by measuring the elastic scattering
amplitude of a $D$ meson in the quenched approximation on a $24^3\times48$
lattice at $\beta=6.2$, using an $O(a)$-improved fermion action. Fitting the
resulting chirally-extrapolated Isgur-Wise function to Stech's
relativistic-oscillator parametrization, we obtain a slope parameter
$\rho^2=1.2^{+7}_{-3}$. We then use this result, in conjunction with heavy-quark
symmetry, to extract $V_{cb}$\ from the experimentally measured $\bar B\to
D^*l\bar\nu\,$\ differential decay width. We find
$|V_{cb}|\sqrt{\tau_B/1.48\,\mathrm{ps}}= 0.038^{+2}_{-2}{}^{+8}_{-3}$, where the first set
of errors is due to experimental uncertainties, while the second is due to the
uncertainty in our lattice determination of $\rho^2$.
Authors' comments: 11 postscript pages + 3 postscript figures, all in one uuencoded,
compressed, tar file. This is the published version (Phys. Rev. Lett. 72, pp.
462-465 (1994)). The lattice data were partially re-analyzed and our final
results for the slope and Vcb differ from those of our first submission. The
text was also trimmed slightly to fit page requirements
Adam F. Falk, Michael Luke, Mark B. Wise
We reconsider the recent derivation by de Rafael and Taron of bounds on the
slope of the Isgur-Wise function. We argue that one must be careful to include
cuts starting below the heavy meson pair production threshold, arising from
heavy quark-antiquark bound states, and that if such cuts are properly
accounted for then no constraints may be derived.
Authors' comments: 8 pages, uses harvmac, SLAC-PUB-5956, UCSD/PTH 92-35, CALT-68-1830
Jeffrey E. Mandula, Michael C. Ogilvie
We construct the Isgur-Wise limit of QCD in a form appropriate to lattice
gauge theory techniques. The formulation permits a calculation of heavy quark
processes even when the momentum transfers are much larger than the inverse
lattice spacing. Applications include semi-leptonic heavy quark decay and
scattering processes, including the computation of the nonperturbative part of
the Isgur-Wise universal function.
Authors' comments: Talk given at the 1992 International Lattice Gauge Theory Conference
("Lattice '92"), Amsterdam, 4 pages, in postscript
Tao Feng, Pengrui Han, Guanyu Lin, Ge Liu, Jiaxuan You
Large language models (LLMs) have transformed AI research thanks to their powerful internal capabilities and knowledge. However, existing LLMs still fail to effectively incorporate the massive external knowledge when interacting with the world. Although retrieval-augmented LLMs are proposed to mitigate the issue, they are still fundamentally constrained by the context length of LLMs, as they can only retrieve top-K raw data chunks from the external knowledge base which often consists of millions of data chunks. Here we propose Thought-Retriever, a novel model-agnostic algorithm that helps LLMs generate output conditioned on arbitrarily long external data, without being constrained by the context length or number of retrieved data chunks. Our key insight is to let an LLM fully leverage its intermediate responses generated when solving past user queries (thoughts), filtering meaningless and redundant thoughts, organizing them in thought memory, and retrieving the relevant thoughts when addressing new queries. This effectively equips LLM-based agents with a self-evolving long-term memory that grows more capable through continuous interaction. Besides algorithmic innovation, we further meticulously prepare a novel benchmark, AcademicEval, which requires an LLM to faithfully leverage ultra-long context to answer queries based on real-world academic papers. Extensive experiments on AcademicEval and two other public datasets validate that Thought-Retriever remarkably outperforms state-of-the-art baselines, achieving an average increase of at least 7.6% in F1 score and 16% in win rate across various tasks. More importantly, we further demonstrate two exciting findings: (1) Thought-Retriever can indeed help LLM self-evolve after solving more user queries; (2) Thought-Retriever learns to leverage deeper thoughts to answer more abstract user queries.
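The filter, store, and retrieve loop described above can be sketched roughly as follows. This is a toy illustration under stated assumptions, not the authors' implementation: the class name `ThoughtMemory`, the bag-of-words cosine similarity, the token-count test for "meaningless" thoughts, and both thresholds are hypothetical stand-ins for whatever the paper actually uses.

```python
import math
import re
from collections import Counter


def _vec(text):
    """Bag-of-words vector over lowercase alphanumeric tokens (toy embedding)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def _cosine(a, b):
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ThoughtMemory:
    """Hypothetical store of past intermediate responses ("thoughts")."""

    def __init__(self, redundancy_threshold=0.9, min_tokens=3):
        self.redundancy_threshold = redundancy_threshold  # assumed value
        self.min_tokens = min_tokens                      # assumed value
        self.thoughts = []

    def add(self, thought):
        """Store a thought unless it is too short or near-duplicates a stored one."""
        vec = _vec(thought)
        if sum(vec.values()) < self.min_tokens:
            return False  # crude stand-in for a "meaningless" filter
        for stored in self.thoughts:
            if _cosine(vec, _vec(stored)) >= self.redundancy_threshold:
                return False  # redundant with an existing thought
        self.thoughts.append(thought)
        return True

    def retrieve(self, query, k=2):
        """Return the k stored thoughts most similar to the query."""
        q = _vec(query)
        ranked = sorted(self.thoughts, key=lambda t: _cosine(q, _vec(t)), reverse=True)
        return ranked[:k]
```

In a full system the retrieved thoughts would be placed in the model's context when answering a new query, and the new answer would in turn be offered to `add`.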
Kaustubh D. Dhole
Retrieval-Augmented Generation equips large language models with the
capability to retrieve external knowledge, thereby mitigating hallucinations by
incorporating information beyond the model's intrinsic abilities. However, most
prior works have focused on invoking retrieval deterministically, which makes
it unsuitable for tasks such as long-form question answering. Instead,
dynamically performing retrieval by invoking it only when the underlying LLM
lacks the required knowledge can be more efficient. In this context, we delve
deeper into the question, "To Retrieve or Not to Retrieve?" by exploring
multiple uncertainty detection methods. We evaluate these methods for the task
of long-form question answering, employing dynamic retrieval, and present our
comparisons. Our findings suggest that uncertainty detection metrics, such as
Degree Matrix Jaccard and Eccentricity, can reduce the number of retrieval
calls by almost half, with only a slight reduction in question-answering
accuracy.
Authors' comments: 1st workshop of "Quantify Uncertainty and Hallucination in Foundation
Models: The Next Frontier in Reliable AI" at ICLR 2025
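A minimal sketch of uncertainty-gated retrieval, assuming the degree-matrix uncertainty is computed as trace(mI - D)/m^2 over a Jaccard similarity matrix of m sampled responses. That formula is our reading of the metric from the uncertainty-estimation literature, and the whitespace tokenization, the function names, and the threshold are hypothetical choices, not details taken from this abstract.

```python
def jaccard(a, b):
    """Jaccard similarity between the word sets of two responses."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


def degree_uncertainty(responses):
    """Assumed degree-matrix uncertainty: trace(m*I - D) / m**2.

    D is the diagonal degree matrix of the pairwise Jaccard similarities,
    including each response's similarity of 1 with itself.
    """
    m = len(responses)
    if m == 0:
        raise ValueError("need at least one sampled response")
    degrees = [sum(jaccard(r, s) for s in responses) for r in responses]
    return sum(m - d for d in degrees) / (m * m)


def should_retrieve(responses, threshold=0.5):
    """Invoke retrieval only when the sampled responses disagree enough."""
    return degree_uncertainty(responses) > threshold
```

With this gating, a query whose sampled answers agree skips the retrieval call entirely, which is the source of the efficiency gain the abstract reports.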
Thong Nguyen, Andrew Yates
Generative retrieval is a promising new neural retrieval paradigm that aims
to optimize the retrieval pipeline by performing both indexing and retrieval
with a single transformer model. However, this new paradigm faces challenges
with updating the index and scaling to large collections. In this paper, we
analyze two prominent variants of generative retrieval and show that they can
be conceptually viewed as bi-encoders for dense retrieval. Specifically, we
analytically demonstrate that the generative retrieval process can be
decomposed into dot products between query and document vectors, similar to
dense retrieval. This analysis leads us to propose a new variant of generative
retrieval, called Tied-Atomic, which addresses the updating and scaling issues
by incorporating techniques from dense retrieval. In experiments on two
datasets, NQ320k and the full MSMARCO, we confirm that this approach does not
reduce retrieval effectiveness while enabling the model to scale to large
collections.
Authors' comments: GenIR@SIGIR2023
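The dot-product view described above can be illustrated with a toy sketch for atomic document identifiers, where each document has a single identifier token. It assumes the identifier logit is the dot product of the final decoder state with that token's output embedding. All names and vectors are hypothetical, and this is not the Tied-Atomic model itself.

```python
import math


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def atomic_id_probabilities(query_state, docid_embeddings):
    """Softmax over document-identifier logits.

    Each logit is dot(decoder state for the query, output embedding of the
    identifier token), which is the assumed form of the generation step.
    """
    logits = {d: dot(query_state, e) for d, e in docid_embeddings.items()}
    top = max(logits.values())
    exps = {d: math.exp(l - top) for d, l in logits.items()}
    total = sum(exps.values())
    return {d: x / total for d, x in exps.items()}


def rank_generative(query_state, docid_embeddings):
    """Rank documents by generation probability of their identifier."""
    probs = atomic_id_probabilities(query_state, docid_embeddings)
    return sorted(probs, key=probs.get, reverse=True)


def rank_dense(query_vec, doc_vecs):
    """Rank documents bi-encoder style, by raw dot product."""
    return sorted(doc_vecs, key=lambda d: dot(query_vec, doc_vecs[d]), reverse=True)
```

Because softmax is monotone in the logits, both rankings coincide whenever the same vectors are used, which is the sense in which the identifier embeddings act like document vectors in dense retrieval.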
Saeid Bahmanpour, Jameson Cahill, Peter G. Casazza, John Jasper, Lindsey M. Woodland
Phase retrieval has become a very active area of research. We will classify when phase retrieval by Parseval frames passes to the Naimark complement and when phase retrieval by projections passes to the orthogonal complements. We introduce a new concept we call norm retrieval and show that this is what is necessary for passing phase retrieval to complements. This leads to a detailed study of norm retrieval and its relationship to phase retrieval. One fundamental result: a frame $\{\varphi_i\}_{i=1}^M$ yields phase retrieval if and only if $\{T\varphi_i\}_{i=1}^M$ yields norm retrieval for every invertible operator $T$.
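For orientation, the two notions can be stated as follows for a frame in a finite-dimensional Hilbert space. This is the way the definitions are usually phrased and is given here as an assumption about the paper's conventions.

$$
\text{Phase retrieval:}\quad |\langle x,\varphi_i\rangle| = |\langle y,\varphi_i\rangle| \ \ \forall i \ \Longrightarrow\ x = c\,y \ \text{for some } |c| = 1
$$

$$
\text{Norm retrieval:}\quad |\langle x,\varphi_i\rangle| = |\langle y,\varphi_i\rangle| \ \ \forall i \ \Longrightarrow\ \|x\| = \|y\|
$$

Phase retrieval implies norm retrieval. A Parseval frame always yields norm retrieval, since $\sum_{i=1}^M |\langle x,\varphi_i\rangle|^2 = \|x\|^2$ recovers the norm directly from the measurements.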
Shengyao Zhuang, Zhichao Xu, Ivano Lauriola
Transformer-based document cross-encoder rerankers are a central component of modern information retrieval systems. Despite their success, these models suffer from high computational costs due to processing long query-document sequences at inference time. A known approach to improve efficiency is token compression, which consists of aggregating groups of tokens together in the initial embedding layer, reducing the effective number of tokens, and making the computation faster. While token compression has proven to be successful for bi-encoder retrievers, we empirically observed that this approach may be ineffective for cross-encoder rerankers. In this paper, we propose Layer-wise Token Compression (LTC), which applies adaptive token pooling at intermediate transformer layers. Through extensive ablation studies on MS MARCO passage and document ranking tasks, we demonstrate that compression at middle layers preserves ranking quality while increasing inference QPS by up to 25% for passage ranking and up to 116% for document ranking. We also extend LTC to listwise LLM rerankers and show that the same approach can be easily applied to long-context listwise reranking, where the QPS improvements are even greater. More surprisingly, when applying rerankers trained on short passages to long-document ranking tasks, models trained with compression outperform their uncompressed counterparts, suggesting that compression may act as a beneficial regularizer that encourages length-invariant representations.
Authors' comments: SIGIR2026 short paper
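The idea of compressing tokens at an intermediate layer rather than at the embedding layer can be sketched as follows. This is a simplified illustration: fixed-window mean pooling stands in for the paper's adaptive token pooling, layers are modeled as plain callables on a list of token vectors, and the names `mean_pool_tokens`, `run_layers`, `compress_at`, and `group_size` are hypothetical.

```python
def mean_pool_tokens(hidden_states, group_size):
    """Average consecutive groups of token vectors; the last group may be shorter."""
    if group_size < 1:
        raise ValueError("group_size must be at least 1")
    pooled = []
    for start in range(0, len(hidden_states), group_size):
        group = hidden_states[start:start + group_size]
        dim = len(group[0])
        pooled.append([sum(vec[j] for vec in group) / len(group) for j in range(dim)])
    return pooled


def run_layers(hidden_states, layers, compress_at, group_size):
    """Apply layers in order, pooling tokens just before layer `compress_at`.

    Layers before that index see the full sequence; later layers see the
    shortened one, which is where the compute savings would come from.
    """
    for index, layer in enumerate(layers):
        if index == compress_at:
            hidden_states = mean_pool_tokens(hidden_states, group_size)
        hidden_states = layer(hidden_states)
    return hidden_states
```

Setting `compress_at=0` in this sketch corresponds to compressing at the input of the first layer, the closest analogue here of the embedding-layer baseline the abstract contrasts against.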
Jiachen Ma, Jiawen Zhang, Xiangtian Li, Bo Zou, Chaochao Lu, Chao Yang
While Large Language Models (LLMs) demonstrate remarkable capabilities, they remain susceptible to sophisticated, multi-step jailbreak attacks that circumvent conventional surface-level safety alignment by exploiting the internal generation process. To address these vulnerabilities, we propose Reflector, a principled two-stage framework that internalizes self-reflection within the generation trajectory. Reflector first leverages teacher-guided generation to produce high-quality reflection data for supervised fine-tuning (SFT), establishing structured reflection patterns. It subsequently uses Reinforcement Learning (RL) with outcome-driven and reward-validity supervision to instill robust, autonomous self-reflection capabilities. Empirical results show that Reflector achieves Defense Success Rates (DSR) exceeding 90% against complex indirect attacks while generalizing robustly across diverse threat scenarios. Notably, the framework enhances both task-specific and general utility, yielding a 5.85% gain on GSM8K alongside improved performance on knowledge-intensive benchmarks. By internalizing trajectory-level safety, Reflector overcomes the fundamental limitations of surface alignment without significant computational overhead, offering an efficient and scalable solution for the development of safe and capable LLMs.
Authors' comments: ICML 2026