N. G. Stefanis, A. I. Karanikas, C. N. Ktorides
An effective field theoretic description of the Isgur-Wise function in
heavy-meson transitions is presented which emulates soft interactions by a
fermion worldline with an infinitesimal self-intersecting loop. A
point-splitting regularization technique is used, which replaces pointlike
worldlines by ``ribbons'' in the sense of Witten. The calculated vertex
function is correctly normalized, does not depend on the heavy-quark mass, and
complies with different sets of recent experimental data of the ARGUS and CLEO
collaborations.
Authors' comments: LaTeX, using Worldstyle and comprising three eps files (5 pages in
total). Invited talk presented by the first author at the International
Workshop Quark Confinement and the Hadron Spectrum II, June 26-29, 1996,
Villa Olmo, Como, Italy. To appear in the Proceedings (World Scientific)
David E. Brahm, James Walden
We investigate the relations that must hold among baryonic Isgur-Wise
functions $\eta_i$ in the large-N_c limit from unitarity constraints, and
compare to those found by Chow using the Skyrme model [or SU(4)]. Given the
exponential dropoff of the $\eta_i$ away from threshold, unitarity requires
only that the usual normalization conditions hold at w=1, and that
$\eta=\eta_1$ near threshold. Our results are consistent with, but less
powerful than, the Skyrme model relations.
Authors' comments: 6 pages, LaTeX, 3 figures using epsf.tex. More detailed analysis, but
the result is unchanged
B. Holdom, M. Sutherland
The original de Rafael-Taron bound on the slope of the Isgur-Wise function at
zero recoil is known to be violated in QCD by singularities appearing in an
unphysical region. To be consistent, quark models must have corresponding
singularity structures. In an existing relativistic quark-loop model, the
meson-quark-antiquark vertex is such that the required singularity is an
anomalous threshold. We also discuss the implications of another anomalous
threshold, whose location is determined by quark masses alone.
Authors' comments: 8 pages, LaTeX, 4 LaTeX figures in separate uufile, UTPT-94-07
Taichiro Kugo, Mark G. Mitchard, Yuhsuke Yoshida
We develop the improved ladder approximation to QCD in order to apply it to
the heavy quark mesons. The resulting Bethe-Salpeter equation is expanded in
powers of the inverse heavy quark mass 1/M, and is shown to be consistent with
the heavy quark spin symmetry. We calculate numerically the universal leading
order BS amplitude for heavy pseudoscalar and vector mesons, and use this to
evaluate the Isgur-Wise function and the decay constant F_B. The resulting
Isgur-Wise function predicts a large charge radius, rho^2 = 1.8 - 2.0, which
when fitted to the ARGUS data corresponds to the value Vcb = .044 - .050 for
the Kobayashi-Maskawa matrix element.
Authors' comments: 26 pages, Plain TeX, 1 epsf and 6 PostScript files are included, KUNS
1234
Claude W. Bernard, Yue Shen, Amarjit Soni
We review our method and numerical results for calculation of the Isgur-Wise
function on the lattice. We present a discussion of the systematic errors.
Using recent experimental results, we find $V_{cb} = 0.044\pm 0.005\pm 0.007$.
Contribution to Lattice '93 proceedings.
Authors' comments: 3 pages, 1 postscript figure attached. Preprint BUHEP-93-27, Wash. U.
HEP/93-36
Laurent Lellouch, [UKQCD Collaboration]
We calculate the Isgur-Wise function by measuring the elastic scattering
amplitude of a $D$ meson in the quenched approximation on a $24^3\times48$
lattice at $\beta=6.2$, using an $O(a)$-improved fermion action. We use this
result, in conjunction with heavy-quark symmetry, to extract $|V_{cb}|$ from
the experimentally measured $\bar B\to D^*l\bar\nu\,$\ differential decay
width.
Authors' comments: 3 pages, uuencoded compressed postscript file, to appear in the
Proceedings of the International Europhysics Conference on High Energy
Physics, Marseille, July 22-28, 1993. Southampton Preprint 93/94-06
[the UKQCD Collaboration]
We calculate the Isgur-Wise function by measuring the elastic scattering
amplitude of a $D$ meson in the quenched approximation on a $24^3\times48$
lattice at $\beta=6.2$, using an $O(a)$-improved fermion action. Fitting the
resulting chirally-extrapolated Isgur-Wise function to Stech's
relativistic-oscillator parametrization, we obtain a slope parameter
$\rho^2=1.2^{+7}_{-3}$. We then use this result, in conjunction with heavy-quark
symmetry, to extract $V_{cb}$ from the experimentally measured $\bar B\to
D^*l\bar\nu$ differential decay width. We find
$|V_{cb}|\sqrt{\tau_B/1.48\,\mathrm{ps}}= 0.038^{+2}_{-2}{}^{+8}_{-3}$, where the first set
of errors is due to experimental uncertainties, while the second is due to the
uncertainty in our lattice determination of $\rho^2$.
Authors' comments: 11 postscript pages + 3 postscript figures, all in one uuencoded,
compressed, tar file. This is the published version (Phys. Rev. Lett. 72, pp.
462-465 (1994)). The lattice data were partially re-analyzed and our final
results for the slope and Vcb differ from those of our first submission. The
text was also trimmed slightly to fit page requirements
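
The role of the slope parameter in the fit described above can be illustrated numerically. The sketch below assumes the functional form commonly quoted for the relativistic-oscillator parametrization, $\xi(\omega)=\frac{2}{\omega+1}\exp[-(2\rho^2-1)\frac{\omega-1}{\omega+1}]$; the abstract itself does not state the formula, so this form and the function names are assumptions made for illustration only. With this form, $\xi(1)=1$ and $\xi'(1)=-\rho^2$.

```python
import math


def xi(w: float, rho2: float) -> float:
    """Assumed relativistic-oscillator form of the Isgur-Wise function.

    w is the velocity transfer (w = 1 at zero recoil); rho2 is the slope
    parameter. The formula is an assumption, not quoted from the abstract.
    """
    return 2.0 / (w + 1.0) * math.exp(-(2.0 * rho2 - 1.0) * (w - 1.0) / (w + 1.0))


def slope_at_zero_recoil(rho2: float, h: float = 1e-6) -> float:
    """Central-difference estimate of d(xi)/dw at w = 1."""
    return (xi(1.0 + h, rho2) - xi(1.0 - h, rho2)) / (2.0 * h)


if __name__ == "__main__":
    rho2 = 1.2  # central value quoted in the abstract
    print("xi(1)  =", xi(1.0, rho2))
    print("xi'(1) =", slope_at_zero_recoil(rho2))
```

The normalization at zero recoil is built into the form, so the fit has a single free parameter, $\rho^2$.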
Adam F. Falk, Michael Luke, Mark B. Wise
We reconsider the recent derivation by de Rafael and Taron of bounds on the
slope of the Isgur-Wise function. We argue that one must be careful to include
cuts starting below the heavy meson pair production threshold, arising from
heavy quark-antiquark bound states, and that if such cuts are properly
accounted for then no constraints may be derived.
Authors' comments: 8 pages, uses harvmac, SLAC-PUB-5956, UCSD/PTH 92-35, CALT-68-1830
Jeffrey E. Mandula, Michael C. Ogilvie
We construct the Isgur-Wise limit of QCD in a form appropriate to lattice
gauge theory techniques. The formulation permits a calculation of heavy quark
processes even when the momentum transfers are much larger than the inverse
lattice spacing. Applications include semi-leptonic heavy quark decay and
scattering processes, including the computation of the nonperturbative part of
the Isgur-Wise universal function.
Authors' comments: Talk given at the 1992 International Lattice Gauge Theory Conference
("Lattice '92"), Amsterdam, 4 pages, in postscript
Kaustubh D. Dhole
Retrieval-Augmented Generation equips large language models with the
capability to retrieve external knowledge, thereby mitigating hallucinations by
incorporating information beyond the model's intrinsic abilities. However, most
prior works have focused on invoking retrieval deterministically, which makes
it unsuitable for tasks such as long-form question answering. Instead,
dynamically performing retrieval by invoking it only when the underlying LLM
lacks the required knowledge can be more efficient. In this context, we delve
deeper into the question, "To Retrieve or Not to Retrieve?" by exploring
multiple uncertainty detection methods. We evaluate these methods for the task
of long-form question answering, employing dynamic retrieval, and present our
comparisons. Our findings suggest that uncertainty detection metrics, such as
Degree Matrix Jaccard and Eccentricity, can reduce the number of retrieval
calls by almost half, with only a slight reduction in question-answering
accuracy.
Authors' comments: 1st workshop of "Quantify Uncertainty and Hallucination in Foundation
Models: The Next Frontier in Reliable AI" at ICLR 2025
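
The idea of retrieving only when the model appears uncertain can be sketched as follows. This is a deliberately simplified stand-in: it scores disagreement among several sampled answers by mean pairwise Jaccard similarity over word sets, and it is not an implementation of the Degree Matrix Jaccard or Eccentricity metrics named in the abstract. The function names and the threshold value are hypothetical.

```python
from itertools import combinations


def jaccard(a: str, b: str) -> float:
    """Jaccard similarity between the lowercase word sets of two answers."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta and not tb:
        return 1.0
    return len(ta & tb) / len(ta | tb)


def disagreement(samples: list[str]) -> float:
    """1 minus the mean pairwise Jaccard similarity of the sampled answers."""
    pairs = list(combinations(samples, 2))
    if not pairs:
        return 0.0  # a single sample gives no evidence of disagreement
    return 1.0 - sum(jaccard(a, b) for a, b in pairs) / len(pairs)


def should_retrieve(samples: list[str], threshold: float = 0.5) -> bool:
    """Trigger retrieval only when sampled answers disagree strongly.

    The threshold is an illustrative assumption, not a tuned value.
    """
    return disagreement(samples) > threshold


if __name__ == "__main__":
    consistent = ["paris is the capital", "paris is the capital"]
    scattered = ["it was 1912", "probably around 1899", "nobody knows"]
    print(should_retrieve(consistent), should_retrieve(scattered))
```

A retrieval call is then skipped whenever `should_retrieve` returns `False`, which is where savings in retrieval calls would come from.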
Thong Nguyen, Andrew Yates
Generative retrieval is a promising new neural retrieval paradigm that aims
to optimize the retrieval pipeline by performing both indexing and retrieval
with a single transformer model. However, this new paradigm faces challenges
with updating the index and scaling to large collections. In this paper, we
analyze two prominent variants of generative retrieval and show that they can
be conceptually viewed as bi-encoders for dense retrieval. Specifically, we
analytically demonstrate that the generative retrieval process can be
decomposed into dot products between query and document vectors, similar to
dense retrieval. This analysis leads us to propose a new variant of generative
retrieval, called Tied-Atomic, which addresses the updating and scaling issues
by incorporating techniques from dense retrieval. In experiments on two
datasets, NQ320k and the full MSMARCO, we confirm that this approach does not
reduce retrieval effectiveness while enabling the model to scale to large
collections.
Authors' comments: GenIR@SIGIR2023
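
The bi-encoder view described above can be illustrated with a toy example. With atomic document identifiers, the logit for each identifier is a dot product between a query-side vector and that identifier's output embedding, so ranking by logits amounts to dense retrieval. The vectors, identifiers, and function names below are hypothetical, and this is not the Tied-Atomic implementation.

```python
def dot(u: list[float], v: list[float]) -> float:
    """Plain dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def rank_atomic_docids(
    query_vec: list[float], docid_embeddings: dict[str, list[float]]
) -> list[tuple[str, float]]:
    """Rank document identifiers by their logit (query . docid embedding).

    query_vec stands in for the decoder's final hidden state; each entry of
    docid_embeddings stands in for one row of the output embedding matrix.
    """
    logits = {d: dot(query_vec, vec) for d, vec in docid_embeddings.items()}
    return sorted(logits.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    docs = {"d1": [0.9, 0.1], "d2": [0.1, 0.9], "d3": [0.5, 0.5]}
    print(rank_atomic_docids([1.0, 0.0], docs))
```

Because softmax is monotonic in the logits, the generation probabilities over identifiers give the same ranking as the raw dot products.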
Saeid Bahmanpour, Jameson Cahill, Peter G. Casazza, John Jasper, Lindsey M. Woodland
Phase retrieval has become a very active area of research. We will classify when phase retrieval by Parseval frames passes to the Naimark complement and when phase retrieval by projections passes to the orthogonal complements. We introduce a new concept we call norm retrieval and show that this is what is necessary for passing phase retrieval to complements. This leads to a detailed study of norm retrieval and its relationship to phase retrieval. One fundamental result: a frame $\{\varphi_i\}_{i=1}^M$ yields phase retrieval if and only if $\{T\varphi_i\}_{i=1}^M$ yields norm retrieval for every invertible operator $T$.
Izat Temiraliev, Diji Yang, Yi Zhang
To achieve general-purpose utility, we argue that robots must evolve from passive executors into active Information Retrieval users. In strictly zero-shot settings where no prior demonstrations exist, robots face a critical information gap, such as the exact sequence required to assemble a complex furniture kit, that cannot be satisfied by internal parametric knowledge (common sense) or past internal memory. While recent robotic works attempt to use search before action, they primarily focus on retrieving past kinematic trajectories (analogous to searching internal memory) or text-based safety rules (searching for constraints). These approaches fail to address the core information need of active task construction: acquiring unseen procedural knowledge from external, unstructured documentation. In this paper, we define the paradigm as Retrieval-Augmented Robotics (RAR), empowering the robot with the information-seeking capability that bridges the gap between visual documentation and physical actuation. We formulate the task execution as an iterative Retrieve-Reason-Act loop: the robot or embodied agent actively retrieves relevant visual procedural manuals from an unstructured corpus, grounds the abstract 2D diagrams to 3D physical parts via cross-modal alignment, and synthesizes executable plans. We validate this paradigm on a challenging long-horizon assembly benchmark. Our experiments demonstrate that grounding robotic planning in retrieved visual documents significantly outperforms baselines relying on zero-shot reasoning or few-shot example retrieval. This work establishes the basis of RAR, extending the scope of Information Retrieval from answering user queries to driving embodied physical actions.
Jongho Kim, Jaeyoung Kim, Seung-won Hwang, Jihyuk Kim, Yu Jin Kim, Moontae Lee
We study how to leverage adaptive retrieval so that sufficient "bridge" documents are retrieved for reasoning-intensive retrieval. Bridge documents are those that contribute to the reasoning process yet are not directly relevant to the initial query. While existing reasoning-based reranker pipelines attempt to surface these documents in ranking, they suffer from bounded recall. Naively adding adaptive retrieval to these pipelines often leads to planning error propagation. To address this, we propose REPAIR, a framework that bridges this gap by repurposing reasoning plans as dense feedback signals for adaptive retrieval. Our key distinction is enabling mid-course correction during reranking through selective adaptive retrieval, retrieving documents that support the pivotal plan. Experimental results on reasoning-intensive retrieval and complex QA tasks demonstrate that our method outperforms existing baselines by 5.6 percentage points.
Yash Saxena, Manas Gaur
Retrieval Augmented Generation (RAG) has made significant strides in overcoming key limitations of large language models, such as hallucination, lack of contextual grounding, and issues with transparency. However, traditional RAG systems consist of three interconnected neural components - the retriever, re-ranker, and generator - whose internal reasoning processes remain opaque. This lack of transparency complicates interpretability, hinders debugging efforts, and erodes trust, especially in high-stakes domains where clear decision-making is essential. To address these challenges, we introduce the concept of Neurosymbolic RAG, which integrates symbolic reasoning using a knowledge graph with neural retrieval techniques. This new framework aims to answer two primary questions: (a) Can retrievers provide a clear and interpretable basis for document selection? (b) Can symbolic knowledge enhance the clarity of the retrieval process? We propose three methods to improve this integration. The first, MAR (Knowledge Modulation Aligned Retrieval), employs modulation networks to refine query embeddings using interpretable symbolic features, thereby making document matching more explicit. The second, KG-Path RAG, enhances queries by traversing knowledge graphs to improve overall retrieval quality and interpretability. Lastly, Process Knowledge-infused RAG utilizes domain-specific tools to reorder retrieved content based on validated workflows. Preliminary results from mental health risk assessment tasks indicate that this neurosymbolic approach enhances both transparency and overall performance.
Authors' comments: 8 pages, 2 Figures, To Appear in IEEE Intelligent Systems
Pranav Jadhav
Patient cohort retrieval is a pivotal task in medical research and clinical practice, enabling the identification of specific patient groups from extensive electronic health records (EHRs). In this work, we address the challenge of cohort retrieval in the echocardiography domain by applying Dense Passage Retrieval (DPR), a prominent methodology in semantic search. We propose a systematic approach to transform an unstructured echocardiographic EHR dataset into a Query-Passage dataset, framing the problem as a Cohort Retrieval task. Additionally, we design and implement evaluation metrics inspired by real-world clinical scenarios to rigorously test the models across diverse retrieval tasks. Furthermore, we present a custom-trained DPR embedding model that demonstrates superior performance compared to traditional and off-the-shelf SOTA methods. To our knowledge, this is the first work to apply DPR for patient cohort retrieval in the echocardiography domain, establishing a framework that can be adapted to other medical domains.
Manthankumar Solanki
Textual data question answering has gained significant attention due to its
growing applicability. Recently, a novel approach leveraging the
Retrieval-Augmented Generation (RAG) method was introduced, utilizing the
Prize-Collecting Steiner Tree (PCST) optimization for sub-graph construction.
However, this method focused solely on node attributes, leading to incomplete
contextual understanding. In this paper, we propose an enhanced approach that
replaces the PCST method with an attention-based sub-graph construction
technique, enabling more efficient and context-aware retrieval. Additionally,
we encode both node and edge attributes, leading to richer graph
representations. Our method also incorporates an improved projection layer and
multi-head attention pooling for better alignment with Large Language Models
(LLMs). Experimental evaluations on the WebQSP dataset demonstrate that our
approach is competitive and achieves marginally better results compared to the
original method, underscoring its potential for more accurate question
answering.
Authors' comments: Extended version of a paper presented at NeurIPS 2024
(arXiv:2402.07630)
Qinyuan Cheng, Xiaonan Li, Shimin Li, Qin Zhu, Zhangyue Yin, Yunfan Shao, Linyang Li, Tianxiang Sun et al.
In Retrieval-Augmented Generation (RAG), retrieval is not always helpful and
applying it to every instruction is sub-optimal. Therefore, determining whether
to retrieve is crucial for RAG, which is usually referred to as Active
Retrieval. However, existing active retrieval methods face two challenges: 1.
They usually rely on a single criterion, which struggles with handling various
types of instructions. 2. They depend on specialized and highly differentiated
procedures, and thus combining them makes the RAG system more complicated and
leads to higher response latency. To address these challenges, we propose
Unified Active Retrieval (UAR). UAR contains four orthogonal criteria and casts
them into plug-and-play classification tasks, which achieves multifaceted
retrieval timing judgements with negligible extra inference cost. We further
introduce the Unified Active Retrieval Criteria (UAR-Criteria), designed to
process diverse active retrieval scenarios through a standardized procedure.
Experiments on four representative types of user instructions show that UAR
significantly outperforms existing work on the retrieval timing judgement and
the performance of downstream tasks, which shows the effectiveness of UAR and
its helpfulness to downstream tasks.
Authors' comments: Accepted to Findings of EMNLP 2024, camera-ready version
Alireza Salemi, Hamed Zamani
Evaluating retrieval-augmented generation (RAG) presents challenges, particularly for retrieval models within these systems. Traditional end-to-end evaluation methods are computationally expensive. Furthermore, evaluation of the retrieval model's performance based on query-document relevance labels shows a small correlation with the RAG system's downstream performance. We propose a novel evaluation approach, eRAG, where each document in the retrieval list is individually utilized by the large language model within the RAG system. The output generated for each document is then evaluated based on the downstream task ground truth labels. In this manner, the downstream performance for each document serves as its relevance label. We employ various downstream task metrics to obtain document-level annotations and aggregate them using set-based or ranking metrics. Extensive experiments on a wide range of datasets demonstrate that eRAG achieves a higher correlation with downstream RAG performance compared to baseline methods, with improvements in Kendall's $\tau$ correlation ranging from 0.168 to 0.494. Additionally, eRAG offers significant computational advantages, improving runtime and consuming up to 50 times less GPU memory than end-to-end evaluation.
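
The per-document evaluation described above can be sketched in a few lines. Each retrieved document is passed to the generator on its own, the output is scored against the downstream ground truth, and that score becomes the document's relevance label; the labels are then aggregated with a ranking or set-based metric. The `generate` and `metric` callables, the toy data, and the choice of precision at k are assumptions for illustration, not the authors' code.

```python
from typing import Callable, Sequence


def erag_document_labels(
    query: str,
    documents: Sequence[str],
    generate: Callable[[str, str], str],
    metric: Callable[[str, str], float],
    ground_truth: str,
) -> list[float]:
    """Label each document by the downstream score of its solo generation."""
    return [metric(generate(query, doc), ground_truth) for doc in documents]


def precision_at_k(labels: Sequence[float], k: int) -> float:
    """Mean label over the top-k positions of the retrieval list."""
    top = labels[:k]
    return sum(top) / len(top) if top else 0.0


# Toy stand-ins for the language model and the downstream metric (hypothetical).
def toy_generate(query: str, doc: str) -> str:
    return "Paris" if "Paris" in doc else "unknown"


def exact_match(output: str, truth: str) -> float:
    return 1.0 if output.strip().lower() == truth.strip().lower() else 0.0


if __name__ == "__main__":
    docs = [
        "Paris is the capital of France.",
        "Berlin is in Germany.",
        "The Eiffel Tower is in Paris.",
    ]
    labels = erag_document_labels(
        "What is the capital of France?", docs, toy_generate, exact_match, "Paris"
    )
    print(labels, precision_at_k(labels, 2))
```

Since each call sees only one document, the generator's inputs stay short, which is consistent with the lower memory use the abstract reports.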
Shiguang Wu, Wenda Wei, Mengqi Zhang, Zhumin Chen, Jun Ma, Zhaochun Ren, Maarten de Rijke, Pengjie Ren
Generative retrieval generates identifiers of relevant documents in an
end-to-end manner using a sequence-to-sequence architecture for a given query.
The relation between generative retrieval and other retrieval methods,
especially those based on matching within dense retrieval models, is not yet
fully comprehended. Prior work has demonstrated that generative retrieval with
atomic identifiers is equivalent to single-vector dense retrieval. Accordingly,
generative retrieval exhibits behavior analogous to hierarchical search within
a tree index in dense retrieval when using hierarchical semantic identifiers.
However, prior work focuses solely on the retrieval stage without considering
the deep interactions within the decoder of generative retrieval.
In this paper, we fill this gap by demonstrating that generative retrieval
and multi-vector dense retrieval share the same framework for measuring the
relevance to a query of a document. Specifically, we examine the attention
layer and prediction head of generative retrieval, revealing that generative
retrieval can be understood as a special case of multi-vector dense retrieval.
Both methods compute relevance as a sum of products of query and document
vectors and an alignment matrix. We then explore how generative retrieval
applies this framework, employing distinct strategies for computing document
token vectors and the alignment matrix. We have conducted experiments to verify
our conclusions and show that both paradigms exhibit commonalities of term
matching in their alignment matrix.
Authors' comments: 12 pages, 5 figures, 8 tables, accepted at SIGIR 2024
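
The shared relevance framework described above, a sum of query-document vector products weighted by an alignment matrix, can be written as $\mathrm{rel}(q,d)=\sum_{i,j}A_{ij}\,q_i\cdot d_j$. The sketch below instantiates it with a MaxSim-style alignment (each query token aligned to its best-matching document token), which is the familiar multi-vector dense retrieval choice; the alignment that generative retrieval induces is not reproduced here, and all names and vectors are hypothetical.

```python
def dot(u: list[float], v: list[float]) -> float:
    """Plain dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def relevance(
    query_vecs: list[list[float]],
    doc_vecs: list[list[float]],
    alignment: list[list[float]],
) -> float:
    """rel(q, d) = sum over i, j of A[i][j] * (q_i . d_j)."""
    return sum(
        alignment[i][j] * dot(q, d)
        for i, q in enumerate(query_vecs)
        for j, d in enumerate(doc_vecs)
    )


def maxsim_alignment(
    query_vecs: list[list[float]], doc_vecs: list[list[float]]
) -> list[list[float]]:
    """One-hot alignment: each query token picks its highest-scoring doc token."""
    rows = []
    for q in query_vecs:
        sims = [dot(q, d) for d in doc_vecs]
        best = max(range(len(sims)), key=sims.__getitem__)  # first maximum wins ties
        rows.append([1.0 if j == best else 0.0 for j in range(len(doc_vecs))])
    return rows


if __name__ == "__main__":
    Q = [[1.0, 0.0], [0.0, 1.0]]
    D = [[1.0, 0.0], [0.0, 2.0], [1.0, 1.0]]
    A = maxsim_alignment(Q, D)
    print(A, relevance(Q, D, A))
```

Swapping in a different alignment matrix, for example a dense one, changes the scoring rule without changing the `relevance` function, which is the sense in which the two paradigms share one framework.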