Weeki — Data & AI Solutions

Probabilistic Information Retrieval & LM-based IR

Probabilistic models of IR: BM25, language models, and relevance models for document retrieval.

Retrieval & RAG Theory · 4h · Advanced · English

Learning-to-Rank

Consistency and calibration of learning-to-rank: pairwise and listwise losses, and surrogate analysis.

Retrieval & RAG Theory · 4h · Advanced · English

Dense vs Sparse Retrieval

Theory of dense and sparse neural retrieval: representation, training, and fusion strategies.

Retrieval & RAG Theory · 4h · Advanced · English

Metric Learning & Approximate Nearest Neighbor

Theory of metric learning losses and ANN data structures for embedding-based retrieval.

Retrieval & RAG Theory · 4h · Advanced · English

RAG Error Decomposition & Performance Bounds

Analyze RAG system errors: retrieval failures, generation hallucinations, and end-to-end performance bounds.

Retrieval & RAG Theory · 4h · Advanced · English

Evaluation Theory in IR/NLP

Rigorous evaluation methodology: inter-annotator agreement, statistical testing, and replicability in IR/NLP.

Retrieval & RAG Theory · 4h · Advanced · English

Tokenization & Subword Models

Information-theoretic analysis of tokenization: BPE, Unigram, and their impact on downstream performance.

Retrieval & RAG Theory · 3h · Advanced · English

Fact Verification & Hallucination Testing

Methods for automated fact checking, hallucination detection, and faithfulness evaluation in LLMs.

Retrieval & RAG Theory · 4h · Advanced · English

Document Structure as Graphs

Model document structure (sections, tables, references) as graphs for improved document understanding.

Retrieval & RAG Theory · 4h · Advanced · English

Provenance & Verifiable Retrieval

Track and verify the provenance of retrieved information for trustworthy RAG systems.

Retrieval & RAG Theory · 4h · Advanced · English

Cross-Lingual Retrieval & Alignment Theory

Theory of cross-lingual information retrieval: multilingual embeddings, alignment, and zero-shot transfer.

Retrieval & RAG Theory · 4h · Advanced · English

Knowledge Editing & Consistency Constraints

Edit knowledge in language models while maintaining consistency: ROME, MEND, and constraint propagation.

Retrieval & RAG Theory · 4h · Advanced · English

Retrieval, RAG & Document AI