David Argüello Sánchez

Search Engineer at Empathy (EmpathyBroker). Author of an influential analysis of TF-IDF failure modes in e-commerce product search.

Key Contributions

  • TF-IDF in E-commerce: Showed with concrete examples (iPad, Polo) how term frequency (TF) and inverse document frequency (IDF) produce counterintuitive rankings in product catalogs. The standard IR assumptions that repetition signals relevance and rarity signals importance break down for short, structured product titles. Advocated disabling TF-IDF scoring by switching to boolean similarity, for cleaner, more tunable relevance.
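
The effect described above can be reproduced with a small sketch. The catalog titles below are invented for illustration and are not the author's actual examples. The scoring function is a simplified TF-IDF variant (square-root TF, logarithmic IDF, no length normalization), not the exact formula of any particular search engine. The names tfidf_score, boolean_score and rank are hypothetical helpers.

```python
import math
from collections import Counter

# Hypothetical catalog: titles invented for illustration only.
CATALOG = [
    "apple ipad 64gb wifi",
    "ipad case for ipad air and ipad pro",
    "polo ralph lauren polo shirt",
    "volkswagen polo floor mats",
]


def tokenize(text):
    """Lowercase whitespace tokenizer (an assumption; real analyzers do more)."""
    return text.lower().split()


def tfidf_score(query, title, catalog):
    """Simplified TF-IDF: sum of sqrt(tf) * idf over the query terms."""
    n_docs = len(catalog)
    counts = Counter(tokenize(title))
    score = 0.0
    for term in tokenize(query):
        tf = counts[term]
        if tf == 0:
            continue
        # Document frequency: number of titles containing the term.
        df = sum(1 for doc in catalog if term in tokenize(doc))
        idf = 1.0 + math.log(n_docs / (df + 1))
        score += math.sqrt(tf) * idf
    return score


def boolean_score(query, title):
    """Each matching query term contributes exactly 1, ignoring tf and df."""
    terms = set(tokenize(title))
    return float(sum(1 for term in tokenize(query) if term in terms))


def rank(query, catalog, scorer):
    """Return (score, title) pairs, best first; ties keep catalog order."""
    scored = [(scorer(query, title), title) for title in catalog]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


if __name__ == "__main__":
    for score, title in rank("ipad", CATALOG, lambda q, t: tfidf_score(q, t, CATALOG)):
        print(f"tfidf   {score:.3f}  {title}")
    for score, title in rank("ipad", CATALOG, boolean_score):
        print(f"boolean {score:.3f}  {title}")
```

Under the TF-IDF variant, the accessory title that repeats "ipad" three times outscores the tablet itself for the query "ipad". Under boolean scoring both titles tie on text match, which leaves the final order to explicit signals such as field boosts or business rules.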