Average relative improvement over Guided Dual on idea-level novelty.
Sparking Scientific Creativity via
LLM-Driven Interdisciplinary Inspiration
Overview
Most AI-for-science systems optimize for fast solution generation. This paper argues that the more important missing layer is the reasoning process behind creative, interdisciplinary ideation. Idea-Catalyst is built for that earlier stage: helping researchers identify where a field is stuck, where adjacent ideas might transfer, and which cross-domain signals are actually worth exploring.
Average relative improvement over Guided Dual on takeaway-level insightfulness.
Dataset coverage spanning a broad set of target-domain research goals.
Extracted cross-domain takeaways used to seed new directions.
Recontextualized idea candidates produced across the benchmark.
How Idea-Catalyst Works
The framework is designed around a simple principle: interdisciplinary inspiration should be earned through structured analysis, not guessed through unconstrained retrieval. It operationalizes three metacognitive behaviors from the paper: clarifying research goals, recognizing unresolved challenges, and strategically exploring external domains with high impact potential.
Analyze the target field
Start with a concrete research goal, then unpack it into target-domain questions, current wins, and unresolved challenges.
Rewrite the challenge
Translate a domain-specific obstacle into a portable conceptual challenge that other disciplines might already know how to solve.
Rank cross-domain inspiration
Retrieve candidate domains, extract the most relevant takeaways, and rank them by interdisciplinary potential before integrating them back.
Hover a stage above to see what the system is producing, filtering, and carrying forward.
Figure 2: Idea-Catalyst pipeline
Results
Across both takeaway-level and idea-level evaluations, the method consistently beats retrieval-only baselines on the dimensions that matter for exploratory research: insightfulness and novelty. More diverse retrieval alone is not enough. What matters is selecting source domains that offer meaningful conceptual leverage for the target problem.
Highest insightfulness at every top-k
Idea-Catalyst reaches the strongest takeaway-level insightfulness scores across top-1, top-2, and top-3 comparisons, averaging a 16.22% relative improvement over Guided Dual.
Novelty improves without drifting off-task
On final integrated ideas, the paper reports a 21.38% average relative novelty gain over Guided Dual and substantially stronger performance than free-form source retrieval.
Decomposition and ranking both matter
Removing target-domain decomposition or replacing interdisciplinary potential-based ranking weakens performance, suggesting the gains come from structured reasoning rather than cosmetic rewriting.
Figure 3: Without guidance, ideation stays close to home
Figure 4: The right source domain depends on the challenge
Blindly choosing diverse domains is not enough. The strongest interdisciplinary ideas come from choosing the outside domains that best match the unresolved challenge.
Positive signal from real researchers
Six PhD researchers evaluated the system on their own problems. Generated research questions received a 4.00/5 average score, retrieved literature scored 3.50/5, source-domain takeaways were judged moderately relevant and insightful, and the final ideas were rated more novel than useful, reflecting the common gap between ideation and execution.
Appendix interface views used in the human study
Citation
If you find our paper to be useful, please cite it!
@misc{kargupta2026sparkingscientificcreativityllmdriven,
title = {Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration},
author = {Priyanka Kargupta and Shuhaib Mehri and Dilek Hakkani-Tur and Jiawei Han},
year = {2026},
eprint = {2603.12226},
archivePrefix = {arXiv},
primaryClass = {cs.CL},
url = {https://arxiv.org/abs/2603.12226}
}