The Dataset

Open-access scholarly titles reformulated across three controlled styles

Original research titles are collected from open-access scholarly repositories and processed through a structured LLM-based reformulation pipeline to produce stylistically diverse variants.

Data Generation

Each original title is transformed into 30 reformulated variants, 10 in each of three styles:

Technical Titles

10 variants: formal, methodology-focused, domain-specific language targeting expert audiences.

Accessible Titles

10 variants: plain-language rewrites designed for broad comprehension without prior domain knowledge.

Catchy Titles

10 variants: creative, hook-driven titles using analogies and engaging language to spark curiosity.
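
The layout described above (one original title, three styles, 10 variants per style) could be represented roughly as in the sketch below. The released file format and field names are not specified here, so `TitleRecord`, `STYLES`, and the style keys are hypothetical names chosen for illustration only.

```python
from dataclasses import dataclass, field
from typing import Dict, List

# Assumed style keys; the actual release may name them differently.
STYLES = ("technical", "accessible", "catchy")
VARIANTS_PER_STYLE = 10  # stated in the dataset description


@dataclass
class TitleRecord:
    """Hypothetical container for one original title and its variants."""
    original: str
    variants: Dict[str, List[str]] = field(default_factory=dict)

    def total_variants(self) -> int:
        # Count variants across all styles present in the record.
        return sum(len(titles) for titles in self.variants.values())

    def is_complete(self) -> bool:
        # A record is complete when every style has exactly 10 variants.
        return all(
            len(self.variants.get(style, [])) == VARIANTS_PER_STYLE
            for style in STYLES
        )


# Placeholder strings stand in for real titles; no actual data is shown.
record = TitleRecord(
    original="<original title>",
    variants={
        style: [f"<{style} variant {i}>" for i in range(1, VARIANTS_PER_STYLE + 1)]
        for style in STYLES
    },
)
print(record.total_variants(), record.is_complete())
```

A check like `is_complete` is one way to confirm that a loaded record carries the full 3 x 10 set before using it.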

Data Availability

See Important Dates for the full release schedule.