Research
Selected publications
Work published in venues including COLING, LREC-COLING, EMNLP Findings, ACM ICMR, PeerJ Computer Science, and RANLP, with recurring themes around text segmentation, long-context LLM behaviour, and multimodal analysis.
Efficient Solutions For An Intriguing Failure of LLMs
COLING 2025 · Hosseini, Castro, Ghinassi & Purver. Long context windows do not automatically guarantee flawless analysis of long sequences. pp. 1880–1891.
Recent Trends in Linear Text Segmentation: A Survey
Findings of EMNLP 2024 · Ghinassi, Wang, Newell & Purver. A comprehensive survey of recent developments in linear text segmentation. pp. 3084–3095.
When Cohesion Lies in the Embedding Space
LREC-COLING 2024 · Ghinassi, Wang, Newell & Purver. Embedding-based reference-free metrics for topic segmentation. pp. 17525–17536.
MoralBERT: A Fine-Tuned Language Model for Capturing Moral Values in Social Discussions
GoodIT 2024 · Preniqi, Ghinassi, Ive, Saitis & Kalimeri. pp. 433–442.
Automatic Detection of Moral Values in Music Lyrics
ISMIR 2024 · Preniqi, Ghinassi, Ive, Kalimeri & Saitis. pp. 164–172.
Language Pivoting from Parallel Corpora for Word Sense Disambiguation of Historical Languages
LREC-COLING 2024 · Ghinassi, Tedeschi, Marongiu, Navigli & McGillivray. Case study on Latin. pp. 10073–10084.
Efficient Aspect-Based Summarization of Climate Change Reports with Small Language Models
NLP4PI @ EMNLP 2024 · Ghinassi, Catalano & Colella. pp. 123–139.
Multimodal Topic Segmentation of Podcast Shows with Pre-trained Neural Encoders
ACM ICMR 2023 · Ghinassi, Wang, Newell & Purver. pp. 602–606.
Lessons Learnt from Linear Text Segmentation
RANLP 2023 · Ghinassi, Wang, Newell & Purver. A fair comparison of architectural and sentence encoding strategies. pp. 408–418.
Comparing Neural Sentence Encoders for Topic Segmentation Across Domains
PeerJ Computer Science 2023 · Ghinassi, Wang, Newell & Purver. Not your typical text similarity task.
Exploring Pre-Trained Neural Audio Representations for Audio Topic Segmentation
IEEE ICME · Ghinassi, Purver, Phan & Newell.
Unsupervised Text Segmentation via Deep Sentence Encoders
DataTV @ ACM IMX 2021 · Ghinassi. A first step towards a common framework for text-based segmentation, summarization and indexing of media content.