Text Segmentation With Two-level Transformer And Auxiliary Coherence Modeling
- Author(s):
- Glavas, Goran; Somasundaran, Swapna
- Patent Issued:
- Sep 05, 2023
- Patent Number:
- 11,748,571
- Source:
- ETS Patent
- Document Type:
- Patent
- Family ID:
- 87882534
- Subject/Key Words:
- Patent, Active Patent, Machine Learning, Text Processing, Text Coherence
Abstract
Data is received that encapsulates a document of text. The text is then segmented into a plurality of semantically coherent units using a coherence-aware text segmentation (CATS) machine learning model. Data is then provided that characterizes the segmenting. Related apparatus, systems, techniques and articles are also described.