Abstract

Recent studies have shown that language models pretrained and/or fine-tuned on randomly permuted sentences exhibit competitive performance on GLUE, putting into question the importance of word order information. Somewhat counter-intuitively, some of these studies also report that position embeddings appear to be crucial for models' good performance with shuffled text. We probe these language models for word order information and investigate what position embeddings learned from shuffled text encode, showing that these models retain a notion of word order information. We show this is in part due to a subtlety in how shuffling is implemented in previous work – before rather than after subword segmentation.
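To make the before/after-segmentation distinction concrete, here is a minimal sketch of the two shuffling schemes. It is an illustration only: the `segment` function below is a toy stand-in for a real subword tokenizer (e.g. BERT's WordPiece), not the authors' actual pipeline. Shuffling whole words before segmentation keeps each word's subwords contiguous and correctly ordered, which leaks local word-order information; shuffling after segmentation permutes the subword tokens themselves.

```python
import random

def segment(word):
    # Toy stand-in for a subword tokenizer (e.g. WordPiece):
    # split any word longer than 4 characters into two pieces.
    if len(word) <= 4:
        return [word]
    return [word[:4], "##" + word[4:]]

def shuffle_before_segmentation(sentence, rng):
    # Scheme reportedly used in prior work: permute whole words
    # first, then segment. Subwords of a word stay adjacent and
    # in order, so some word-internal order survives.
    words = sentence.split()
    rng.shuffle(words)
    return [piece for w in words for piece in segment(w)]

def shuffle_after_segmentation(sentence, rng):
    # Stricter scheme: segment first, then permute the subword
    # tokens themselves, destroying within-word order as well.
    pieces = [piece for w in sentence.split() for piece in segment(w)]
    rng.shuffle(pieces)
    return pieces

rng = random.Random(0)
s = "shuffling happens before segmentation"
print(shuffle_before_segmentation(s, rng))
print(shuffle_after_segmentation(s, rng))
```

Under the first scheme, a piece like `segm ##entation` always appears as an intact, ordered pair, so a model can still recover word identity and partial order from subword adjacency; under the second, even that signal is removed.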