winety

joined 1 year ago
[โ€“] winety@lemmy.zip 3 points 1 day ago (1 children)

I think that the use of em-dashes specifically is a result of either the preprocessing of the training data or postprocessing of the generated text. I doubt that the material the models are trained on (i.e. Reddit) contains more em-dashes that hyphens in the position of sentence breaks.

But it definitely gets the use of dash as sentence break from people writing like that. If you ask ChatGPT in another language, whose users don't generally use dashes, e.g. Slovak, it won't use then as much.

[โ€“] winety@lemmy.zip 22 points 1 day ago (6 children)

There are dozens of us that know how to type en- and em-dashes! Dozens I say!