I'm more than willing to forgive ChatGPT for wrong readings of Japanese names. E...

tkgally · on May 1, 2024

You're right about Japanese names in general, but current LLMs' mistakes can go far beyond the range of possible readings. I've seen errors on the level of 田中 (Tanaka) being rendered as Suzuki--in other words, one common name being replaced with another.

I'm sure this problem can be solved. The linked article suggests a promising approach--more and better Japanese data.

glandium · on May 1, 2024

Arbitrary example: 上川. It can be read as either うえかわ or かみかわ (or even other variants). Which one is it? Depends on where the family is originally from, I guess.

It's not only people's names, it's also place names. Example: https://ja.m.wikipedia.org/wiki/%E5%85%AB%E5%B9%A1