Ok so there are still a few confusing things I found about the metadata rules and guidelines. I think these should be clarified to avoid confusion and pointless discussions.
First of all, it should be stated somewhere that the standardization rules, such as using feat. instead of ft., apply to both the Unicode and the romanised fields. This might seem obvious to some people, but it's still better to spell it out for everyone.
"Special unicode characters must be filtered to their nearest standard equivalent or removed from the Romanised fields within a .osu file. ★ ☆ ⚝ ✩ ✪ ✫ ✬ ✭ 🟉 🟊 ✮ ✯ ✰ and the likes are substituted to an asterisk. Other special characters are to be romanised or dropped on case-by-case basis."
It wouldn't hurt to add other common symbols that can be romanised instead of only listing a dozen stars, for example an interpunct ・ used to divide two words or groups (not a first name and surname, since in that case it would be omitted).
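To make the idea concrete, here is a rough Python sketch of such a substitution table. Only the star-to-asterisk mapping comes from the rule text; the function name `romanise_symbols` is made up, and turning a dividing interpunct into a plain space is purely my assumption of what an added entry could look like.

```python
# Star variants listed in the rule; all of them become an asterisk.
STARS = "★☆⚝✩✪✫✬✭🟉🟊✮✯✰"


def romanise_symbols(text: str) -> str:
    """Replace listed symbols with an ASCII equivalent (illustrative only)."""
    table = {ord(ch): "*" for ch in STARS}
    # Assumption, not an official rule: an interpunct that divides two
    # words or groups is replaced by a single space.
    table[ord("・")] = " "
    return text.translate(table)
```

Names written with an interpunct between first name and surname would need separate handling, since the interpunct is omitted there.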
"If a mapset track is composed of two or more songs, list the song titles clearly with a dividing symbol inbetween or use a title descriptive of its contents. If the title becomes too long as a result, a descriptive title must be used instead."
I think it would make more sense to replace "dividing symbol" with "slash" because that's what most maps use. It should also be stated that the slash needs a leading and trailing whitespace in this case for better readability, whereas normally spaces around a slash wouldn't be necessary.
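In code terms, the suggested convention is just a join with a spaced slash. A minimal Python sketch, where `join_titles` is a hypothetical helper and not part of any existing tool:

```python
def join_titles(titles: list[str]) -> str:
    """Join song titles with a slash that has a space on both sides."""
    # Strip stray whitespace first so the separator spacing stays uniform.
    return " / ".join(title.strip() for title in titles)
```

For example, `join_titles(["Song A", "Song B"])` gives `Song A / Song B`. Whether the result counts as "too long" is left out here because the rule does not give a limit.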
"If a symbol is used to group parts of a title, a whitespace must be used before and after the group, but not directly before or after the symbols within the groups."
This rule seems to contradict the definition of a whitespace in the glossary ("A visual spacing between characters, not always a literal space. Full-width characters do not require whitespaces."): the definition says full-width characters don't need a whitespace, while the rule says one is required. Which of the two applies depends on context, which is quite misleading, especially for full-width symbols such as brackets and slashes.
It also contradicts "Commas, vs., &, feat., CV:, etc. must include a trailing whitespace. If the marker is preceded by a word, a leading whitespace is also required, unless the marker is a comma." What if a closing parenthesis is followed by a comma? For example: Character1 (CV: Person1), Character2 (CV: Person2). Is a space required between them or not? And how does this work with a Unicode comma 、?
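For reference, this is how a literal reading of the quoted marker rule could be checked in Python. It is only a sketch: the marker list is taken from the quote (minus the "etc."), `missing_trailing_space` is a made-up name, and it deliberately does not handle the Unicode comma 、 or the leading-whitespace part, since those are exactly what is unclear.

```python
import re

# Markers named in the quoted rule; the "etc." is not covered.
MARKERS = [",", "vs.", "&", "feat.", "CV:"]


def missing_trailing_space(text: str) -> list[str]:
    """Return every marker occurrence that is not followed by whitespace."""
    pattern = "|".join(re.escape(marker) for marker in MARKERS)
    # A marker at the very end of the string is also reported.
    return [m.group(0) for m in re.finditer(rf"(?:{pattern})(?!\s)", text)]
```

Under this reading, the example above passes because the comma after the parenthesis has a trailing space and needs no leading one. Whether that is the intended interpretation is what should be clarified.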
"Single symbols should be romanised so that they have leading and trailing whitespaces, unless the symbol itself is not commonly requiring whitespaces in English. This may be ignored if the artist purposefully uses special characters that ignore their common usages."
It would be nice to clarify which symbols don't require whitespaces in English. Periods and commas are clear, but what about *, ~, +, -, / etc.?
Also, can we finally standardize the (Short Ver.) marker as well lol