I don't see any restrictions for this right now as long as they are related to the set. Also don't think that this worth specific mentioning.abraker wrote:
Any thoughts about mapping style or patterns the maps have being in tags?
I don't see any restrictions for this right now as long as they are related to the set. Also don't think that this worth specific mentioning.abraker wrote:
Any thoughts about mapping style or patterns the maps have being in tags?
To your previous post, if we discuss language, we will use terms related to languages and linguistics. I can't avoid that.Mishima Yurara wrote:
it makes absolutely no sense that the chinese do not have the higher priority when talking about chinese.. it is Quite Literally the language that they speak AND they are also... Quite Literally... the most Affected by the proposal regarding chinese metadata.. not sure how the priority of a group of people is parallel to a nonreasonable discussion either..
Where's the basis for that? It does make it harder for the reasons mentioned already. In both the proposal and several of these posts. How do Chinese people read it easier, if the text doesn't change for them at all?Mishima Yurara wrote:
it really doesnt make it any harder to read the title/artist with separated syllables so i dont see the harm in staying consistent with other platforms alongside making it easier for chinese people to..... read their own language L
Well I can understand the logic with this however I feel we should stick to the official metadata because, well, it's the official metadata. Seeing if a map is the Tv size/short version shouldn't be to hard for the majority of people. Since things like ~Anime-Ban~ , TV edit. , (short ver.) should make it pretty obvious its the short version of the song. I don't think we should add things like (TV Size) to songs from shows that don't officially differentiate the short version from the full version as it's not official.Shiguma wrote:
I believe that TV size cuts of songs should have the (TV Size) label on them, regardless of the official source. My reasoning for this is, when you search up a song, having the (TV Size) in the metadata won't affect searching for that song, while also making it very clear that whichever set you are looking at is the short version of a song. If we're bringing common sense into metadata, I don't see why we shouldn't do this.
Thirdly, to address the problems of grouping together romanized Chinese syllables into words. It is true that in grouping together syllables there is a lot of ambiguity, but much of that ambiguity should be able to be cleared in context. For instance, taking this charming example provided to us:Wafu wrote:
2. As for the memory point, again, you are considering this point from the Chinese speaker perspective. That's not the target group. As above, it's about how Latin script works with words. As you probably know, when people who use Latin script read longer words, they generally don't read them, they just recognize it by the shape of the word. Because of that, they will also miss minor spelling errors, because they read the originally intended word by the shape. That suggests (which is a fact by the way) that they memorize text (that is seemingly a word) much easier than syllables. As an example, you probably have the shape of "Romanisation" memorized pretty well. That means if I'd misspell it to "Ronamisation", you would quite likely not notice that. Whereas if I did "Ro Na Mi Sa Ti On", you would more likely notice the error, because you would read it syllable by syllable.
Context should be able to very easily clear up such ambiguities. What is the song about? What is the rest of the song saying? Context will provide an almost effortless resolution to such conclusions, which I imagine would comprise the vast majority of such instances.Hollow Wings wrote:
"Gu Niang, Shui Jiao Yi Wan Duo Shao Qian?"
this sentence mainly has two meanings:
1. "Hey gril, how much it costs if i buy a bowl of your dumplings?" (姑娘,水饺一碗多少钱?)
2. "Hey gril, how much it costs if i sleep you one night?" (姑娘,睡觉一晚多少钱?)
These will most likely make up such a negligible percentage of these instances of ambiguity that to go through with the proposed changes and deal with these intentionally ambiguous titles as they come up would not be completely remiss -- but I personally believe that even these hypothetical cases, however rare, should be considered before pushing any changes. That is just my opinion, ultimately it's not up to me.Fycho wrote:
For example, specific examples like "他谁都打不过", it's used intentionally to represent two meanings that are "Nobody can beat him" and "He can beat everybody", "Ta / Shui / Dou Da Bu Guo" and "Ta / Shui Dou Da Bu Guo".
iirc, this was one of the options we were considering. It could be worth discussing this option if people want to follow the standards. It solves some of the brought up issues. I doesn't solve the "v" issue, which should be fixed regardless of keeping or changing the current system as it has no basis other than keyboard layout.Nyquill wrote:
Alternatively, we can also implement this document here: https://www.loc.gov/catdir/cpso/romaniz ... hinese.pdf
The reason I mention this is that some maps in 8k mania tend to be either 8k or 7k+1 and it's impossible to know until you download and check them out. Putting down "SV" or "stream" or etc would also allow the added benefit to search for types of maps in any gamemode. I feel like something should be mentioned in guidelines.Sieg wrote:
I don't see any restrictions for this right now as long as they are related to the set. Also don't think that this worth specific mentioning.abraker wrote:
Any thoughts about mapping style or patterns the maps have being in tags?
Context after here is not holding new idea so I delete them in my post. But anyways, I'm completely not convinced how changes on Mandarin/Chinese metadata would help it be more practical. On the contrary, they're ignoring the general case of Chinese and making things even worse.melloe wrote:
Thirdly, to address the problems of grouping together romanized Chinese syllables into words. It is true that in grouping together syllables there is a lot of ambiguity, but much of that ambiguity should be able to be cleared in context. For instance, taking this charming example provided to us:Context should be able to very easily clear up such ambiguities. What is the song about? What is the rest of the song saying? Context will provide an almost effortless resolution to such conclusions, which I imagine would comprise the vast majority of such instances.Hollow Wings wrote:
"Gu Niang, Shui Jiao Yi Wan Duo Shao Qian?"
this sentence mainly has two meanings:
1. "Hey gril, how much it costs if i buy a bowl of your dumplings?" (姑娘,水饺一碗多少钱?)
2. "Hey gril, how much it costs if i sleep you one night?" (姑娘,睡觉一晚多少钱?)
However, some of those ambiguities will be purposely rendered in the form of puns etc., such as here:These will most likely make up such a negligible percentage of these instances of ambiguity that to go through with the proposed changes and deal with these intentionally ambiguous titles as they come up would not be completely remiss -- but I personally believe that even these hypothetical cases, however rare, should be considered before pushing any changes. That is just my opinion, ultimately it's not up to me.Fycho wrote:
For example, specific examples like "他谁都打不过", it's used intentionally to represent two meanings that are "Nobody can beat him" and "He can beat everybody", "Ta / Shui / Dou Da Bu Guo" and "Ta / Shui Dou Da Bu Guo".
In fact for many contemporary Chinese ballads, their titles are deliberately came up as such (in the form of puns). As for the first example given here, the song title can still be sexually suggestive even if its formal title is about dumplings. Because Chinese lyrics are not as logical as daily language,
and people just can easily get the ambiguous meaning because there is no way to distinguish their pronunciation difference in a song-wise tone without logical context. "Context will provide an almost effortless resolution to such conclusions" as you said is often not the truth. Joint of Chinese characters into a single word will often cause loss of meaning in this way. (I have more examples, one of which is my uploaded map)
Fourthly, about "v" vs "u." To Chinese speakers of course "v" makes the most sense, as that is the input they use in their everyday lives, but to the western audience, "v" will make absolutely no sense. "u" and "yu" are both inadequate romanizations of "ü," because "yu" will be pronounced "yoo" by most westerners, but "v" will be next to useless for everybody except for Chinese players. "v" is more ambitious in that it serves to correctly represent a specific sound instead of simply approximating it, but for western osu players it is completely counterproductive.
I'm not sure if you go through the HW's post thoroughly but there was an example given to prove that the change from "v" to "u" will result a worse case under certain conditions: “绿光” & “露光” will be both romanised in "Lu Guang" while their actual pronunciation are completely different. For non-Chinese speakers, I don't think it can be a better way either to pronounce it or to remember the title by any means. Ofc I understand that "v" has no connection with the actual pronunciation of "ü". I was also confused when I first used a keyboard to type Chinese. However, this is just a general knowledge for all Chinese users and Chinese learners. That's how we Chinese grow up. So even we may understand that "v" can be senseless in pronunciation manner,
I don't get why non-Chinese speakers have the advantage to ignore such knowledge (which is common to us) at all. When you want to memorize a title in a different language, accepting its small piece of rule/regularity (actually it's really small) is not demanding is it? In fact for the pronunciation of Japanese romanised way of "ra" (similarly, ri, ru, re, ro), the actual pronunciation is far from /ra/, but somehow similar to be in the middle of /ra/ and /la/. Personally I'd even say it sounds much closer to /la/ in general. But when you have to memorize it, you simply accept its setting of being forced "ra". That's the same thing.
Lastly, Chinese is generally referred to as logographic rather then ideographic, as a character represents a morphheme rather than a more nebulous concept, and as ideogram usually refers specifically to a symbol that is independent of any corresponding sound--although of course no logographic writing system is without a phonetic component built into it. The terms themselves are rather fuzzy anyways, so to achieve anything of actual accuracy one has to resort to such ungainly terms as HW's "ideophonographical." However, to call Chinese logographic is not incorrect. In fact, most people, even linguists, do it.
I don't know how you call Chinese logographic so steadily so I just want TRUE evidence. And I don't even want to read Wafu's post again because he was simply doing this once and once again without compelling support. Anyways, the most intuitive thoughts of the language Chinese is still ideographically, based on how we accept Chinese education for more than 12 years. Many words that combined by two or more characters are also generated by the joint of meanings of those characters together. For example, “未来”(future) can be split as “未”(not happening) and “来”(come). And the easy joint would be "has not come yet", which is the close meaning of "future". And the word “银行”(bank) can be split as “银”(silver, which is the general currency in ancient China) and “行” (an organization/commercial firm focusing on specific fields, pronounced as Hang). And it's obvious that the joint of those two meanings an organization/commercial firm focusing on money, which is bank.
The third example would be my own map https://osu.ppy.sh/s/598869 “花儿纳吉” (The actual correct pronunciation should be Hua Er Na Zei, which is different from normal Mandarin pronunciation Hua Er Na Ji). This title has no direct meaning from Mandarin as it's from minority Chinese language (Qiang language). The official meaning is "Being happy like a flower". However, the song title still has its similar meaning to the combination of Mandarin in Chinese culture , which was also part of intention by the song author: “花儿” is flower, “纳” is containing/accepting, “吉” is happiness. If being wrongly considered as logographic, the song title would be less valuable, which is what we cannot accept. There are just thousands of more examples so I have to stop here.
As a result, I completely don't understand why you guys keep trying to call Chinese logographic by any means. It's highly COUNTER-INTUITIVE. And in fact the change of combining characters is highly impractical (as you wanted to state below the opposite way) in this way, because simply consider each character as pronunciation (as logographic indicates) will result in MEANING LOSS and CULTURE LOSS, which is definitely a wrong way to approach to Chinese language.To the crux of the issue.
The real dichotomy here is between practicality and officiality/aesthetics. That is a highly subjective discussion and is conducive to many (as seen here) tetchy discussions. Grouping words together will almost certainly make it more convenient for non-Chinese speakers, there should really be no question about this. I personally don't even pay attention to the name of a Chinese map if it's over three or four characters long; the profusion of capitals and spacing, to my English-speaking mind, is simply inconvenient, and I would rather memorize the mapper's name, the artist's name, and the background instead. Japanese titles, meanwhile, are multisyllabic, and I would rather have a few multisyllabic words than six monosyllabic words. How closely we adhere to "ISO 7098" really should not be a question. We're a small international circle-clicking community, not an official international organization, so shouldn't we rather consider things from a functional, practical perspective?
Sorry but I just think the way of changing is even more impractical for the reason stated above
no, you don't understand, using nonsense metadata will fuck up all Chinese players and all Chinese learner players. Wtf are you considering making a nonsense international standard and let all the players think wtf is the osu meta?CrystilonZ wrote:
We've expressed (thoroughly I believe) what problems the current system has. Please read all the previous points made in this discussion.Shad0w1and wrote:
So let's face the reality, there isn't a standard for Chinese romanization into ANSI code. I can't understand that without a commonly accepted standard, why would you guys try to change the current metadata rule?
We illustrate many reasons why each Chinese character has its own meaning and such meta would be ignored when words are joint together. And you're just repeating "same thing", "it's not non-sense", "you're ignoring what I am saying". It may be useful for one time but not for many because people don't see you're supporting ideas. You are just repeating yourself.Wafu wrote:
1. First of all, you did use the ISO document as your argument, but you didn't even know that the citation about "ideophonograph" language was just confirming what CrystilonZ posted. You agree with ISO on the same thing that you disagree with CrystilonZ on. They state the same thing.
Another repeat. First of all, please provide good evidence/support to "That makes the language logographic" or you're just repeating your own words. That would lose the ground where you try to stand on. Secondly, when you accept a majority part of the language has "pictographic and ideographic features", you don't accept the fact that when combining words together such feature will be lost and it can be highly detrimental to the language meta. I just don't get it.Wafu wrote:
Yes, I agree with that point. Some Chinese characters indeed do use "pictographic and ideographic features". You even quoted me saying that. That doesn't make the language pictographic or ideographic, because even the characters with pictographic or ideographic features are logograms. That makes the language logographic. Why do you call something non-sense and then say the same thing?
The new system you are trying to bring out has huge difference than Romanisation we've had, as HW, Fycho and I illustrated lots of evidence and facts ("五环境内","他谁都打不过","花儿纳吉")So seeing how people work in the past doesn't mean you're qualified enough to judge romanisation of Chinese correctly. In fact I don't think any automatic system could handle such romanisation correctly. In these cases native Chinese speakers still have louder voice.Wafu wrote:
7. If you have problem with me comparing how osu! works for 2 different Romanisation, I think there's a different problem. Stop calling me ignorant if you ignore what I've even written in that paragraph. It's also not non-sense. I literally just say how people work. How can that be non-sense? That is an observation.
I've already explained in my previous post that using "u" will cause terrible ambiguity. You insist on pronunciation of "v" is different from what we want, while you ignore the fact that ALL OTHER alphabets have their own pronunciation function in Chinese pinyin. The "v" is the most convenient one when you have to find a new thing corresponding to "ü". And again, it's just a language setting. That's how it's used for decades. You non-Chinese speakers should not have the advantage to ignore that. After all it's not demanding for people to remember such a small thing if they want to approach to Chinese. Pronunciation of "v" should not become the barrier of knowing any of Chinese, or they're determined to fail to learn it anyways.Wafu wrote:
2. No, "v" doesn't work the best, it doesn't work at all because it has no linguistic basis. That doesn't mean "u" is the best, although we agreed that it generally won't make difference for a regular player, there are still many options that can be considered, but it can't be "v", and probably not "y", because that's associated with a different sound (even in other Romanisation systems we use). "u" pronounced in a certain way will result in the "ü" we are going for, it really is the core sound of it, I described this in the post 2 times already, so I guess I don't have to repeat myself.
Answer:Wafu wrote:
Third point, not sure why you are personally attacking me. How do you know what my education is, what my job is, what my real life is? You don't know any single thing about my personal life, so don't act like you do.
Wafu wrote:
but remember it works vice-versa.
Last but not least, insulting others in private message doesn't mean that you were polite at all. It only means you pretend to be polite but failed and you wanted to hide the fact that you were not. So please learn to stay calm and polite consistently.Wafu wrote:
Your false accusations (of us not reading stuff or not being professional) did, indeed, make me send you this message (and it is called exactly that: "Private message"). I'm not making fun of you as it was not public, you making it public doesn't mean I'm making fun of you. I wanted you to know that putting this down to "there's no research" was unfair of you, as you didn't invest your time into the research either. Was I being rude to you in the private message? Yes, as as you were when you clearly did, intentionally ridicule the proposal, except I at least could keep it private.
My question for Thai romanization is whether we should treat them similarly to how we are treating Chinese romanization which is to separate words with spaces, or if we should clump them together, for example: พระนางเจาพระบรมราชินีนาถ romanized as: Phranāng Čhao Phrabǭrommarāchinī Nāt or Phranāngchaophrabǭrommarāchinīnāt. Also, should we make all the separated words uppercase, or only the first? Since Thai chains everything together, there is no indicator for upper and lower case when we split the phrase up (if we do).Library of Congress wrote:
Romanization
1. Tonal marks are not romanized.
2. The symbol ฯ indicates omission and is shown in romanization by “ … ” the conventional sign for
ellipsis.
3. When the repeat symbol ๆ is used, the syllable is repeated in romanization.
4. The symbol ฯลฯ is romanized Ia.
5. Thai consonants are sometimes purely consonantal and sometimes followed by an inherent vowel
romanized o, a, or ǭ depending on the pronunciation as determined from an authoritative
dictionary, such as the Royal Institute's latest edition (1999).
6. Silent consonants, with their accompanying vowels, if any, are not romanized.
7. When the pronunciation requires one consonant to serve a double function – at the end of
one syllable and the beginning of the next – it is romanized twice according to the
respective values.
8. The numerals are: ๐ (0), ๑ (1), ๒ (2), ๓ (3), ๔ (4), ๕ (5), ๖ (6), ๗ (7), ๘ (8), and ๙ (9).
9. In Thai, words are not written separately. In romanization, however, text is divided into words
according to the guidelines provided in Word Division below.
i'm already pissed off since idea of "Wafu: 'You, as Chinese have no priority in this matter, just because it's about Chinese,' " came up.CrystilonZ wrote:
To be honest I'm very pissed off right now and you have no idea how hard it is for me to post in this calm manner.
You, as Chinese have no priority in this matter, just because it's about Chinese. I believe all people here are civilised people and civilised people argue with reason.Read more about this here <Argument from authority>
May I ask why you chose the McCune Reischauer system in particular? Some quick research reveals it is currently only used officially in North Korea, with "Revised Romanization of Korean" being the current official system used for South Korea. As uhhh.... 99% of korean songs , especially those mapped in osu, come from South Korea, it would make far more sense to use that instead, overall. This may need more input from those who are more familiar with Korean.Monstrata wrote:
Songs with Korean metadata must be romanised using the McCune-Reischauer system for romanizing Korean when there is no romanisation or translation information listed by a reputable source.
could you please cite said documents that you mention throughout your post by linking them in case anyone wants to review it for themselves.Monstrata wrote:
Another language to examine is Thai. The Library of Congress recommends nine additional rules for Thai romanization which are:
No, roman characters are written from left to right. It would obscure the meaning and reading to write them in reverse order. Japanese can also be written from right to left, albeit in vertical lines instead of horizontal, do we romanise it backwards? no.Monstrata wrote:
Another problem with Arabic is that it is typed in reverse, right to left. Should we also apply this to romanization?
It's not the same. R and L are pronounced almost the same way across most phonetics. V and u are way different since V is a consonant.Fycho wrote:
If saying "v" couldn't be readed by foreigners and makes misconception, then we probably need to rework the Japanese rule as ra / ri / ru / re / ro are actually pronounced as la / li / lu / le / lo in Japanese, which is kinda unfriendly towards those latin scripts users who don't know Japanese. English speakers will pronounce "ra" differently from how it's supposed to be pronounced in Japanese.
As all of us known, Modified Hepburn(Japan gov uses Kunrei) and Pinyin(China gov uses Pinyin) system are the international standard systems, people who learn Chinese will start as pinyin, and when they start learning input lately, they will know "v", and "v" is the most familiar and well-known letter for Chinese speakers and leaners. "u" messes up with the vowel "u", and "yu" would be pronounced as "yoo" or other wrong pronunciation by most English speakers. Both are not inadequate for representing "ü". If anyone has better choice than "v", feel free to advise. Otherwise we would keep the "v" for "ü".
Even so, it's also the fact that few westerners pronounce it ever correctly in tournament commentary or any other similar situations. If "l" is indicates the actual pronunciation much better, "it should be changed to reflect the pronunciation correctly to be friendly to non-Japanese speakers" (as how pronunciation theory goes without taking care about any other factors)Monstrata wrote:
It's not the same. R and L are pronounced almost the same way across most phonetics. V and u are way different since V is a consonant.
Ask yourself, how would you pronounce ü using english phonetics. The answer should not be "v" because that's a voiced labiodental fricative. Not a vowel.
I'd love to see this one got added! But maybe we can ask mapper to provide the whole song name in a specific area? For example, creator's words or so. Otherwise we just cannot tell what song it is exactlyProposal wrote:
If the artist or title field exceeds the uploadable maximum length, or both together cause Windows filenames for the .osu files to exceed 255 characters, any additional markers from the fields causing this have to be dropped consistently and if this is still not sufficient, the corresponding fields need to be abbreviated reasonably and end in ... to signal that this song title has been shortened.
Context after here is not holding new idea so I delete them in my post. But anyways, I'm completely not convinced how changes on Mandarin/Chinese metadata would help it be more practical. On the contrary, they're ignoring the general case of Chinese and making things even worse.melloe wrote:
Thirdly, to address the problems of grouping together romanized Chinese syllables into words. It is true that in grouping together syllables there is a lot of ambiguity, but much of that ambiguity should be able to be cleared in context. For instance, taking this charming example provided to us:Context should be able to very easily clear up such ambiguities. What is the song about? What is the rest of the song saying? Context will provide an almost effortless resolution to such conclusions, which I imagine would comprise the vast majority of such instances.Hollow Wings wrote:
"Gu Niang, Shui Jiao Yi Wan Duo Shao Qian?"
this sentence mainly has two meanings:
1. "Hey gril, how much it costs if i buy a bowl of your dumplings?" (姑娘,水饺一碗多少钱?)
2. "Hey gril, how much it costs if i sleep you one night?" (姑娘,睡觉一晚多少钱?)
However, some of those ambiguities will be purposely rendered in the form of puns etc., such as here:These will most likely make up such a negligible percentage of these instances of ambiguity that to go through with the proposed changes and deal with these intentionally ambiguous titles as they come up would not be completely remiss -- but I personally believe that even these hypothetical cases, however rare, should be considered before pushing any changes. That is just my opinion, ultimately it's not up to me.Fycho wrote:
For example, specific examples like "他谁都打不过", it's used intentionally to represent two meanings that are "Nobody can beat him" and "He can beat everybody", "Ta / Shui / Dou Da Bu Guo" and "Ta / Shui Dou Da Bu Guo".
In fact for many contemporary Chinese ballads, their titles are deliberately came up as such (in the form of puns). As for the first example given here, the song title can still be sexually suggestive even if its formal title is about dumplings. Because Chinese lyrics are not as logical as daily language,
and people just can easily get the ambiguous meaning because there is no way to distinguish their pronunciation difference in a song-wise tone without logical context. "Context will provide an almost effortless resolution to such conclusions" as you said is often not the truth. Joint of Chinese characters into a single word will often cause loss of meaning in this way. (I have more examples, one of which is my uploaded map)
I think you must misunderstand me, because I am pretty much in agreement with you on this. As for the first example, if the song content itself is not sexual in nature then naturally the song title should be not sexually suggestive. However, I have already said that puns are intentional ambiguities that cannot be resolved by any combination of word groupings, and that that is one reason to not go through with the proposed changes, which is actually what you said--we are in agreement on this. Let me clarify, I am not flat out taking a single stance with every paragraph I type, I am just offering my own opinion on different matters regarding this topic, sometimes in favor of the proposal, sometimes against.
Fourthly, about "v" vs "u." To Chinese speakers of course "v" makes the most sense, as that is the input they use in their everyday lives, but to the western audience, "v" will make absolutely no sense. "u" and "yu" are both inadequate romanizations of "ü," because "yu" will be pronounced "yoo" by most westerners, but "v" will be next to useless for everybody except for Chinese players. "v" is more ambitious in that it serves to correctly represent a specific sound instead of simply approximating it, but for western osu players it is completely counterproductive.
I'm not sure if you go through the HW's post thoroughly but there was an example given to prove that the change from "v" to "u" will result a worse case under certain conditions: “绿光” & “露光” will be both romanised in "Lu Guang" while their actual pronunciation are completely different. For non-Chinese speakers, I don't think it can be a better way either to pronounce it or to remember the title by any means. Ofc I understand that "v" has no connection with the actual pronunciation of "ü". I was also confused when I first used a keyboard to type Chinese. However, this is just a general knowledge for all Chinese users and Chinese learners. That's how we Chinese grow up. So even we may understand that "v" can be senseless in pronunciation manner,
I don't get why non-Chinese speakers have the advantage to ignore such knowledge (which is common to us) at all. When you want to memorize a title in a different language, accepting its small piece of rule/regularity (actually it's really small) is not demanding is it? In fact for the pronunciation of Japanese romanised way of "ra" (similarly, ri, ru, re, ro), the actual pronunciation is far from /ra/, but somehow similar to be in the middle of /ra/ and /la/. Personally I'd even say it sounds much closer to /la/ in general. But when you have to memorize it, you simply accept its setting of being forced "ra". That's the same thing.
Again I am offering a two-sided view, so that anyone who reads my post (if they don't want to slough through the other very, very long posts) can have multiple perspectives to build their own opinion off of, perhaps I should have made that more clear. For proposal: v makes no sense to westerners. Against proposal: u is not the same as ü, so "v" is a better choice if we want to be exactly precise about that particular vowel instead of just approximating it with "u," the tradeoff being of course that westerners will be confused.
Non-Chinese will not even know to look up the usage of "v" at all, unless you go around and tell everyone, they will just accept it as a strange aspect of the Chinese language and continue pronouncing it "el vee" or "lvvv" or "liv," instead of actually pronouncing it "lü."
In Japanese the IPA notation /ɾ/ as in ra ri ru re ro in English is simply marked as "r" instead of getting its own special character--in other words, we are approximating the pronunciation. And I suspect many non-Japanese have not even memorized that fact, and if they know that "ra" is actually /ɾa/ it's only because they have heard it from anime or something. If we were to do the same for Chinese, we would be again be approximating the ü sound by labeling it "u."
Lastly, Chinese is generally referred to as logographic rather then ideographic, as a character represents a morphheme rather than a more nebulous concept, and as ideogram usually refers specifically to a symbol that is independent of any corresponding sound--although of course no logographic writing system is without a phonetic component built into it. The terms themselves are rather fuzzy anyways, so to achieve anything of actual accuracy one has to resort to such ungainly terms as HW's "ideophonographical." However, to call Chinese logographic is not incorrect. In fact, most people, even linguists, do it.
I don't know how you call Chinese logographic so steadily so I just want TRUE evidence. And I don't even want to read Wafu's post again because he was simply doing this once and once again without compelling support. Anyways, the most intuitive thoughts of the language Chinese is still ideographically, based on how we accept Chinese education for more than 12 years. Many words that combined by two or more characters are also generated by the joint of meanings of those characters together. For example, “未来”(future) can be split as “未”(not happening) and “来”(come). And the easy joint would be "has not come yet", which is the close meaning of "future". And the word “银行”(bank) can be split as “银”(silver, which is the general currency in ancient China) and “行” (an organization/commercial firm focusing on specific fields, pronounced as Hang). And it's obvious that the joint of those two meanings an organization/commercial firm focusing on money, which is bank.
Do the words ideograph and logograph mean something to you that they don't to other people? You seem quite adamant on this. I only mentioned this matter as a very unimportant one, which is why I put it last. The terms ideograph and logograph themselves are very nebulous, ill-defined, any usage of either will not be very accurate unless you append extra stuff to make something long and complicated like "ideophonographical." In my experience, an ideogram is something that represent a particular idea or concept. 上 would be an ideograph, and Chinese for 1 2 3 would be ideograph. However, 的 is not really a concrete concept, it's more of a morpheme (the smallest unit of meaning in a language), like the english -ly or -ing. Concepts, too, can be morphemes, so the label "logogram" cover both complete concepts as well as morphemes. However, this topic is ultimately really quite trivial and not all that important to the discussion at hand, so if the words "logogram" and "ideogram" have a definition to you that I'm not aware of and it's extremely importanat to you that Chinese be called "ideographic," I'm more than happy to comply.
The third example would be my own map https://osu.ppy.sh/s/598869 “花儿纳吉” (The actual correct pronunciation should be Hua Er Na Zei, which is different from normal Mandarin pronunciation Hua Er Na Ji). This title has no direct meaning from Mandarin as it's from minority Chinese language (Qiang language). The official meaning is "Being happy like a flower". However, the song title still has its similar meaning to the combination of Mandarin in Chinese culture , which was also part of intention by the song author: “花儿” is flower, “纳” is containing/accepting, “吉” is happiness. If being wrongly considered as logographic, the song title would be less valuable, which is what we cannot accept. There are just thousands of more examples so I have to stop here.
As a result, I completely don't understand why you guys keep trying to call Chinese logographic by any means. It's highly COUNTER-INTUITIVE. And in fact the change of combining characters is highly impractical (as you wanted to state below the opposite way) in this way, because simply consider each character as pronunciation (as logographic indicates) will result in MEANING LOSS and CULTURE LOSS, which is definitely a wrong way to approach to Chinese language.
Again, with how adamant you are on this topic of logo/ideograph--does ideographic and logographic have a meaning I'm not aware of? I don't have any agenda in calling Chinese logographic, that's just what it is according to the definition of ideo/logographic that I have. Maybe you have a different or more correct definition, in which case feel free to teach me.
If we are to talk about practicality, then you have to consider perspectives from both sides. Impractical to whom? You have to remember that many non-Chinese don't speak Chinese and don't know anything about Chinese meaning or culture to begin with, so joining words will not result in any meaning or culture loss for them. The only difference for them is that it will be easier to remember. Of course it will be impractical to Chinese speakers, but for English or other non-Chinese speakers it will be easier to remember song titles (I have talked about this earlier, and included relevant quote), this is what I mean by practical.To the crux of the issue.
The real dichotomy here is between practicality and officiality/aesthetics. That is a highly subjective discussion and is conducive to many (as seen here) tetchy discussions. Grouping words together will almost certainly make it more convenient for non-Chinese speakers, there should really be no question about this. I personally don't even pay attention to the name of a Chinese map if it's over three or four characters long; the profusion of capitals and spacing, to my English-speaking mind, is simply inconvenient, and I would rather memorize the mapper's name, the artist's name, and the background instead. Japanese titles, meanwhile, are multisyllabic, and I would rather have a few multisyllabic words than six monosyllabic words. How closely we adhere to "ISO 7098" really should not be a question. We're a small international circle-clicking community, not an official international organization, so shouldn't we rather consider things from a functional, practical perspective?
Sorry but I just think the way of changing is even more impractical for the reason stated above
Yes, it will be impractical for you as a Chinese speaker, but I was talking about practicality for the many non-Chinese people who play this game. Let me bring in what you said later to Wafu and address that point:Just an observation: You're trying to prove that it's easier to memorize and to pronounce for non-Chinese speakers, but we have tried super hard to prove that you haven't gone through Chinese and there are tons of fact that counters your idea of making the romanised result to audience easier and better. But I don't see your reaction of ever acknowledging that, which is very disappointing in a discussion.I'm a native English speaker, and even after having learned some Chinese for some years as a child, it is still much easier for me to remember Chinese titles if they are grouped together into longer bundles that, visually speaking, more closely resemble "words" such as one might see in English. I have addressed this in the second point of my original post. (Also I tested it on some non-Chinese speaking friends without context, and here are the results: for some people, a Chinese word is easier to remember when separated by syllable, IF there are about three or less syllables overall. For longer titles, maybe four syllables or above, grouped words is easier to remember.
When romanising emojis that contain faces, avoid using noses as they are fairly uncommonly usedIn addition, how are symbols with no clear emoticon meant to be romanised? There is no clear way to express 🍆, 👺 or 💩 with basic latin unicode.
In that case it probably belongs to the osu!mania RC, anyways you should suggest wording and bring a bit of discussion on the topic if you feel that this is worth to add to the guidelines here or there.abraker wrote:
The reason I mention this is that some maps in 8k mania tend to be either 8k or 7k+1 and it's impossible to know until you download and check them out.
Well that's kinda important. Because in UBKRC, we actually considered the compatibility within osu!. There was that agreement that if it's possible, we should choose what is similar to Modified Hepburn, to keep the metadata consistent etc. (I believe it was for Cyrillic and Chinese)Ephemeral wrote:
keep in mind that the ONLY thing that matters re: romanization is that the title itself can be readily linked back to its source material, we're not after translation or context preservation or anything like that, only transliteration
I can agree with that. It's not exact, but way closer than "yu" or "v", at least if you read it as in any other romanisation system we currently use.Monstrata wrote:
Closest pinyin sound to ü is "yi" btw. I can see why "yu" would be read as "yoo" cuz you naturally make "oo" sound with u. You naturally make "ee" sound with i. "yi" is closest imo as someone who speaks mandarin and english, what do you think?
Because searching would be much more complicated. Majority of people couldn't type characters such as ǔ, ü, etc. and wouldn't find anything in the end.Kroytz wrote:
Why don’t we just allow Unicode characters to be used when submitting beatmaps and we can solve all our problems? (serious)
Yes, but "ae", "o/oe" and "aa" seem to have a basis in pronunciation, at least. "v" is a choice based on keyboard layout. Which will not be even close if you actually try to pronounce it.CXu wrote:
Similarly, we have æ ø å in Norwegian/Danish (and ä ö å in Swedish) which usually just get changed to ae, o/oe, aa, at least in Norwegian. The ø kinda sounds like the i in first, while å is like the o in old.
In my opinion "yi" is not feasible to be implemented since there already exists "yi" as a syllable in Pin Yin system. Using yi will produce new confusions.Monstrata wrote:
Closest pinyin sound to ü is "yi" btw. I can see why "yu" would be read as "yoo" cuz you naturally make "oo" sound with u. You naturally make "ee" sound with i. "yi" is closest imo as someone who speaks mandarin and english, what do you think?
Also, before you try to regard "easier to pronounce for non-Mandarin speakers" as an advantage of any other choice than "v", you need to make sure whether the pronunciation of the new choice is, at least, inclined to the correct pronunciation of that in Mandarin, otherwise, the "easier to speak" statement will not be a valid reason. Since "yu" is far more from being similar to ü, it is already disqualified itself for having the advantage in pronunciation.Fycho wrote:
It could be read easily by English speakers.
Are you sure nobody does that? And are you sure you will end up getting ü? What do you mean by “pinyin” layout? Input method? Softwares? Human-machine interfaces? If so, why osu! cannot do that? Asserting by vocabularies like “nobody” is not convincing, you might need to provide evidence to support your idea.Wafu wrote:
nobody does that. Majority of people will still write "ü" and if you use the "v" on the pinyin layout, you will end up getting "ü" anyway.
So what do you actually mean? You posted "Majority of people will still write ü" while "Majority of people couldn't type characters such as ǔ, ü, etc."Wafu wrote:
Majority of people couldn't type characters such as ǔ, ü, etc. and wouldn't find anything in the end.
Fycho wrote:
For the TV Size thing, drop some opinions:
For example this song: https://osu.ppy.sh/s/477045
The song has a game ver that without "~TV animation ver.~", and has a TV ver later that labled with "~TV animation ver.~" to distinguish. They are different in Instrument and lyrics. In this case, (TVsize) aren't necessary but not for "~TV animation ver.~". That popular "~Anime Ban~" is pretty similar stuff.
I believe there is a metadata discretion when handling things like this.
Wafu wrote:
Kroytz wrote:
Why don’t we just allow Unicode characters to be used when submitting beatmaps and we can solve all our problems? (serious)
Because searching would be much more complicated. Majority of people couldn't type characters such as ǔ, ü, etc. and wouldn't find anything in the end.