Statistic approach to Player Skill and Beatmap Difficulty

Full Tablet

Joined September 2011

Topic Starter

Full Tablet 2015-08-03T13:16:40+00:00

I made some changes to the algorithm, inspired by this post: p/4383854

The algorithm fits the data (scores obtained by players) to logistic curves, where the parameters to fit are Player Skill, Beatmap Difficulty for 900K score, and Steepness of the difficulty curve for beatmaps.

The predicted score for a play is:

(The formula was posted as an image, which is not preserved here.)
Where P is the player skill, B is the beatmap difficulty (for 900K score), and S is the steepness parameter of the difficulty curve.
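Since the formula image is missing, here is one logistic form that is consistent with the description above. It is an assumption for illustration, not necessarily the exact expression used in the algorithm; the constant 1/9 is chosen only so that the predicted score equals 900K when P = B, and 10^6 is the maximum osu!mania score.

```latex
\[
  \mathrm{score}(P, B, S) \;=\; \frac{10^{6}}{1 + \tfrac{1}{9}\, e^{-S\,(P - B)}}
\]
% Check: at P = B the exponential is 1, so score = 10^6 / (1 + 1/9) = 900\,000.
% A larger S makes the score change faster as P moves away from B.
```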

For example, two different maps that have the same difficulty at 900K but different steepness (the plot is not preserved here):

The orange curve represents the difficulty curve of a map with high steepness, while the blue one has lower steepness.
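To make the effect of steepness concrete, here is a small sketch that compares two maps with the same difficulty at 900K. The function `predicted_score` and its exact logistic form are assumptions (a curve scaled so that the score is 900K when skill equals difficulty), and the parameter values are made up.

```python
import math

MAX_SCORE = 1_000_000  # maximum osu!mania score

def predicted_score(p, b, s):
    """Hypothetical logistic curve: returns 900K when p == b."""
    return MAX_SCORE / (1.0 + math.exp(-s * (p - b)) / 9.0)

# Two made-up maps with the same difficulty at 900K but different steepness.
steep_map = {"b": 5.0, "s": 3.0}
gentle_map = {"b": 5.0, "s": 1.0}

for skill in (4.0, 5.0, 6.0):
    steep = predicted_score(skill, steep_map["b"], steep_map["s"])
    gentle = predicted_score(skill, gentle_map["b"], gentle_map["s"])
    print(f"skill={skill}: steep={steep:.0f}, gentle={gentle:.0f}")
```

Under this assumed form, both maps give 900K at skill 5.0, but the steep map punishes weaker players more and rewards stronger players more than the gentle one.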

The regression minimizes the sum of the squared errors between the predicted scores and the observed scores.
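As a rough illustration of the least-squares idea, the sketch below fits only one player's skill while holding the beatmap parameters fixed. The real algorithm fits player skills, beatmap difficulties, and steepness values jointly. The logistic form, the grid search, and all names and numbers here are assumptions made for the example.

```python
import math

MAX_SCORE = 1_000_000  # maximum osu!mania score

def predicted_score(p, b, s):
    """Hypothetical logistic curve: returns 900K when p == b."""
    return MAX_SCORE / (1.0 + math.exp(-s * (p - b)) / 9.0)

def sum_squared_error(p, plays):
    """plays is a list of (difficulty, steepness, observed_score) tuples."""
    return sum((predicted_score(p, b, s) - score) ** 2 for b, s, score in plays)

def fit_skill(plays, lo=0.0, hi=10.0, steps=10_001):
    """Grid search for the skill value with the smallest squared error."""
    grid = (lo + (hi - lo) * i / (steps - 1) for i in range(steps))
    return min(grid, key=lambda p: sum_squared_error(p, plays))

# Made-up beatmap parameters; scores are generated from a "true" skill of 4.2
# so we can check that the fit recovers it.
true_skill = 4.2
maps = [(3.0, 1.5), (4.5, 2.0), (5.5, 1.0)]
plays = [(b, s, predicted_score(true_skill, b, s)) for b, s in maps]

print(fit_skill(plays))  # should land very close to 4.2
```

A grid search is used only because it is easy to verify; a joint fit over many players and maps would need a proper nonlinear least-squares solver, which is where the RAM and CPU cost comes from.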

Here are results for ranked 6K maps: https://www.dropbox.com/s/vyoi1r86m9r8t ... .xlsx?dl=0

Take beatmap difficulty results that are based on few scores with a grain of salt (especially those with only 1 score to base the calculation on; those use a default steepness parameter instead of a calculated one).

For the player rankings, there is also a "Performance" value. This value is calculated from the difficulty associated with each of the player's plays, with a score penalty based on map length (since fluke plays are more likely on shorter maps) and reduced weighting for beatmaps whose difficulty was estimated from few scores (since those estimates are more likely to be inaccurate). The "Player Skill" is the value used in the beatmap difficulty estimation, and is more indicative of the player's average performance across their plays.

To run the algorithm for other key counts, I would need to select players to base the calculations on (I can't use a very large amount, since the algorithm is expensive in RAM and CPU use). Ideally, the players should have a large number of plays and a consistent performance, meaning few scores below their current level of play. A counterexample is a player who has improved a lot over time but hasn't improved their old scores. The players should also represent a wide range of skill levels. Once the beatmap difficulty values are calculated, adding more players to the ranking is relatively simple (but the score retrieval using the osu! API is still quite slow).

Clappy

373 posts

Joined May 2013

Clappy 2015-08-03T13:23:40+00:00

Full Tablet wrote:
(I can't use a very large amount, since the algorithm is expensive in RAM and CPU use)

Get someone with an i7 5960X and 128 GB of DDR4 to test it out for you


-Maus-

571 posts

Joined August 2013

-Maus- 2015-08-03T13:49:11+00:00

Your nick is my reaction

abraker

Global Moderator

8,295 posts

Joined July 2014

abraker 2015-08-03T20:51:23+00:00

Full Tablet wrote:
I can't use a very large amount, since the algorithm is expensive in RAM and CPU use

How much CPU time are we talking about here? Surely leaving the computer running overnight would do the trick. As for RAM usage, I'm pretty sure there is a way to keep it down by doing it in C++ non-recursively.

