Jump to content

LocalRatings mod - evaluate players' skills based on previous games


Mentula
 Share

Recommended Posts

Hello everyone!

I hereby present a 0 A.D. mod aimed at evaluating the rating of players. Official mod page on GitLab here.

Introduction

Before diving into the description, let me introduce the problem this mod aims to solve. In 0 A.D., the ELO system is used to rank players in the lobby. This is good; but is it representative of the players' skills? As you know, the rating system in 0 A.D. only takes into account 1v1 rated games. Team games do not contribute to the ELO score of a player, as well as 1v1 unrated games. Also, the scoring system only takes into account the outcome of a game (victory/defeat) and not the "performance" during the game. Can we do better?

This mod uses statistics. It extracts data from all the replays of games you (the mod user) have played. So, if you have played 20 games (1v1s, team games, other..) with a player in the lobby whose name is (for example) strangeJokes, the mod will assign a rating to strangeJokes based on the 20 games you've played with them.

The rating system

The functioning of the rating system is described in detail at the official mod page on GitLab, but in short what it does is:

  • it considers the average performance of the player during the entire game (and not only at game's end).
  • the rating assigned to a player is a percentage: for example, a player with a rating of 5.00 performs a 5% better than other players on average, while a player with a rating of -5.00 performs a 5% worse than other players on average.
  • you can customize the rating system by giving more importance to military, economy, exploration or other factors to the aim of calculating ratings.

Keep in mind that this mod is based on statistics; data are taken from your (the mod user) replays. Statistics might not be fully representative of reality; therefore, a player's rating could be inaccurate, especially if you have played few games with that player. The more you play with a player, the more accurate the rating of that player is.

Mod installation

Spoiler
  1. Download the attached zip file and extract it.
  2. Copy the LocalRatings folder into the mods folder, tipically located at:
    • Linux: ~/.local/share/0ad/mods/
    • macOS: ~/Library/Application\ Support/0ad/mods/
    • Windows: ~\Documents\My Games\0ad\mods\
  3. Launch 0 A.D. and open the Settings > Mod Selection menu.
  4. Select the LocalRatings mod, Enable it and Save Configuration.

Explanatory pictures

The main page: it lists all players with their rating and the number of games you have played with them.

Spoiler

page.thumb.png.c6441daf3d285761ab3d37e62584b857.png

The Options > Score Weights page: where you can customize the rating system based on the factors you consider more important.

Spoiler

weights.thumb.png.d34d7beef37da690fcfe03882196111c.png

The Options > Match Filters page: where you can select the games relevant for the rating calculation (for example, you might want to ignore games with duration less that 5 minutes).

Spoiler

filters.thumb.png.1045d235bb9ac89f6169bd3d7c332dab.png

Contribute

The public repository is at this page. Everybody is very welcome to contribute, suggest, fork or simply give feedback. Have fun!

 

LocalRatings-v0.25.3.zip

Edited by Mentula
fixed bug from previous version attachment
  • Like 9
  • Thanks 2
Link to comment
Share on other sites

This is pretty cool. FYI: After installing and starting it up, I had to click "Rebuild List" before I was able to get actual ratings. Once I clicked that, it worked fine.

 

If we're posting our own ratings for comparison, my rating here with this mod was 3.43. My lobby rating is 1286.

Edited by thephilosopher
  • Like 1
Link to comment
Share on other sites

17 minutes ago, seeh said:

It would be great if you could limit the time. E.g. the last x months. Or the last 50 games. Then the games where you played really badly would no longer be included

The main problem with restricting the sample of replays to a limited amount is statistics. The more replays you have, the more accurate the rating of a player is, therefore it is disadvisable to rely on data extracted from a small (though recent) sample.

Link to comment
Share on other sites

5 minutes ago, Mentula said:

disadvisable to rely on data extracted from a small (though recent) sample

sure but if you have played many years then you have data. then i not very interested to calculate the first years.
as a workaround we could remove games out of the replay folder.

or a other idea: maybe you could waigt in calculation games stronger that not very old. so it not needet to be configed :)

Edited by seeh
Link to comment
Share on other sites

@Mentula I would recommend adding an uncertainty label after the main rating, calculated based on how many games they have played with you. This will prevent players with just 1 fluked match getting ridiculously high ratings. 

For example, leGrosRobert is at the first place although he is arguably not the best player in your list. Furthermore, 2 of Yekaterina's smurf accounts are way better than you and weirdJokes, although I doubt whether she is actually that talented.

Link to comment
Share on other sites

6 minutes ago, Sevda said:

@Mentula using your default settings, this is the leaderboard for me:

image.thumb.png.60124a9593857c56eb2b4a7eafa97939.png

And I find myself with no rating :(

image.thumb.png.163356f17b74386c2d4e26eb3bcd4ac2.png

 

Why is it that so many players have a score of 0?

Hum that is a bit strange, it could be a bug. It's actually the first time I see it and at the moment it's hard to find the cause. @Sevda I'll send you a private message to investigate (maybe tomorrow, now it's late in my timezone). Thanks for reporting!

Link to comment
Share on other sites

2 hours ago, Player of 0AD said:

If someone will play only 1v1 vs much weaker players, the player can get easily a very high "rating". So you can think that its rather like a win rate than like a rating. Just saying

I personally disagree with this interpretation of the rating. First of all, it's true that someone who mostly plays against weaker players gets a higher rating, no doubt. However, I disagree on the interpretation of the rating as a win rate (in any context: 1v1s, TGs, ...). The rating assigned by the LocalRatings mod (the default one, as well as any user-customized rating using different weights) is very distant from representing a win rate. A win rate of 20% means that the player wins 1 out of 5 games, which is not good; on the other hand, a rating (as in the LocalRatings mod) of 20% means that the player's graph in the Summary chart is 20% better than the average graph, which in other words means that the player performs very well (and therefore presumably has a high win rate).

In my experience with this mod, a player with a rating of 20% is a strong player; I wouldn't say the same of a player with a 20% win rate.

However, let me clarify one thing again: this mod is based on statistics. In statistics any of us can imagine a "limit case" (it could be, for example, a player who only plays with much weaker players). On the bright side... this mod is based on statistics! This means that, generally, a player who plays with different types of players will experience more reliability in the ratings data.

56 minutes ago, Sevda said:

Furthermore, you can farm ratings easily using virtual machines (although I haven't been able to work out the maths behind the rating system), so I am inclined to trust Mentula's ratings mod more than Vanilla Lobby's ratings evaluation. 

This comment actually gives me the possibility of clarifying one thing. Assigning a rating to a player can be something very arbitrary and will never make all of us agree. So, I tend to see the rating of a player as the player's performance over the average, or, if you prefer, their contribution to the game. But this is just my interpretation of it. The lobby's rating evaluation is a system only takes into account the win rate, whereas the LocalRatings mod takes into account scores. In this sense, they don't conflict, they just represent two different things.

  • Like 2
Link to comment
Share on other sites

1 hour ago, Mentula said:

Hum that is a bit strange, it could be a bug. It's actually the first time I see it and at the moment it's hard to find the cause. @Sevda I'll send you a private message to investigate (maybe tomorrow, now it's late in my timezone). Thanks for reporting!

Sevda's issue is the same issue I had at first. I solved it by clicking "Rebuild List," and then all the scores populated.

  • Like 1
  • Thanks 1
Link to comment
Share on other sites

1273841976_Bildschirmfotovon2022-05-1512-33-38.thumb.png.9c14dcb752e0e5f8ec6e1ff15d294524.png

My ratings, based on resources (creating a resource gives a point, sending resources to allies too, destroying gives a point, capturing resources which are buildings gives almost 2 points (not exactly 2 because you often cant keep the building), basic loot is considered (its hard to consider the loot which results in units carrying resources))

 

  • Like 1
Link to comment
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

 Share

×
×
  • Create New...