Story Details

  • League of Legends data scraping the hard and tedious way for fun

    Posted: 2025-02-12 11:11:38

    The author details their complex and manual process of scraping League of Legends match data, driven by a desire to analyze their own gameplay. Lacking a readily available API for detailed match timelines, they resorted to intercepting and decoding network traffic between the game client and Riot's servers. This involved using a proxy server to capture the WebSocket data, meticulously identifying the relevant JSON messages containing game events, and writing custom parsing scripts in Python. The process was complicated by Riot's obfuscation techniques and frequent changes to the game, requiring ongoing adaptation and reverse-engineering. Ultimately, the author succeeded in extracting the data, but acknowledges the fragility and unsustainability of this method.

    Summary of Comments ( 26 )
    https://news.ycombinator.com/item?id=43024173

    HN commenters generally praised the author's dedication and ingenuity in scraping League of Legends data despite the challenges. Several pointed out the inherent difficulty of scraping data from games, especially live service ones like LoL, due to frequent updates and anti-scraping measures. Some suggested alternative approaches like using the official Riot Games API, though the author explained their limitations for his specific needs. Others shared their own experiences and struggles with similar projects, highlighting the common pain points of maintaining scrapers. A few commenters expressed interest in the data itself and potential applications for analysis and research. The overall sentiment was one of appreciation for the author's persistence and the technical details shared.