Story Details

  • Bypass DeepSeek censorship by speaking in hex

    Posted: 2025-01-31 19:41:49

    The Substack post details how DeepSeek, a video search engine with content filtering, can be circumvented by encoding potentially censored keywords as hexadecimal strings. Because DeepSeek decodes hex before applying its filters, a search for "0x736578" (hex for "sex") will return results that a direct search for "sex" might block. The post argues this reveals a flaw in DeepSeek's censorship implementation, demonstrating that filtering based purely on keyword matching is easily bypassed with simple encoding techniques. This highlights the limitations of automated content moderation and the potential for unintended consequences when relying on simplistic filtering methods.

    Summary of Comments ( 320 )
    https://news.ycombinator.com/item?id=42891042

    Hacker News users discuss potential censorship evasion techniques, prompted by an article detailing how DeepSeek, a coder-focused search engine, appears to suppress results related to specific topics. Several commenters explore the idea of encoding sensitive queries in hexadecimal format as a workaround. However, skepticism arises regarding the long-term effectiveness of such a tactic, predicting that DeepSeek would likely adapt and detect such encoding methods. The discussion also touches upon the broader implications of censorship in code search engines, with some arguing that DeepSeek's approach might hinder access to valuable information while others emphasize the platform's right to curate its content. The efficacy and ethics of censorship are debated, with no clear consensus emerging. A few comments delve into alternative evasion strategies and the general limitations of censorship in a determined community.