The Blend2D project developed a new high-performance PNG decoder, significantly outperforming existing libraries like libpng, stb_image, and lodepng. This achievement stems from a focus on low-level optimizations, including SIMD vectorization, optimized Huffman decoding, prefetching, and careful memory management. These improvements were integrated directly into Blend2D's image pipeline, further boosting performance by eliminating intermediate copies and format conversions when loading PNGs for rendering. The decoder is designed to be robust, handling invalid inputs gracefully, and emphasizes correctness and standard compliance alongside speed.
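To make the decoder's inner-loop work concrete, here is a minimal C++ sketch of reconstructing a PNG scanline filtered with the "Sub" filter, the kind of loop that the SIMD vectorization mentioned above targets. This is an illustrative sketch with assumed names and layout, not Blend2D's actual code:

```cpp
#include <cstdint>
#include <cstddef>

// Undo PNG's "Sub" filter (type 1): each byte stores the difference
// from the byte `bpp` positions earlier, so reconstruction is a
// running sum modulo 256. SIMD versions process many bytes per
// iteration but must still respect this serial dependency chain.
void unfilterSub(uint8_t* row, size_t rowBytes, size_t bpp) {
    for (size_t i = bpp; i < rowBytes; i++)
        row[i] = uint8_t(row[i] + row[i - bpp]);
}
```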
Driven by a desire to understand how Photoshop worked under the hood, the author embarked on a personal project to recreate core functionality in C++. Focusing on fundamental image manipulation like layers, blending modes, filters (blur, sharpen), and transformations, they built a simplified version without aiming for feature parity. This exercise provided valuable insights into image-processing algorithms and the complexities of software development, highlighting the importance of performance optimization, especially when dealing with large images and complex operations. The project, while not a full Photoshop replacement, served as a profound learning experience.
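As a flavor of what such a project involves, here is a hedged C++ sketch of one of the simpler blending modes, "multiply"; the pixel type and the simplified alpha handling are assumptions, not the author's implementation:

```cpp
#include <cstdint>

struct Pixel { uint8_t r, g, b, a; };

// "Multiply" blend: each channel is the product of the two layers
// treated as values in [0, 1], which always darkens the result.
// Alpha handling is deliberately simplified (bottom layer's alpha kept).
Pixel blendMultiply(Pixel bottom, Pixel top) {
    auto mul = [](uint8_t x, uint8_t y) {
        return uint8_t((x * y + 127) / 255);  // rounded fixed-point multiply
    };
    return { mul(bottom.r, top.r), mul(bottom.g, top.g),
             mul(bottom.b, top.b), bottom.a };
}
```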
Hacker News users generally praised the author's project, "Recreating Photoshop in C++," for its ambition and educational value. Some questioned the practical use of such an undertaking, given the existence of Photoshop and other mature image editors. Several commenters pointed out the difficulty in replicating Photoshop's full feature set, particularly the more advanced tools. Others discussed the choice of C++ and suggested alternative languages or libraries that might be more suitable for certain aspects of image processing. The author's focus on performance optimization and leveraging SIMD instructions also sparked discussion around efficient image manipulation techniques. A few comments highlighted the importance of UI/UX design, often overlooked in such projects, for a truly "Photoshop-like" experience. A recurring theme was the project's value as a learning exercise, even if it wouldn't replace existing professional tools.
Dithering is a technique used to create the illusion of more colors and smoother gradients in images with a limited color palette. The post "Dithering in Colour" explores various dithering algorithms, focusing on how they function with color images. It explains ordered dithering using matrices like the Bayer matrix, and error-diffusion dithering methods such as Floyd-Steinberg, which distribute quantization errors to neighboring pixels. The post visually demonstrates the effects of these algorithms with examples, highlighting the trade-offs between different methods in terms of perceived noise and color accuracy. It concludes by mentioning how dithering remains relevant today for stylistic effects and performance optimization, even with modern displays capable of displaying millions of colors.
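For concreteness, here is a minimal C++ sketch of Floyd-Steinberg error diffusion quantizing a grayscale image to black and white; the color variant discussed in the post applies the same idea per channel against a palette. The buffer layout here is an assumption:

```cpp
#include <vector>

// Floyd-Steinberg dithering to 1-bit: quantize each pixel, then push
// the quantization error onto not-yet-visited neighbors with the
// classic 7/16, 3/16, 5/16, 1/16 weights.
// `img` holds grayscale values in [0, 1], row-major, w * h entries.
void floydSteinberg(std::vector<float>& img, int w, int h) {
    auto at = [&](int x, int y) -> float& { return img[y * w + x]; };
    for (int y = 0; y < h; y++) {
        for (int x = 0; x < w; x++) {
            float old = at(x, y);
            float q = old < 0.5f ? 0.0f : 1.0f;  // nearest palette entry
            float err = old - q;
            at(x, y) = q;
            if (x + 1 < w)              at(x + 1, y)     += err * 7.0f / 16.0f;
            if (x > 0 && y + 1 < h)     at(x - 1, y + 1) += err * 3.0f / 16.0f;
            if (y + 1 < h)              at(x, y + 1)     += err * 5.0f / 16.0f;
            if (x + 1 < w && y + 1 < h) at(x + 1, y + 1) += err * 1.0f / 16.0f;
        }
    }
}
```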
HN users generally praised the article for its clear explanation of dithering, particularly its interactive visualizations. Several commenters shared their experiences with dithering, including its use in older games and demos. Some discussed the subtle differences between various dithering algorithms, while others highlighted the continued relevance of these techniques in resource-constrained environments or for stylistic effect. One commenter pointed out a typo in the article, which the author promptly corrected. A few users mentioned alternative resources on the topic, including a related blog post and a book.
This post introduces rotors as a practical alternative to quaternions and matrices for 3D rotations. It explains that rotors, like quaternions, represent rotations as a single action around an arbitrary axis, but offer a simpler, more intuitive geometric interpretation based on the concept of "geometric algebra." The author argues that rotors are easier to understand and implement, visually demonstrating their geometric meaning and providing clear code examples in Python. The post covers basic rotor operations like creating rotations from an axis and angle, composing rotations, and applying rotations to vectors, highlighting rotors' computational efficiency and stability.
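Since the post's own examples are in Python, the following is an analogous hedged C++ sketch of the basic rotor operations it describes: building a rotor from a rotation plane and angle, composing rotations, and rotating a vector. The names and sign conventions are assumptions, not the article's code:

```cpp
#include <cmath>

struct Vec3 { float x, y, z; };

// Minimal 3D rotor: a scalar part plus three bivector components,
// one per rotation plane (xy, xz, yz). Isomorphic to a quaternion,
// but each component has a direct geometric reading.
struct Rotor {
    float a, bxy, bxz, byz;

    // Rotor rotating by `rad` in the plane of a unit bivector:
    // R = cos(rad/2) - sin(rad/2) * B.
    static Rotor fromPlaneAngle(float pxy, float pxz, float pyz, float rad) {
        float s = std::sin(0.5f * rad);
        return { std::cos(0.5f * rad), -s * pxy, -s * pxz, -s * pyz };
    }

    // Geometric product; (r2 * r1) applies r1 first, then r2.
    friend Rotor operator*(const Rotor& p, const Rotor& q) {
        return { p.a * q.a   - p.bxy * q.bxy - p.bxz * q.bxz - p.byz * q.byz,
                 p.a * q.bxy + p.bxy * q.a   - p.bxz * q.byz + p.byz * q.bxz,
                 p.a * q.bxz + p.bxz * q.a   + p.bxy * q.byz - p.byz * q.bxy,
                 p.a * q.byz + p.byz * q.a   - p.bxy * q.bxz + p.bxz * q.bxy };
    }

    // Apply the rotation via the sandwich product v' = R v ~R,
    // expanded into components; t is the intermediate trivector part.
    Vec3 rotate(Vec3 v) const {
        float qx = a * v.x + bxy * v.y + bxz * v.z;
        float qy = a * v.y - bxy * v.x + byz * v.z;
        float qz = a * v.z - bxz * v.x - byz * v.y;
        float t  = byz * v.x - bxz * v.y + bxy * v.z;
        return { a * qx + bxy * qy + bxz * qz + byz * t,
                 a * qy - bxy * qx + byz * qz - bxz * t,
                 a * qz - bxz * qx - byz * qy + bxy * t };
    }
};

// Example: a 90-degree rotation in the xy plane sends (1,0,0) to (0,1,0).
// Vec3 v = Rotor::fromPlaneAngle(1, 0, 0, 1.5707963f).rotate({1, 0, 0});
```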
Hacker News users discussed the practicality and intuitiveness of using rotors for 3D rotations. Some found the rotor approach more elegant and easier to grasp than quaternions, especially appreciating the clear geometric interpretation and connection to bivectors. Others questioned the claimed advantages, arguing that quaternions remain the superior choice for performance and established library support. The potential benefits of rotors in areas like interpolation and avoiding gimbal lock were acknowledged, but some commenters felt the article didn't fully demonstrate these advantages convincingly. A few requested more comparative benchmarks or examples showcasing rotors' practical superiority in specific scenarios. The lack of widespread adoption and existing tooling for rotors was also raised as a barrier to entry.
Using mix() with step() to simulate conditional assignments in shaders is often less efficient than directly using branch instructions. While seemingly branchless, this mix()/step() approach can introduce extra computations and potentially disrupt hardware optimizations related to predication. Modern GPUs are adept at handling branches efficiently, especially when they are predictable, so relying on them is often faster and simpler than employing arithmetic workarounds. Therefore, default to standard branching unless profiling reveals a specific performance bottleneck that can be demonstrably addressed by a mix()/step() alternative.
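To make the comparison concrete: in GLSL, mix(a, b, step(edge, x)) selects b when x >= edge and a otherwise. Below is a C++ analogue of both forms, purely to illustrate the arithmetic involved; real shader compilers and GPUs may treat either form very differently:

```cpp
// GLSL-style helpers: step() is 0 or 1 around an edge, and mix()
// interpolates linearly, so mix(a, b, step(edge, x)) picks a or b.
float step_(float edge, float x) { return x < edge ? 0.0f : 1.0f; }
float mix_(float a, float b, float t) { return a * (1.0f - t) + b * t; }

// "Branchless" select: always evaluates both inputs plus the blend math.
float selectMixStep(float a, float b, float edge, float x) {
    return mix_(a, b, step_(edge, x));
}

// Plain branch: on modern GPUs a coherent branch like this is often
// cheaper, since only one side needs to be evaluated.
float selectBranch(float a, float b, float edge, float x) {
    return x < edge ? a : b;
}
```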
HN users generally agreed that the article's advice is sound, particularly for modern GPUs. Several pointed out that mix() and step() can be more efficient than branching, especially when dealing with SIMD architectures where branching can lead to thread divergence. Some emphasized that profiling is crucial, as the optimal approach can vary depending on the specific GPU and shader complexity. One commenter noted that while branching might be faster in simple cases, mix() offers more predictable performance as shader complexity increases. Another cautioned against premature optimization and recommended focusing on algorithmic improvements first. A few users shared alternative techniques like using lookup textures or bitwise operations for certain conditional scenarios. Finally, there was discussion about the evolution of GPU architecture and how older advice regarding branching might no longer apply.
This blog post details a method for realistically simulating shallow water flow over terrain. The author utilizes a heightmap to represent the terrain and employs a simplified shallow water equations model to govern water movement. This model calculates water height and velocity, accounting for factors like terrain slope and gravity. The simulation iteratively updates the water's state using numerical integration, allowing for dynamic changes in water distribution and flow patterns based on the underlying terrain. Visualization is achieved through a simple rendering technique that adjusts terrain color based on water depth, creating a visually convincing representation of shallow water flowing over varied terrain.
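The following is a minimal C++ sketch of one explicit update step in the spirit of that description, using a 1D grid, face-centered velocities, and first-order upwind fluxes; it is an assumed discretization, not the author's scheme:

```cpp
#include <vector>

// One explicit Euler step of a simplified 1D shallow-water model.
// h: water depth per cell (n entries); u: velocity per cell face
// (n + 1 entries, boundary faces left at zero for closed walls).
void stepShallowWater(const std::vector<float>& terrain,
                      std::vector<float>& h, std::vector<float>& u,
                      float g, float dt, float dx) {
    size_t n = h.size();
    // Accelerate face velocities by the slope of the total surface
    // (terrain height + water depth), scaled by gravity.
    for (size_t i = 1; i < n; i++) {
        float surfL = terrain[i - 1] + h[i - 1];
        float surfR = terrain[i] + h[i];
        u[i] -= g * (surfR - surfL) / dx * dt;
    }
    // Move water between cells with first-order upwind fluxes.
    std::vector<float> flux(n + 1, 0.0f);
    for (size_t i = 1; i < n; i++)
        flux[i] = u[i] * (u[i] > 0.0f ? h[i - 1] : h[i]);
    for (size_t i = 0; i < n; i++) {
        h[i] += (flux[i] - flux[i + 1]) * dt / dx;
        if (h[i] < 0.0f) h[i] = 0.0f;  // crude handling of dry cells
    }
}
```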
Commenters on Hacker News largely praised the clarity and educational value of the blog post on simulating water over terrain. Several appreciated the author's focus on intuitive explanation and avoidance of overly complex mathematics, making the topic accessible to a wider audience. Some pointed out the limitations of the shallow water equations used, particularly regarding their inability to model breaking waves, while others suggested alternative approaches or resources for further exploration, such as smoothed-particle hydrodynamics (SPH) and the book "Fluid Simulation for Computer Graphics." A few commenters also shared their own experiences and projects related to fluid simulation. Overall, the discussion was positive and focused on the technical aspects of the simulation.
The Graphics Codex is a comprehensive, free online resource for learning about computer graphics. It covers a broad range of topics, from fundamental concepts like color and light to advanced rendering techniques like ray tracing and path tracing. Emphasizing a practical, math-heavy approach, the Codex provides detailed explanations, interactive diagrams, and code examples to facilitate a deep understanding of the underlying principles. It's designed to be accessible to students and professionals alike, offering a structured learning path from beginner to expert levels. The resource continues to evolve and expand, aiming to become a definitive and up-to-date guide to the field of computer graphics.
Hacker News users largely praised the Graphics Codex, calling it a "fantastic resource" and a "great intro to graphics". Many appreciated its practical, hands-on approach and clear explanations of fundamental concepts, contrasting it favorably with overly theoretical or outdated textbooks. Several commenters highlighted the value of its accompanying code examples and the author's focus on modern graphics techniques. Some discussion revolved around the choice of GLSL over other shading languages, with some preferring a more platform-agnostic approach, but acknowledging the educational benefits of GLSL's explicit nature. The overall sentiment was highly positive, with many expressing excitement about using the resource themselves or recommending it to others.
This blog post breaks down the "Tiny Clouds" Shadertoy by iq, explaining its surprisingly simple yet effective cloud rendering technique. The shader uses raymarching through a 3D noise function, but instead of directly visualizing density, it calculates the amount of light scattered backwards towards the viewer. This is achieved by accumulating the density along the ray and weighting it based on the distance traveled, effectively simulating how light scatters more in denser areas. The post further analyzes the specific noise function used, which combines several octaves of Simplex noise for detail, and discusses how the scattering calculations create a sense of depth and illumination. Finally, it offers variations and potential improvements, such as adding lighting controls and exploring different noise functions.
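Here is a hedged C++ sketch of that accumulation idea: march along the ray, sample a density field, and add back-scattered light weighted by how much of the ray has survived so far. The noiseDensity stand-in and the attenuation constants are assumptions, not the shader's actual code:

```cpp
#include <algorithm>
#include <cmath>

struct Vec3f { float x, y, z; };

// Placeholder density field; the actual shader layers several octaves
// of noise here. Kept trivial so the sketch compiles on its own.
float noiseDensity(Vec3f p) {
    return 0.2f + 0.1f * std::sin(p.x + p.y + p.z);
}

// March along the ray, accumulating back-scattered light: each sample
// contributes its density weighted by how much of the ray has survived
// (the transmittance), so dense regions near the viewer dominate.
float marchClouds(Vec3f origin, Vec3f dir, int steps, float stepSize) {
    float light = 0.0f;
    float transmittance = 1.0f;
    for (int i = 0; i < steps; i++) {
        float t = stepSize * float(i);
        Vec3f p = { origin.x + dir.x * t,
                    origin.y + dir.y * t,
                    origin.z + dir.z * t };
        float absorbed = std::min(noiseDensity(p) * stepSize, 1.0f);
        light += transmittance * absorbed;    // light scattered back
        transmittance *= 1.0f - absorbed;     // attenuate deeper samples
        if (transmittance < 0.01f) break;     // ray effectively opaque
    }
    return light;
}
```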
Commenters on Hacker News largely praised the "Tiny Clouds" shader's elegance and efficiency, admiring the author's ability to create such a visually appealing effect with minimal code. Several discussed the clever use of trigonometric functions and noise to generate the cloud shapes, and some delved into the specifics of raymarching and signed distance fields. A few users shared their own experiences experimenting with similar techniques, and offered suggestions for further exploration, like adding lighting variations or animation. One commenter linked to a related Shadertoy example showcasing a different approach to cloud rendering, prompting a brief comparison of the two methods. Overall, the discussion highlighted the technical ingenuity behind the shader and fostered a sense of appreciation for its concise yet powerful implementation.
Someone has rendered the entirety of the original Doom (1993) game, including all levels, enemies, items, and even the intermission screens, as individual images within a 460MB PDF file. This allows for a static, non-interactive experience of browsing through the game's visuals like a digital museum exhibit. The PDF acts as a unique form of archiving and presenting the game's assets, essentially turning the classic FPS into a flipbook.
Hacker News users generally expressed amusement and appreciation for the novelty of rendering Doom as a PDF. Several commenters questioned the practicality, but acknowledged the technical achievement. Some discussed the technical aspects, wondering how it was accomplished and speculating about the use of vector graphics and custom fonts. Others shared similar projects, like rendering Quake in HTML. A few users pointed out potential issues, such as the large file size and the lack of interactivity, while others jokingly suggested printing it out. Overall, the sentiment was positive, with commenters finding the project a fun and interesting hack.
HN commenters generally praise Blend2D's PNG decoder for its speed and clean implementation. Some appreciate the detailed blog post explaining its design and optimization strategies, highlighting the clever use of SIMD intrinsics and the decision to avoid complex dependencies. One commenter notes the impressive performance compared to LodePNG, particularly for large images. Others discuss potential further optimizations, such as using pre-calculated tables for faster filtering, and the challenges of achieving peak performance with varying image characteristics and hardware platforms. A few users also share their experiences integrating or considering Blend2D in their projects.
The Hacker News post titled "High-Performance PNG Decoding", discussing the blog post about Blend2D's new PNG codec, drew a moderate number of comments covering performance, specific implementation details, and comparisons to other libraries.
Several commenters express admiration for the author's deep dive into optimization and the performance results achieved. One notes the impressive speeds, especially for the palette and grayscale formats, questioning whether further optimization is even possible or necessary. Another commends the author's dedication to thoroughly explaining their optimization process and the challenges they encountered. Other commenters appreciate the detailed explanations as well, since they provide insight into the complexities of image decoding and the nuances of performance tuning.
A thread emerges around the use of SIMD instructions and the potential for further optimization using AVX-512. Commenters discuss the trade-offs involved in using these advanced instruction sets, considering factors like CPU compatibility and potential power consumption increases. The author of the Blend2D library chimes in to explain their reasoning for not fully utilizing AVX-512 yet, citing compilation complexities and limited practical benefits in their current implementation.
Comparisons to other popular image decoding libraries like libpng and stb_image are also made. Commenters discuss the performance differences, highlighting Blend2D's competitive edge in certain scenarios. The simplicity and ease of integration of stb_image are acknowledged, while Blend2D is praised for its focus on performance.
Finally, some comments delve into specific technical details, such as the use of premultiplied alpha and the handling of different bit depths. These comments demonstrate a deeper understanding of the technical aspects of image processing and offer specific suggestions or raise questions about the implementation choices made in Blend2D. One commenter questions the usage of premultiplied alpha by default.
Overall, the comments section reveals a general appreciation for the author's work and the performance achieved by Blend2D. The discussion offers valuable insights into the technical challenges and trade-offs involved in optimizing image decoding libraries, along with comparisons to existing solutions.