hackslash dot org

MichiganTypeScript: A WebAssembly runtime implemented in TypeScript types

Posted: 2025-02-26 16:26:18

MichiganTypeScript is a proof-of-concept project demonstrating a WebAssembly runtime implemented entirely within TypeScript's type system. It doesn't actually execute WebAssembly code, but instead uses advanced type-level programming techniques to simulate its execution. By representing WebAssembly instructions and memory as types, and leveraging TypeScript's type inference and checking capabilities, the project can statically verify the behavior of a given WebAssembly program. This effectively transforms TypeScript's type checker into an interpreter, showcasing the power and flexibility of its type system, albeit in a non-practical, purely theoretical manner.

The GitHub repository "MichiganTypeScript/typescript-types-only-wasm-runtime," also known as MichiganTypeScript, presents a novel approach to WebAssembly (Wasm) runtime implementation. Instead of using traditional programming languages like C++, Rust, or JavaScript, this project leverages the TypeScript type system itself to emulate a Wasm runtime environment entirely within the type checker. This means no actual JavaScript code is generated or executed; the entire runtime logic, including loading Wasm modules, executing instructions, managing memory, and handling system calls, is encoded and enforced through complex type definitions and manipulations.

MichiganTypeScript achieves this by representing Wasm concepts as TypeScript types. For example, Wasm instructions are modeled as type-level functions, memory is represented using tuples, and the stack is simulated using recursive type definitions. The execution of a Wasm program within this system involves a series of intricate type transformations, where the TypeScript compiler effectively "steps through" the program by applying type-level functions that correspond to Wasm instructions. The compiler's type checking mechanism verifies that these transformations are valid according to the defined Wasm semantics.

The primary objective of this project is not to create a practical, performant Wasm runtime for real-world applications. Instead, it serves as an exploration of the boundaries of TypeScript's type system and a demonstration of its expressive power. By implementing a complex system like a Wasm runtime purely within the type system, the project showcases the potential of using types for advanced static analysis, program verification, and potentially even code generation in the future. While currently limited in terms of the Wasm features it supports and the size of programs it can handle, MichiganTypeScript represents a compelling experiment in pushing the limits of type-driven development. The project highlights the increasing sophistication of type systems in modern programming languages and opens up new avenues for research into leveraging them for tasks traditionally performed by runtime environments.

Summary of Comments ( 5 )
https://news.ycombinator.com/item?id=43185174

Hacker News users discussed the cleverness of using TypeScript's type system for computation, with several expressing fascination and calling it "amazing" or "brilliant." Some debated the practical applications, acknowledging its limitations while appreciating it as a demonstration of the type system's power. Concerns were raised about debugging complexity and the impracticality for larger programs. Others drew parallels to other Turing-complete type systems and pondered the potential for generating optimized WASM code from such TypeScript code. A few commenters pointed out the project's connection to the "ts-sql" project and speculated about leveraging similar techniques for compile-time query validation and optimization. Several users also highlighted the educational value of the project, showcasing the unexpected capabilities of TypeScript's type system.

The Hacker News post titled "MichiganTypeScript: A WebAssembly runtime implemented in TypeScript types" sparked a discussion with several interesting comments.

Many users expressed fascination and amusement at the project, highlighting the ingenuity and absurdity of implementing a Wasm runtime purely within TypeScript's type system. Some saw it as a clever demonstration of the power and flexibility of TypeScript's type system, pushing its boundaries beyond what might be considered practical. Others viewed it more as a playful experiment or a form of esoteric programming.

One commenter questioned the practical implications of the project, wondering about its potential use cases beyond being a proof of concept. This sparked a small thread discussing the potential for verifying Wasm modules at compile time or exploring new possibilities in type-level computation. However, the general consensus seemed to be that the project's primary value lies in its demonstration of the theoretical possibilities, rather than immediate practical applications.

Several users pointed out the similarities to other projects that explore the computational capabilities of type systems, particularly within languages like Idris and Haskell. This highlighted the connection between TypeScript's advanced type features and the concepts found in dependently-typed languages.

There was also some discussion regarding the performance and scalability of such an approach. Some commenters expressed skepticism about the feasibility of using this for real-world Wasm execution, anticipating potential performance bottlenecks.

A few users highlighted the educational value of the project, suggesting that it could be a useful tool for learning about both TypeScript's type system and the inner workings of Wasm.

Finally, some comments simply expressed amazement and appreciation for the creativity and technical skill demonstrated by the project's creators. Phrases like "mind-blowing," "absolutely bonkers," and "amazingly pointless" were used to capture the general sentiment of bewildered admiration.

Neut Programming Language

permalink

Posted: 2025-02-24 01:12:18

Neut is a statically-typed, compiled programming language designed for building reliable and maintainable systems software. It emphasizes simplicity and explicitness through its C-like syntax, minimal built-in features, and focus on compile-time evaluation. Key features include a powerful macro system enabling metaprogramming and code generation, algebraic data types for representing data structures, and built-in support for pattern matching. Neut aims to empower developers to write efficient and predictable code by offering fine-grained control over memory management and avoiding hidden runtime behavior. Its explicit design choices and limited standard library encourage developers to build reusable components tailored to their specific needs, promoting code clarity and long-term maintainability.

The Neut programming language, as described on its overview page, presents itself as a novel approach to software development, aiming to simplify the creation and maintenance of complex software systems. It achieves this through a core philosophy centered around immutability and relational programming principles, coupled with a unique execution model designed for efficiency and predictable behavior.

At the heart of Neut lies its immutable data model. All data within a Neut program is immutable, meaning that once a value is created, it cannot be modified. This inherent immutability eliminates a large class of potential bugs related to shared state and side effects, thereby increasing the reliability and predictability of the program's behavior. This contributes to a more straightforward reasoning process during development and debugging.

Neut embraces a relational programming paradigm, where computation is expressed as relationships between data, rather than as sequences of imperative instructions. This declarative style further enhances code clarity and maintainability, as it focuses on describing what the program should accomplish, rather than how it should achieve it. This approach allows the compiler to optimize execution more effectively, potentially exploiting parallelism and other performance enhancements.

The language incorporates a novel execution model based on a "pull-based" approach. Instead of explicitly specifying the order of operations, computation is driven by data dependencies. Values are computed only when they are needed, and the system automatically manages the flow of data through the program. This "lazy evaluation" strategy can lead to significant performance gains, particularly in situations where not all computed values are ultimately required. Furthermore, this demand-driven execution model inherently supports parallel processing, as independent computations can be executed concurrently.

Neut provides built-in support for concurrency and distributed computing, leveraging its immutable data model and pull-based execution to simplify the development of concurrent and distributed applications. The absence of mutable state eliminates the need for complex synchronization mechanisms, such as locks and mutexes, which are often sources of subtle bugs in concurrent programs.

The language is statically typed, providing compile-time guarantees about the correctness of programs. This static typing helps to prevent a wide range of errors early in the development cycle, leading to more robust and reliable software. Furthermore, the type system can aid in code understanding and refactoring.

Finally, Neut aims to be interoperable with existing software ecosystems, allowing integration with code written in other languages. This pragmatic approach acknowledges the reality of existing codebases and allows for gradual adoption of Neut in larger projects. This interoperability facilitates the leveraging of existing libraries and tools while benefiting from the advantages provided by Neut's innovative features.

Summary of Comments ( 27 )
https://news.ycombinator.com/item?id=43154883

HN commenters generally express interest in Neut, praising its focus on simplicity, safety, and explicitness. Several highlight the appealing aspects of linear types and the borrow checker, noting similarities to Rust but with a seemingly gentler learning curve. Some question the practical applicability of linear types for larger projects, while others anticipate its usefulness in specific domains like game development or embedded systems. A few commenters express skepticism about the limited standard library and the overall maturity of the project, but the overall tone is positive and curious about the language's potential. Performance, particularly relating to garbage collection or its lack thereof, is a recurring point of discussion, with some wondering about the potential for optimizations given the linear type system.

The Hacker News post for "Neut Programming Language" (https://news.ycombinator.com/item?id=43154883) has a modest number of comments, sparking a discussion around the language's unique features and potential applications.

Several commenters focus on Neut's core concept of "neural types," expressing interest in its potential for type-safe neural networks. One commenter highlights the challenge of representing complex neural network architectures within a type system, wondering how Neut handles concepts like skip connections and shared weights. Another commenter draws parallels with other typed functional programming languages used in machine learning, like Dex and F*. They question whether Neut offers significant advantages over these existing solutions.

The discussion also touches upon the practicalities of using Neut. One commenter inquires about the language's performance characteristics and the availability of debugging tools. Another raises the crucial question of integration with existing machine learning frameworks like TensorFlow or PyTorch. A separate comment expresses skepticism about the overall usefulness of strict typing for neural networks, arguing that the dynamic nature of the field often necessitates flexibility over rigid type safety.

A few comments delve into specific aspects of Neut's design. One points out the potential benefits of using dependent types for expressing tensor shapes and preventing common errors. Another discusses the implications of Neut's choice of Haskell as its implementation language.

Overall, the comments reflect a mixture of curiosity, skepticism, and cautious optimism. While some commenters are intrigued by Neut's novel approach to type safety in neural networks, others remain unconvinced of its practical benefits and express concerns about its integration with existing tools and workflows. The limited number of comments, however, prevents a truly in-depth exploration of the language's potential and drawbacks.

TinyCompiler: A compiler in a week-end

permalink

Posted: 2025-02-20 22:02:59

This blog post chronicles the author's weekend project of building a compiler for a simplified C-like language. It walks through the implementation of a lexical analyzer, parser (using recursive descent), and code generator targeting x86-64 assembly. The compiler handles basic arithmetic operations, variable declarations and assignments, if/else statements, and while loops. The post emphasizes simplicity and educational value over performance or completeness, providing a practical example of compiler construction principles in a digestible format. The code is available on GitHub for readers to explore and experiment with.

This blog post, "TinyCompiler: A compiler in a week-end," chronicles the author's journey in creating a simplified compiler from scratch over a weekend. The primary goal wasn't to build a production-ready tool but rather a practical learning exercise to solidify the author's understanding of compiler construction principles. The compiler targets Monkey, a language inspired by the author's previous Monkey interpreter project. The post meticulously details each stage of the compiler's development, emphasizing clarity and simplicity over optimization or feature completeness.

The process begins with lexical analysis (lexing), which transforms the raw Monkey source code into a stream of tokens. These tokens represent meaningful units like keywords, identifiers, operators, and punctuation. The author employs regular expressions to recognize these patterns in the input string and generate corresponding token objects. The post includes snippets of C++ code demonstrating the implementation of this lexing process.

Following lexing, the compiler proceeds to parsing. The parser takes the stream of tokens and organizes them into an Abstract Syntax Tree (AST). This tree-like structure represents the grammatical structure of the source code, making it easier to analyze and manipulate. The author uses a recursive descent parsing technique, writing functions to handle each grammatical rule of the Monkey language. The post explains how the parser combines tokens into higher-level constructs like expressions, statements, and program blocks, mirroring the grammar rules defined for Monkey. Code examples illustrating the recursive nature of the parsing process are provided.

The final stage covered in the post is code generation. With the AST constructed, the compiler translates it into assembly language for a hypothetical stack-based virtual machine. This process involves traversing the AST and emitting corresponding assembly instructions for each node. The post demonstrates how different AST nodes, representing various language constructs, are converted into equivalent VM instructions. The chosen assembly language targets a simple virtual machine, enabling the author to focus on the core principles of code generation without delving into the complexities of a real-world target architecture. The post includes detailed explanations and C++ code snippets showing how arithmetic expressions, variable assignments, and conditional statements are translated into assembly instructions. The author acknowledges that this simple compiler lacks optimization and error handling features, prioritizing educational value over practical utility. The post concludes by reflecting on the learning experience and offering potential avenues for extending the project further.

Summary of Comments ( 58 )
https://news.ycombinator.com/item?id=43120873

HN users largely praised the TinyCompiler project for its educational value, highlighting its clear code and approachable structure as beneficial for learning compiler construction. Several commenters discussed extending the compiler's functionality, such as adding support for different architectures or optimizing the generated code. Some pointed out similar projects or resources, like the "Let's Build a Compiler" tutorial and the Crafting Interpreters book. A few users questioned the "weekend" claim in the title, believing the project would take significantly longer for a novice to complete. The post also sparked discussion about the practical applications of such a compiler, with some suggesting its use for educational purposes or embedding in resource-constrained environments. Finally, there was some debate about the complexity of the compiler compared to more sophisticated tools like LLVM.

The Hacker News post "TinyCompiler: A compiler in a week-end" generated a fair amount of discussion, with several commenters sharing their perspectives and experiences related to compiler construction.

A prevalent theme in the comments is the accessibility and educational value of the project. Many commenters praised the author for creating a simplified yet functional compiler, making the often-daunting task of compiler development more approachable for beginners. Some users shared their personal experiences of using similar projects as a starting point for learning about compilers, emphasizing the importance of hands-on projects in grasping the underlying concepts.

Several comments delve into technical details, discussing specific aspects of the compiler's implementation, such as the parsing techniques, code generation strategies, and the choice of target language (assembly). Some commenters pointed out potential improvements or alternative approaches, fostering a constructive discussion about compiler design choices. For example, there's discussion around the use of recursive descent parsing and the handling of operator precedence.

A few comments touch upon the project's scope and limitations. While acknowledging the project's educational merit, some commenters rightly point out that it's a simplified example and doesn't cover the full complexity of real-world compilers. They mention aspects like optimization, error handling, and support for more advanced language features as areas where the tiny compiler differs from production-ready compilers.

The value of such simplified projects as learning tools is a recurring point of discussion. Commenters argue that focusing on a smaller, manageable project allows beginners to grasp the fundamental principles without being overwhelmed by the intricacies of a full-blown compiler. This sentiment reinforces the project's goal of making compiler development accessible to a wider audience.

Finally, some comments offer links to related resources, including other compiler tutorials, open-source compiler projects, and books on compiler construction. This further contributes to the educational value of the discussion, providing avenues for those interested in exploring the topic further.

F8 – an 8 bit architecture designed for C and memory efficiency [video]

permalink

Posted: 2025-02-17 21:24:17

The F8 is a new 8-bit computer architecture designed for efficiency in both code size and memory usage, especially when programming in C. It aims to achieve performance comparable to 16-bit systems while maintaining the simplicity and resource efficiency of 8-bit designs. This is accomplished through features like a hybrid stack/register-based architecture, variable-width instructions, and dedicated instructions for common C operations like pointer manipulation and function calls. The F8 also emphasizes practical applications with features like a built-in bootloader and support for direct connection to peripherals.

This FOSDEM 2025 presentation, titled "F8 – an 8-bit architecture designed for C and memory efficiency," introduces F8, a novel 8-bit computer architecture meticulously crafted for optimal performance with the C programming language while simultaneously prioritizing memory efficiency. The architecture's design philosophy centers around minimizing memory footprint and maximizing code density, crucial factors for resource-constrained embedded systems and other environments where memory is a premium. Unlike many existing 8-bit architectures that often necessitate assembly language programming for effective utilization of limited resources, F8 aims to empower developers to leverage the power and expressiveness of the C language without incurring the typical memory overhead associated with higher-level languages.

The presentation delves into the specific architectural choices made in the design of F8 that contribute to its memory efficiency and C-friendliness. This includes discussion of the instruction set architecture (ISA), which is likely optimized for common C language constructs and operations. The memory model and addressing modes are also explored, highlighting how they are structured to facilitate efficient data access and manipulation within the constraints of an 8-bit system. Further details are likely provided on the register set and how it balances the need for sufficient working registers with the desire to minimize overall processor state and memory usage.

Beyond the core architectural features, the presentation also likely covers the associated tooling and software ecosystem surrounding F8. This might include details on the available C compiler, assembler, linker, and debugger, as well as any supporting libraries or frameworks designed to simplify development for the platform. The potential benefits of using F8 are likely showcased, emphasizing its suitability for applications requiring a small memory footprint, low power consumption, or simple implementation. These applications could potentially range from small embedded controllers and sensor nodes to retro-computing projects or educational platforms. Overall, the presentation aims to provide a comprehensive overview of the F8 architecture, its underlying design principles, and its potential applications in the realm of resource-constrained computing.

Summary of Comments ( 24 )
https://news.ycombinator.com/item?id=43083429

Hacker News users discussed the F8 architecture's unusual design choices. Several commenters questioned the practical applications given the performance tradeoffs for memory efficiency, particularly with modern memory availability. Some debated the value of 8-bit architectures in niche applications like microcontrollers, while others pointed out existing alternatives like AVR. The unusual register structure and lack of hardware stack were also discussed, with some suggesting it might hinder C compiler optimization. A few expressed interest in the unique approach, though skepticism about real-world viability was prevalent. Overall, the comments reflected a cautious curiosity towards F8 but with reservations about its usefulness compared to established architectures.

The Hacker News post discussing the F8 architecture has generated several comments, delving into various aspects of the project.

Several commenters discuss the trade-offs between an 8-bit architecture like F8 and more common 32-bit architectures. One commenter questions the rationale behind using an 8-bit architecture in modern times, highlighting the prevalence and efficiency of 32-bit microcontrollers. They argue that while code size might be smaller on an 8-bit system, the performance gains of a 32-bit system likely outweigh this benefit in most scenarios. This sparks a discussion about the niche applications where an 8-bit architecture might still be relevant, such as extremely resource-constrained environments or situations requiring backward compatibility with legacy systems.

Another thread of discussion focuses on the specific design choices of the F8 architecture, particularly its register-based design and the decision to optimize for C programming. Commenters debate the merits of this approach compared to other 8-bit architectures or more specialized hardware designs. Some express skepticism about the claimed memory efficiency gains, pointing out the overhead introduced by the C compiler and the relatively limited register set. Others are intrigued by the potential of the F8 architecture for specific embedded applications, especially those involving control systems or sensor networks.

The discussion also touches upon the broader context of retrocomputing and the resurgence of interest in older or less common architectures. Some commenters see projects like F8 as valuable explorations of alternative computing paradigms, while others question their practical relevance in the face of established industry standards.

Finally, several commenters express interest in learning more about the technical details of the F8 architecture and its implementation. They inquire about the availability of documentation, simulators, or open-source code, demonstrating a desire to engage with the project beyond the initial presentation.

Zeroperl: Sandboxing Perl with WebAssembly

permalink

Posted: 2025-02-11 20:11:37

Zeroperl leverages WebAssembly (Wasm) to create a secure sandbox for executing Perl code. It compiles a subset of Perl 5 to Wasm, allowing scripts to run in a browser or server environment with restricted capabilities. This approach enhances security by limiting access to the host system's resources, preventing malicious code from wreaking havoc. Zeroperl utilizes a custom runtime environment built on Wasmer, a Wasm runtime, and focuses on supporting commonly used Perl modules for tasks like text processing and bioinformatics. While not aiming for full Perl compatibility, Zeroperl offers a secure and efficient way to execute specific Perl workloads in constrained environments.

Andrew Gallant's blog post, "Zeroperl: Sandboxing Perl with WebAssembly," details a project aiming to leverage WebAssembly (Wasm) to create a secure and portable execution environment for Perl programs. The core motivation is to address the inherent security risks associated with running untrusted Perl code, especially in contexts like online code evaluation platforms or automated systems processing user-submitted scripts. Traditional sandboxing methods for Perl, often involving intricate system calls and permission manipulation, can be complex and prone to vulnerabilities. Wasm, by its design, offers a more robust and predictable sandbox environment.

Zeroperl seeks to compile Perl programs into Wasm modules, allowing them to run within a browser or any other Wasm runtime. This compilation process involves using a specialized backend for the B::C compiler infrastructure within Perl. B::C transforms Perl code into an intermediate representation that can then be further translated into various target languages, including, in this case, Wasm. The post highlights that this isn't a full Perl interpreter running within Wasm, but rather a targeted compilation process that transforms specific Perl scripts into Wasm equivalents. This approach focuses on executing individual scripts, rather than providing a generalized Perl environment within the Wasm runtime.

Gallant outlines the benefits of this Wasm-based approach. Firstly, Wasm's inherent memory safety and restricted access to system resources provide a strong security barrier against malicious code. Secondly, the portability of Wasm enables the execution of these sandboxed Perl programs on diverse platforms without modification, simplifying deployment and management. Thirdly, Zeroperl utilizes Wasmtime, a fast and standards-compliant Wasm runtime, contributing to efficient execution of the compiled Perl scripts.

The post delves into the technical details of the compilation process. It explains how Perl's dynamic nature presents challenges for static compilation to Wasm. To address this, Zeroperl utilizes techniques like embedding pre-compiled bytecode and implementing a subset of Perl's operations within the Wasm module. This balances performance and compatibility. The implementation is described as being in its early stages, with ongoing work to expand the supported Perl features and optimize the generated Wasm code.

Gallant illustrates the concept with an example demonstrating the execution of a simple Perl script compiled to Wasm. The post concludes by emphasizing the potential of Zeroperl to empower safer execution of untrusted Perl code in various applications, paving the way for more secure and versatile scripting environments. It also acknowledges the project's experimental nature and encourages community involvement in its further development.

Summary of Comments ( 15 )
https://news.ycombinator.com/item?id=43017739

Hacker News commenters generally expressed interest in Zeroperl, praising its innovative approach to sandboxing Perl using WebAssembly. Some questioned the performance implications of this method, wondering if it would introduce significant overhead. Others discussed alternative sandboxing techniques, like using containers or VMs, comparing their strengths and weaknesses to WebAssembly. Several users highlighted potential use cases, particularly for serverless functions and other cloud-native environments. A few expressed skepticism about the viability of fully securing Perl code within WebAssembly given Perl's dynamic nature and CPAN module dependencies. One commenter offered a detailed technical explanation of why certain system calls remain accessible despite the sandbox, emphasizing the ongoing challenges inherent in securing dynamic languages.

OpenLDK: A Java JIT compiler and runtime in Common Lisp

permalink

Posted: 2025-02-05 12:17:47

OpenLDK is a project that implements a Java Virtual Machine (JVM) and Just-In-Time (JIT) compiler written entirely in Common Lisp. It aims to be a high-performance JVM alternative, leveraging Lisp's metaprogramming capabilities for dynamic code generation and optimization. The project features a modular design, encompassing a bytecode interpreter, a tiered JIT compiler using a method-based compilation strategy, and a garbage collector. OpenLDK is considered experimental and under active development, focusing on performance enhancements and broader Java compatibility.

OpenLDK (Open Lisp Development Kit) is an ambitious project aiming to create a high-performance Java implementation entirely within Common Lisp. It strives to not simply interpret Java bytecode, but to leverage the power of Lisp's metaprogramming capabilities to compile Java code into efficient native machine code using a Just-In-Time (JIT) compiler. This approach offers the potential for significant performance gains compared to traditional Java interpreters or even some existing JIT compilers.

The project's core is a custom-built JIT compiler written in Common Lisp. This compiler takes Java bytecode as input and translates it into native machine instructions for the target architecture. The choice of Common Lisp as the implementation language is driven by its powerful macro system and flexible runtime environment, which facilitate complex code transformations and optimizations required for an effective JIT compiler. This allows for a potentially more adaptable and extensible JIT compilation process compared to compilers written in lower-level languages.

OpenLDK also includes a runtime environment written in Common Lisp. This environment provides the necessary infrastructure for executing compiled Java code, including features like garbage collection, thread management, and access to the underlying operating system. By implementing the runtime in Lisp, OpenLDK gains greater control over these crucial aspects of Java execution and can potentially tailor them to specific needs or hardware platforms. This could theoretically enable experimentation with novel garbage collection strategies or concurrency models.

The project aims to be a fully compliant Java implementation, supporting a wide range of Java features and libraries. While still in its early stages of development, the roadmap indicates intentions to eventually support the full Java standard library and potentially even some popular third-party libraries. This implies significant ongoing development efforts are required to achieve full Java compatibility.

OpenLDK leverages several existing Lisp libraries for its functionality, including the cl-jclasslib library for parsing Java class files and accessing bytecode information. This reliance on existing components demonstrates a pragmatic approach to development, building upon the mature Lisp ecosystem rather than reinventing the wheel.

While the project presents a novel and intriguing approach to Java implementation, it is important to note its experimental nature. The project's documentation acknowledges that it is still under active development and may not be suitable for production use. There's likely a considerable amount of work remaining to achieve a stable and performant Java environment. However, the project's potential to explore new frontiers in JIT compilation and language interoperability makes it a compelling endeavor.

Summary of Comments ( 16 )
https://news.ycombinator.com/item?id=42947447

Commenters on Hacker News express interest in OpenLDK, primarily focusing on its unusual implementation of a Java Virtual Machine (JVM) in Common Lisp. Several question the practical applications and performance implications of this approach, wondering about its speed and suitability for real-world projects. Some highlight the potential benefits of Lisp's dynamic nature for tasks like debugging and introspection. Others draw parallels to similar projects like Clojure and GraalVM, discussing their respective advantages and disadvantages. A few express skepticism about the long-term viability of the project, while others praise the technical achievement and express curiosity about its potential. The novelty of using Lisp for JVM implementation clearly sparks the most discussion.

The Hacker News discussion on OpenLDK, a Java JIT compiler and runtime written in Common Lisp, features a moderate number of comments that explore several interesting facets of the project.

A recurring theme is the perceived unusual choice of Lisp for implementing a Java Virtual Machine (JVM). Several commenters express surprise or curiosity about this decision, questioning the performance implications and rationale behind it. Some speculate about potential benefits, like the flexibility and metaprogramming capabilities of Lisp, which could facilitate experimentation and potentially lead to innovative JVM features. However, skepticism regarding performance, particularly garbage collection and runtime speed, is also voiced.

There's significant discussion surrounding the practical applications and target audience of OpenLDK. Commenters ponder whether it's intended for specialized use cases like embedded systems or niche applications where the dynamic nature of Lisp might be advantageous. Others question its competitiveness against established JVMs like Hotspot in terms of performance for general-purpose Java development.

Some commenters delve into the technical details of the project, inquiring about specific implementation choices, like garbage collection strategies and the interaction between Lisp and the generated Java bytecode. There's interest in understanding how the Lisp environment influences the JIT compilation process and the overall runtime behavior.

The maintainability and future development of the project are also brought up. Given the relatively niche nature of Lisp, some commenters express concern about the long-term viability and potential for community contributions. There are questions about the project's roadmap and whether it aims to become a fully featured, production-ready JVM.

Finally, the historical context of Lisp in the JVM ecosystem is mentioned. Commenters recall previous attempts to bridge these two worlds, referencing projects like Clojure and ABCL, and discussing the lessons learned from those endeavors. This historical perspective adds another layer to the conversation, highlighting the challenges and opportunities of combining Lisp and Java technologies.

I Wrote a WebAssembly VM in C

permalink

Posted: 2025-02-03 14:30:11

The author details their process of creating a WebAssembly (Wasm) virtual machine (VM) written entirely in C. Driven by a desire for a lightweight, embeddable Wasm runtime for resource-constrained environments, they built the VM from scratch, implementing core features like the stack-based execution model, linear memory, and basic WebAssembly System Interface (WASI) support. The project focused on simplicity and understandability over performance, serving primarily as a learning exercise and a platform for experimentation with Wasm. The post walks through key aspects of the VM's design and implementation, including parsing the Wasm binary format, handling function calls, and managing memory. It also highlights the challenges faced and lessons learned during the development process.

In a detailed blog post titled "I Wrote a WebAssembly VM in C," the author chronicles their journey of creating a WebAssembly (Wasm) virtual machine from scratch using the C programming language. Their primary motivation stemmed from a desire to deeply understand the inner workings of Wasm, moving beyond simply utilizing existing tools and libraries. This hands-on approach allowed them to grasp the intricacies of the Wasm specification and the challenges involved in its implementation.

The post begins by outlining the core components of a Wasm VM, including the stack, memory, and execution environment. The author then meticulously describes the process of parsing and interpreting Wasm bytecode, explaining how each instruction is handled by the VM. They delve into the complexities of implementing the stack-based virtual machine architecture, covering topics such as operand evaluation, function calls, and local variable management. Specific instructions, like i32.add and local.get, are used as examples to illustrate the execution flow and data manipulation within the VM.

The development process involved several iterative steps. The author started with a basic framework capable of executing simple arithmetic operations and gradually expanded its functionality to support more complex features like function calls and control flow instructions. They emphasized the importance of rigorous testing throughout the development cycle, using carefully crafted test cases to ensure the correctness of their implementation.

The author acknowledges that their implementation is not fully compliant with the complete Wasm specification, focusing primarily on a subset of core instructions. However, this simplified approach served their educational purpose of gaining a foundational understanding of Wasm execution. The post concludes with a reflection on the lessons learned during the project and a discussion of potential future enhancements, including adding support for more advanced Wasm features and optimizing the VM's performance. The author's code, written entirely in C, is available publicly for others to explore and learn from, offering a tangible resource for anyone interested in diving into the world of WebAssembly virtual machines.

Summary of Comments ( 43 )
https://news.ycombinator.com/item?id=42918524

Hacker News users generally praised the author's clear writing style and the educational value of the post. Several commenters discussed the project's performance, noting that it's not optimized for speed and suggesting potential improvements like just-in-time compilation. Some shared their own experiences with WASM interpreters and related projects, including comparisons to other implementations and alternative approaches like using a stack machine. Others appreciated the detailed explanation of the parsing and execution process, finding it helpful for understanding WASM internals. A few users pointed out minor corrections or areas for potential enhancement in the code, demonstrating active engagement with the technical details.

The Hacker News post "I Wrote a WebAssembly VM in C" (https://news.ycombinator.com/item?id=42918524) generated a moderate amount of discussion, with several commenters engaging with the project and offering insights or related experiences.

A recurring theme was admiration for the author's undertaking, with several commenters acknowledging the complexity and difficulty of writing a Wasm VM. One commenter pointed out the educational value of such projects, emphasizing the deep understanding of Wasm's internals that one gains through implementation. They also noted that while Wasm is often perceived as a compilation target, understanding its runtime environment is equally crucial.

Another user shared a personal anecdote of a similar project, where they wrote a Wasm interpreter in Rust. They explained that their motivation stemmed from a need to run Wasm in a constrained embedded environment lacking a JIT compiler. This comment highlighted a practical use case for Wasm interpreters, contrasting with the more common JIT-based implementations.

A discussion unfolded about the performance characteristics of interpreted Wasm versus compiled Wasm. One commenter questioned the practical applicability of interpreters, speculating that their performance limitations might restrict their usefulness. Another user countered this by suggesting potential niche applications, such as debugging or educational purposes, where raw performance is less critical than other features like understandability and control. They also mentioned the possibility of using an interpreter as a fallback mechanism when JIT compilation is unavailable.

The author of the Wasm VM chimed in to address some of these questions. They clarified that the project was primarily an educational exercise, not intended for production use. They acknowledged the performance limitations of interpretation and confirmed they had no plans to add a JIT compiler. They also engaged with other commenters, discussing technical details of their implementation, such as the handling of garbage collection.

Finally, one comment drew a parallel between the author's project and the early days of Java, where interpreted execution was common before JIT compilation became prevalent. This comparison highlighted the potential evolution of Wasm runtimes, suggesting that interpreters might play a more significant role in the future, particularly in resource-constrained environments.

I wrote my own “proper” programming language (2020)

permalink

Posted: 2025-01-22 09:54:25

Mukul Rathi details his journey of creating a custom programming language, focusing on the compiler construction process. He explains the key stages involved, from lexing (converting source code into tokens) and parsing (creating an Abstract Syntax Tree) to code generation and optimization. Rathi uses his language, which he implements in OCaml, to illustrate these concepts, providing code examples and explanations of how each component works together to transform high-level code into executable machine instructions. He emphasizes the importance of understanding these foundational principles for anyone interested in building their own language or gaining a deeper appreciation for how programming languages function.

In a comprehensive blog post titled "I wrote my own “proper” programming language," author Mukul Rathi chronicles the journey of designing and implementing a programming language from its nascent conceptual stages to a functional, albeit rudimentary, state. He meticulously details the process of building a compiler, breaking down the complex task into manageable, discrete steps.

The post begins by outlining the fundamental architecture of a compiler, illustrating the typical workflow from source code to executable program. This includes lexical analysis, where the input code is tokenized; parsing, which involves constructing an Abstract Syntax Tree (AST) to represent the code's structure; semantic analysis, where type checking and other semantic rules are enforced; and finally, code generation, where the AST is translated into intermediate representations like bytecode or assembly language.

Rathi delves into the specifics of his implementation, utilizing Python as the language for his compiler. He elucidates the lexical analyzer’s role in categorizing individual components of the source code, such as keywords, identifiers, and operators, transforming the raw text into a stream of meaningful tokens. The parsing stage, he explains, involves organizing these tokens into a hierarchical tree structure – the AST – which reflects the grammatical relationships between different parts of the code. This is achieved using a recursive descent parsing technique.

Furthermore, the post underscores the importance of semantic analysis, which goes beyond mere syntax verification and delves into the meaning of the code. This crucial step involves ensuring type compatibility, checking for undeclared variables, and enforcing other language-specific semantic rules. Rathi describes how his compiler performs these checks, thereby ensuring the logical integrity of the program.

Finally, the post culminates in a discussion of code generation. While stopping short of generating machine code directly, Rathi explains how his compiler generates bytecode, a lower-level representation of the program. This bytecode can then be executed by a virtual machine, effectively bridging the gap between high-level source code and the underlying hardware. He emphasizes that while his compiler does not perform all the optimizations a production-ready compiler would, it demonstrates the essential steps involved in translating a high-level programming language into an executable format. The post concludes by acknowledging the project's limitations while highlighting its educational value as a practical exercise in compiler construction.

Summary of Comments ( 13 )
https://news.ycombinator.com/item?id=42791036

Hacker News users generally praised the article for its clarity and accessibility in explaining compiler construction. Several commenters appreciated the author's approach of building a complete, albeit simple, language instead of just a toy example. Some pointed out the project's similarity to the "Let's Build a Compiler" series, while others suggested alternative or supplementary resources like Crafting Interpreters and the LLVM tutorial. A few users discussed the tradeoffs between hand-written lexers/parsers and using parser generator tools, and the challenges of garbage collection implementation. One commenter shared their personal experience of writing a language and the surprising complexity of seemingly simple features.

The Hacker News thread for "I wrote my own “proper” programming language (2020)" contains several comments discussing various aspects of the linked article.

Many comments focus on tooling and alternative approaches to building a programming language. One user suggests using tools like Lex/Yacc or Flex/Bison for lexical analysis and parsing, offering a more robust and less error-prone method than manual implementation. This comment sparked a small discussion thread with another user pointing out that while powerful, these tools can add complexity, especially for beginners. They advocate for a simpler approach initially, recommending a hand-rolled recursive descent parser for its educational value in understanding the underlying mechanisms. This exchange highlights the trade-off between ease of implementation and the robustness of the final product.

Another commenter discusses the evolution of compiler construction and how techniques and tools have changed over time. They specifically mention the shift towards using LLVM as a backend for code generation and optimization. This offers the advantage of targeting multiple platforms without rewriting the backend for each one.

Several users commend the author of the article for undertaking such a complex project and sharing their knowledge. They praise the clear explanations and the step-by-step approach presented in the article, finding it accessible even for those without prior compiler development experience.

Some comments delve into specific aspects of the implementation, such as garbage collection, with one commenter suggesting exploring different garbage collection strategies. Another thread discusses the performance implications of different language design choices, emphasizing the importance of considering efficiency from the start.

One user expresses a common sentiment among language developers, mentioning the inherent difficulty and complexity involved in creating a "proper" programming language. They acknowledge the effort required for not just initial implementation, but also ongoing maintenance and improvement.

Finally, a few comments express interest in the language's potential applications and its future development. They inquire about specific features and express a desire to see the project evolve.

Stories with Tag virtual machine

Summary of Comments ( 5 ) https://news.ycombinator.com/item?id=43185174

Summary of Comments ( 27 ) https://news.ycombinator.com/item?id=43154883

Summary of Comments ( 58 ) https://news.ycombinator.com/item?id=43120873

Summary of Comments ( 24 ) https://news.ycombinator.com/item?id=43083429

Summary of Comments ( 15 ) https://news.ycombinator.com/item?id=43017739

Summary of Comments ( 16 ) https://news.ycombinator.com/item?id=42947447

Summary of Comments ( 43 ) https://news.ycombinator.com/item?id=42918524

Summary of Comments ( 13 ) https://news.ycombinator.com/item?id=42791036

Summary of Comments ( 5 )
https://news.ycombinator.com/item?id=43185174

Summary of Comments ( 27 )
https://news.ycombinator.com/item?id=43154883

Summary of Comments ( 58 )
https://news.ycombinator.com/item?id=43120873

Summary of Comments ( 24 )
https://news.ycombinator.com/item?id=43083429

Summary of Comments ( 15 )
https://news.ycombinator.com/item?id=43017739

Summary of Comments ( 16 )
https://news.ycombinator.com/item?id=42947447

Summary of Comments ( 43 )
https://news.ycombinator.com/item?id=42918524

Summary of Comments ( 13 )
https://news.ycombinator.com/item?id=42791036