Story Details

  • Visualizing all books of the world in ISBN-Space

    Posted: 2025-02-01 09:27:06

    The blog post explores visualizing the "ISBN space" by treating ISBN-13s as coordinates in 13-dimensional space and projecting them down to 2D using dimensionality reduction techniques like t-SNE and UMAP. The author uses a dataset of over 20 million book records from Open Library, coloring the resulting visualizations by publication year or language. The resulting scatter plots reveal interesting clusters, suggesting that ISBNs, despite being assigned sequentially, exhibit some grouping based on book characteristics. The visualizations also highlight the limitations of these dimensionality reduction methods, as some seemingly close points in the 2D projection are actually quite distant in the original 13-dimensional space.

    Summary of Comments ( 35 )
    https://news.ycombinator.com/item?id=42897120

    Commenters on Hacker News largely praised the visualization and the author's approach to exploring the ISBN dataset. Several pointed out interesting patterns revealed by the visualization, such as the clustering of books by language and subject matter. Some discussed the limitations of using ISBNs for this kind of analysis, noting that not all books have ISBNs (especially older ones) and the system itself has undergone changes over time. Others offered suggestions for improvements or further exploration, such as incorporating data about book sales or using different dimensionality reduction techniques. A few commenters shared related projects or resources, including visualizations of other datasets and tools for working with ISBNs. The overall sentiment was one of appreciation for the project and its insightful presentation of complex data.