The paper "Stop using the elbow criterion for k-means" argues against the common practice of using the elbow method to determine the optimal number of clusters (k) in k-means clustering. The authors demonstrate that the elbow method is unreliable, often identifying spurious elbows or missing genuine ones. They show this through theoretical analysis and empirical examples across various datasets and distance metrics, revealing how the within-cluster sum of squares (WCSS) curve, on which the elbow method relies, can behave unexpectedly. The paper advocates for abandoning the elbow method entirely in favor of more robust and theoretically grounded alternatives like the gap statistic, silhouette analysis, or information criteria, which offer statistically sound approaches to k selection.
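The WCSS curve at the heart of the elbow method takes only a few lines to produce. The sketch below (an illustration of the failure mode, not code from the paper; it uses scikit-learn's `KMeans`, whose `inertia_` attribute is exactly the WCSS) shows that the curve keeps falling as k grows even on uniform data with no cluster structure at all, which is one way spurious elbows arise:

```python
# Illustration (not from the paper): even on structureless uniform data,
# the within-cluster sum of squares (WCSS) decreases as k grows, so the
# curve can show an apparent "elbow" that reflects no real clustering.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
X = rng.uniform(size=(500, 2))  # uniform points: no true clusters

# KMeans.inertia_ is the WCSS of the fitted clustering.
wcss = [KMeans(n_clusters=k, n_init=10, random_state=0).fit(X).inertia_
        for k in range(1, 11)]
```

Any "elbow" read off this curve is an artifact of the metric, not a property of the data.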
This paper provides a comprehensive overview of percolation theory, focusing on its mathematical aspects. It explores bond and site percolation on lattices, examining key concepts like critical probability, the existence of infinite clusters, and critical exponents characterizing the behavior near the phase transition. The text delves into various methods used to study percolation, including duality, renormalization group techniques, and series expansions. It also discusses different percolation models beyond regular lattices, like continuum percolation and directed percolation, highlighting their unique features and applications. Finally, the paper connects percolation theory to other areas like random graphs, interacting particle systems, and the study of disordered media, showcasing its broad relevance in statistical physics and mathematics.
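The spanning-cluster phenomenon at the core of the theory can be demonstrated with a short simulation. This is an illustrative sketch, not code from the paper: site percolation on an L×L square lattice, with scipy's connected-component labelling finding clusters; a cluster touching both the top and bottom rows plays the role of the infinite cluster, and the spanning probability jumps sharply near the square-lattice critical probability p_c ≈ 0.5927.

```python
# Illustrative sketch: site percolation on an L x L square lattice.
# Each site is open with probability p; we test whether an open cluster
# spans from the top row to the bottom row.
import numpy as np
from scipy.ndimage import label

def spans(p, L=50, seed=0):
    rng = np.random.default_rng(seed)
    open_sites = rng.random((L, L)) < p
    # label() with its default structure uses 4-connectivity in 2-D.
    labels, _ = label(open_sites)
    top = set(labels[0][labels[0] > 0])
    bottom = set(labels[-1][labels[-1] > 0])
    return bool(top & bottom)  # True if some cluster spans the lattice
```

Sweeping p from 0 to 1 and averaging `spans` over many seeds traces out the phase transition the paper analyzes.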
HN commenters discuss the applications of percolation theory, mentioning its relevance to forest fires, disease spread, and network resilience. Some highlight the beauty and elegance of the theory itself, while others note its accessibility despite being a relatively advanced topic. A few users share personal experiences using percolation theory in their work, including modeling concrete porosity and analyzing social networks. The concept of universality in percolation, where different systems exhibit similar behavior near the critical threshold, is also pointed out. One commenter links to an interactive percolation simulation, allowing others to experiment with the concepts discussed. Finally, the historical context and development of percolation theory are briefly touched upon.
Sort_Memories is a Python script that automatically sorts group photos by the number of people present in each picture. Using face detection and recognition, the script analyzes images, identifies faces, and groups photos into output folders according to a user-defined number N of people per photo. This lets users organize their photo collections by separating pictures of individuals, couples, small groups, or larger gatherings, automating a tedious manual process.
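A minimal sketch of the apparent approach (the real Sort_Memories internals may differ; the `face_recognition` package here is an assumed stand-in for whatever detector the script uses): count faces per image, then group photos by that count.

```python
import os
import shutil
from collections import defaultdict

def count_faces(path):
    # face_recognition is an assumed dependency, imported lazily so the
    # rest of this sketch works without it installed.
    import face_recognition
    image = face_recognition.load_image_file(path)
    return len(face_recognition.face_locations(image))

def group_by_count(counts):
    """Map each face count to the list of photos with that count."""
    groups = defaultdict(list)
    for path, n in counts.items():
        groups[n].append(path)
    return dict(groups)

def sort_photos(src_dir, out_dir):
    counts = {name: count_faces(os.path.join(src_dir, name))
              for name in os.listdir(src_dir)
              if name.lower().endswith((".jpg", ".jpeg", ".png"))}
    for n, names in group_by_count(counts).items():
        dest = os.path.join(out_dir, f"{n}_people")  # hypothetical naming
        os.makedirs(dest, exist_ok=True)
        for name in names:
            shutil.copy(os.path.join(src_dir, name), dest)
```

Folder names and file-type filtering are illustrative choices, not the script's actual conventions.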
Hacker News commenters generally praised the project for its clever use of facial recognition to solve a common problem. Several users pointed out potential improvements, such as handling images where faces are partially obscured or not clearly visible, and suggested alternative approaches like clustering algorithms. Some discussed the privacy implications of using facial recognition technology, even locally. There was also interest in expanding the functionality to include features like identifying the best photo out of a burst or sorting based on other criteria like smiles or open eyes. Overall, the reception was positive, with commenters recognizing the project's practical value and potential.
Summary of Comments (13)
https://news.ycombinator.com/item?id=43450550
HN users discuss the problems with the elbow method for determining the optimal number of clusters in k-means, agreeing it's often unreliable and subjective. Several commenters suggest superior alternatives, such as the silhouette coefficient, gap statistic, and information criteria like AIC/BIC. Some highlight the importance of considering the practical context and the "business need" when choosing the number of clusters, rather than relying solely on statistical methods. Others point out that k-means itself may not be the best clustering algorithm for all datasets, recommending DBSCAN and hierarchical clustering as potentially better suited for certain situations, particularly those with non-spherical clusters. A few users mention the difficulty in visualizing high-dimensional data and interpreting the results of these metrics, emphasizing the iterative nature of cluster analysis.
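One of the suggested alternatives can be sketched in a few lines with scikit-learn (an illustration of the commenters' suggestion, not code from the thread): fit k-means over a range of k and keep the k with the highest silhouette coefficient.

```python
# Sketch: choose k by maximizing the silhouette coefficient instead of
# eyeballing an elbow in the WCSS curve.
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

X, _ = make_blobs(n_samples=300, centers=4, random_state=42)

scores = {}
for k in range(2, 8):  # silhouette needs at least 2 clusters
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X)
    scores[k] = silhouette_score(X, labels)

best_k = max(scores, key=scores.get)
```

Unlike the elbow, this yields a single, reproducible number rather than a judgment call about curvature.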
The Hacker News post titled "Stop using the elbow criterion for k-means" (https://news.ycombinator.com/item?id=43450550) discusses the linked arXiv paper which argues against using the elbow method for determining the optimal number of clusters in k-means clustering. The comments section is relatively active, featuring a variety of perspectives on the topic.
Several commenters agree with the premise of the article. They point out that the elbow method is often subjective and unreliable, leading to arbitrary choices for the number of clusters. Some users share anecdotal experiences of the elbow method failing to produce meaningful results or being difficult to interpret. One commenter suggests the gap statistic as a more robust alternative.
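A minimal version of that suggestion (a sketch of the gap statistic of Tibshirani et al., not code from the thread; the number of reference datasets and the k range are arbitrary choices here): compare log(WCSS) on the data against its average on uniform reference data drawn over the same bounding box, and favor the k with the largest gap.

```python
# Sketch of the gap statistic: gap(k) = E[log W_ref(k)] - log W_data(k).
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

def wcss(X, k, seed=0):
    return KMeans(n_clusters=k, n_init=10, random_state=seed).fit(X).inertia_

def gap_statistic(X, k_max=6, n_refs=5, seed=0):
    rng = np.random.default_rng(seed)
    lo, hi = X.min(axis=0), X.max(axis=0)
    gaps = {}
    for k in range(1, k_max + 1):
        # Average log-WCSS over uniform reference datasets.
        ref = np.mean([np.log(wcss(rng.uniform(lo, hi, X.shape), k))
                       for _ in range(n_refs)])
        gaps[k] = ref - np.log(wcss(X, k))
    return gaps

X, _ = make_blobs(n_samples=200, centers=3, random_state=0)
gaps = gap_statistic(X)
best_k = max(gaps, key=gaps.get)
```

The full method also uses the standard error of the reference WCSS to pick the smallest adequate k, which this sketch omits.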
A recurring theme in the comments is the inherent difficulty of choosing the "right" number of clusters, especially in high-dimensional spaces. Some users argue that the optimal number of clusters is often dependent on the specific application and downstream analysis, rather than being an intrinsic property of the data. They suggest that domain knowledge and interpretability should play a significant role in the decision-making process.
One commenter points out that the elbow method is particularly problematic when the clusters are not well-separated or when the data has a complex underlying structure. They suggest using visualization techniques, like dimensionality reduction, to gain a better understanding of the data before attempting to cluster it.
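That visualization step might look like this (an illustrative sketch, not from the thread): project the data to two dimensions with PCA and plot it before committing to any clustering.

```python
# Sketch: reduce to 2-D with PCA to eyeball whether any grouping exists
# before running k-means at all.
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA

X, _ = make_blobs(n_samples=300, centers=3, n_features=10, random_state=0)
X2 = PCA(n_components=2).fit_transform(X)  # scatter-plot X2[:, 0] vs X2[:, 1]
```

t-SNE or UMAP are common substitutes when the structure is nonlinear.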
Another comment thread discusses the limitations of k-means clustering itself, regardless of the method used to choose k. Users highlight the algorithm's sensitivity to initial conditions and its assumption of spherical clusters. They propose alternative clustering methods, such as DBSCAN and hierarchical clustering, which may be more suitable for certain types of data.
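The spherical-cluster limitation is easy to demonstrate (an illustrative sketch, not code from the thread): on two concentric rings, k-means with k=2 cuts the plane in half, while density-based DBSCAN recovers the rings without needing k at all.

```python
# Sketch: k-means assumes roughly spherical clusters; DBSCAN groups by
# density and handles the concentric-rings case.
from sklearn.cluster import DBSCAN, KMeans
from sklearn.datasets import make_circles

X, _ = make_circles(n_samples=400, factor=0.3, noise=0.05, random_state=0)

km = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
db = DBSCAN(eps=0.2, min_samples=5).fit_predict(X)  # -1 marks noise points
```

The eps and min_samples values are tuned for this toy data; DBSCAN trades choosing k for choosing a density scale.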
A few commenters defend the elbow method, arguing that it can be a useful starting point for exploratory data analysis. They acknowledge its limitations but suggest that it can provide a rough estimate of the number of clusters, which can be refined using other techniques.
Finally, some commenters discuss the practical implications of choosing the wrong number of clusters. They highlight the potential for misleading results and incorrect conclusions, emphasizing the importance of careful consideration and validation. One commenter suggests using metrics like silhouette score or Calinski-Harabasz index to assess the quality of the clustering.
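Both metrics the commenter names are available in scikit-learn; a hedged sketch of that validation step, scoring a fitted clustering rather than trusting an elbow:

```python
# Sketch: validate a candidate clustering with silhouette and
# Calinski-Harabasz scores (higher is better for both).
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import calinski_harabasz_score, silhouette_score

X, _ = make_blobs(n_samples=300, centers=3, random_state=1)
labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)

sil = silhouette_score(X, labels)        # bounded in [-1, 1]
ch = calinski_harabasz_score(X, labels)  # unbounded, positive
```

Comparing these scores across several candidate k values turns "pick the elbow" into an explicit, repeatable selection rule.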
Overall, the comments section reflects a general consensus that the elbow method is not a reliable technique for determining the optimal number of clusters in k-means. Commenters offer various alternative approaches, emphasize the importance of domain knowledge and data visualization, and discuss the broader challenges of clustering high-dimensional data.