OpenShift Virtualization 4.21 Enhances VM Management
OpenShift Virtualization 4.21: how to simplify VM management and reduce complexity in hybrid cloud
Architecture and Infra on ThecoreGrid covers the foundations of designing and operating scalable, reliable systems at BigTech level. This category brings together system design and infrastructure practices: distributed architectures, highload patterns, cloud-native platforms, and core layers such as compute, networking, and storage. We focus on real engineering decisions — how to balance reliability, performance, cost, and long-term system evolution. Topics include Infrastructure as Code, Kubernetes, multi-region deployments, traffic management, and platform design. Content is grounded in production experience: incident post-mortems, large-scale migrations, and lessons from operating infrastructure under heavy load. Instead of abstract theory, you get practical trade-offs, proven patterns, and insights drawn from real-world systems. Architecture & Infra is built for architects, backend and platform engineers, DevOps teams, and SREs responsible for complex distributed systems and mission-critical infrastructure.
OpenShift Virtualization 4.21: how to simplify VM management and reduce complexity in hybrid cloud
In actor systems, there is no built-in channel for trace context. Discord solved this without changing the architecture and without stopping production.
Distributed inference simulation with Uniference: how DES bridges the gap between modeling and deploying AI systems.
MD5 has long been the standard for authentication in PostgreSQL. However, accumulated limitations have led to a gradual phasing out and a transition to a more robust model.
DNS round-robin stops working under load when clients start caching responses. Agoda faced this issue at the object storage level and moved the balancing to a separate layer. The problem manifested during the increase in data workloads. S3-compatible endpoints used DNS round-robin to distribute traffic. In practice, clients cached DNS responses and continued to hit … Read more
Draft materials about the new AI model became publicly accessible due to a CMS configuration error. The incident highlighted two things simultaneously: the fragility of content pipelines and the increasing risks posed by the models themselves.
Cloudflare adds Custom Regions to align global edge with local restrictions. This is a response to compliance pressures that are beginning to impact routing architecture. The problem arises when the global edge model encounters data localization requirements. Cloudflare’s architecture, by default, optimizes latency through the nearest data center. However, once requirements emerge to keep TLS … Read more
Request timeouts do not always indicate a problem in the database. Often, degradation is hidden in the path between the application and the DB. The problem manifests when database metrics appear stable, but clients experience timeouts. At the observation level, this looks like a contradiction: latency increases while database time remains the same. The reason … Read more
The connection between security and architecture breaks not in the code, but in the decisions. The analysis shows how systemic compromises turn into incidents.
In Kubescape 4.0, the focus shifts from reactive security to proactive security. The main changes include runtime detection, a redesign of the agent model, and the extraction of security data from etcd. The problem manifests at scale. As the cluster grows, security begins to compete for resources with the control plane itself. Storing security metadata … Read more
Controls: ← → to move, ↑ to rotate, ↓ to drop.
Mobile: use buttons below.