InfoQ Homepage Infrastructure Content on InfoQ
-
Virt8ra Sovereign Cloud Expands with Six New European Providers
Virt8ra is a groundbreaking European initiative aiming to establish a sovereign, interoperable cloud ecosystem, countering US cloud dominance. With significant expansion, now inclusive of six new providers, and a focus on open-source technology, Virt8ra promotes data localization and vendor independence, paving the way for an innovative digital future across Europe.
-
HashiCorp Releases Terraform MCP Server for AI Integration
HashiCorp has released the Terraform MCP Server, an open-source implementation of the Model Context Protocol designed to improve how large language models interact with infrastructure as code.
-
InfoQ Dev Summit Boston 2025: AI, Platforms, and Developer Experience
Software development is shifting fast. Senior engineers need real-world insights on AI, platforms, and developer autonomy. InfoQ Dev Summit Boston (June 9-10) offers 2 days with over 27 sessions of curated, technical talks delivered by engineers actively working at scale. We are focused on helping teams navigate the software evolution, with the clarity and context needed to make better decisions.
-
Pulumi Announces Improved Components Feature to Simplify Infrastructure as Code
Pulumi, the open-source infrastructure as code platform, has announced significant improvements to its Components feature, designed to simplify how developers build, share, and consume infrastructure code. The enhancements focus on reducing boilerplate, improving developer experience, and enabling greater reuse of infrastructure patterns.
-
Google Unveils Ironwood TPU for AI Inference
Google's Ironwood TPU, its most advanced custom AI accelerator, powers the "age of inference" with unmatched performance and scalability. With up to 9,216 liquid-cooled chips, it outpaces competitors, delivering 42.5 Exaflops. Engineered for high-efficiency, low-latency AI tasks, Ironwood redefines potential in AI hardware, leveraging AlphaChip to revolutionize chip design.
-
Optimize AI Workloads: Google Cloud’s Tips and Tricks
Google Cloud has announced a suite of new tools and features designed to help organizations reduce costs and improve efficiency of AI workloads across their cloud infrastructure. The announcement comes as enterprises increasingly seek ways to optimize spending on AI initiatives while maintaining performance and scalability.
-
Microsoft Enhances Azure Elastic SAN with Auto Scale, Snapshot Support, and CRC Protection
Microsoft's Azure Elastic SAN, launched in early 2024, revolutionizes cloud block storage with unique autoscale capabilities, snapshot support, and CRC protection for enhanced data integrity. This fully managed solution simplifies storage management and optimizes costs, making it ideal for businesses seeking efficient, high-availability solutions in the cloud.
-
Stack Refactoring for Enhanced Infrastructure Management in AWS CloudFormation Service
AWS CloudFormation's new stack refactoring feature transforms resource management, enabling seamless movement of resources between stacks. This enhances modularity and alignment with business needs, reduces misconfiguration risks, and boosts efficiency. Developers can optimize costs and improve clarity, making cloud architecture more manageable and adaptable.
-
Amazon Launches High Memory U7inh EC2 Instance for Enhanced SAP HANA Workloads
AWS has unveiled the Amazon EC2 High Memory U7inh instance, a game-changer for mission-critical in-memory databases like SAP HANA, offering 32 TB of memory and 1,920 vCPUs. Designed with HPE, it doubles the performance of previous models, ensuring seamless integration in AWS. Maximize your SAP workloads in the cloud with enhanced speed and scalability.
-
Google Cloud Launches Sixth Generation Trillium TPUs: More Performance, Scalability and Efficiency
Google Cloud's Trillium, its sixth-generation TPU, is now available. It enhances AI workloads with unmatched performance and 67% better energy efficiency. Integral to the AI Hypercomputer, Trillium boasts training speeds over 4x faster and triples inference throughput. This leap positions Google as a contender against Nvidia in the AI data center market.
-
Azure Boost DPU: Microsoft's New Silicon Solution for Enhanced Cloud Performance
At Ignite 2024, Microsoft unveiled the Azure Boost DPU, its first in-house solution for low-power, data-centric workloads. This innovative chip optimizes cloud performance and security, offering triple the efficiency of CPUs. With a robust hardware-software design, Microsoft’s advancements position it to redefine AI and cloud infrastructure.
-
Amazon EC2 R8g Instances with AWS Graviton4 Processors Generally Available
AWS has announced the general availability of Amazon EC2 R8g instances, which use AWS Graviton4 processors. These instances have been available in preview since November 2023 and are designed for memory-intensive workloads such as databases, in-memory caches, and real-time big data analytics.
-
Ahrefs Joins Others in Suggesting That On-Premises Hosting Can Be More Cost Effective than Cloud
A recent article claims that Ahrefs, an SEO software suite company, was able to prevent $400 million in expenditures over three years by not leveraging cloud resources. Similarly, 37Signals, the makers of Basecamp, has begun a cloud exodus with the stated goal of saving seven million dollars in infrastructure costs over five years.
-
Azure API Management Basic V2 and Standard V2 GA: Enhancing Scalability, Security, and Networking
Microsoft recently announced the general availability of new pricing tiers for Azure API Management, Basic v2, and Standard v2. It offers scalability and flexibility to support various development projects, from small to enterprise-level applications.
-
QCon London: Mastering Long-Running Processes in Modern Architectures
At QCon London 2024, Bernd Ruecker recommended implementing long-running tasks asynchronously with a process-orchestration platform. Such a platform provides better service boundaries and efficiencies and reduces accidental system complexity and risk. Organizing the platform centrally in an organization eases orchestration adoption by applications.