Data center
A data center or data centre is a building, dedicated space within a building, or a group of buildings used to house computer systems and associated components, such as telecommunications and storage systems.
Since IT operations are crucial for business continuity, it generally includes redundant or backup components and infrastructure for power supply, data communication connections, environmental controls and various security devices. A large data center is an industrial-scale operation using as much electricity as a small town.
History
Data centers have their roots in the huge computer rooms of the 1940s, typified by ENIAC, one of the earliest examples of a data center. Early computer systems, complex to operate and maintain, required a special environment in which to operate. Many cables were necessary to connect all the components, and methods to accommodate and organize these were devised such as standard racks to mount equipment, raised floors, and cable trays. A single mainframe required a great deal of power and had to be cooled to avoid overheating. Security became important – computers were expensive, and were often used for military purposes. Basic design-guidelines for controlling access to the computer room were therefore devised.During the boom of the microcomputer industry, and especially during the 1980s, users started to deploy computers everywhere, in many cases with little or no care about operating requirements. However, as information technology operations started to grow in complexity, organizations grew aware of the need to control IT resources. The advent of Unix from the early 1970s led to the subsequent proliferation of freely available Linux-compatible PC operating-systems during the 1990s. These were called "servers", as timesharing operating systems such as Unix rely heavily on the client-server model to facilitate sharing unique resources between multiple users. The availability of inexpensive networking equipment, coupled with new standards for the network structured cabling, made it possible to use a hierarchical design that put the servers in a specific room inside the company. The use of the term "data center", as applied to specially designed computer rooms, started to gain popular recognition about this time.
The boom of data centers came during the dot-com bubble of 1997–2000. Companies needed fast Internet connectivity and non-stop operation to deploy systems and to establish a presence on the Internet. Installing such equipment was not viable for many smaller companies. Many companies started building very large facilities, called Internet data centers, which provide enhanced capabilities, such as crossover backup: "If a Bell Atlantic line is cut, we can transfer them to... to minimize the time of outage."
The term cloud data centers has been used. Data centers typically cost a lot to build and to maintain. Increasingly, the division of these terms has almost disappeared and they are being integrated into the term "data center".
Requirements for modern data centers
Modernization and data center transformation enhances performance and energy efficiency.Information security is also a concern, and for this reason, a data center has to offer a secure environment that minimizes the chances of a security breach. A data center must, therefore, keep high standards for assuring the integrity and functionality of its hosted computer environment.
Industry research company International Data Corporation puts the average age of a data center at nine years old. Gartner, another research company, says data centers older than seven years are obsolete. The growth in data is one factor driving the need for data centers to modernize.
Focus on modernization is not new: concern about obsolete equipment was decried in 2007, and in 2011 Uptime Institute was concerned about the age of the equipment therein. By 2018 concern had shifted once again, this time to the age of the staff: "data center staff are aging faster than the equipment."
Meeting standards for data centers
The Telecommunications Industry Association's Telecommunications Infrastructure Standard for Data Centers specifies the minimum requirements for telecommunications infrastructure of data centers and computer rooms including single tenant enterprise data centers and multi-tenant Internet hosting data centers. The topology proposed in this document is intended to be applicable to any size data center.Telcordia GR-3160, NEBS Requirements for Telecommunications Data Center Equipment and Spaces, provides guidelines for data center spaces within telecommunications networks, and environmental requirements for the equipment intended for installation in those spaces. These criteria were developed jointly by Telcordia and industry representatives. They may be applied to data center spaces housing data processing or Information Technology equipment. The equipment may be used to:
- Operate and manage a carrier's telecommunication network
- Provide data center based applications directly to the carrier's customers
- Provide hosted applications for a third party to provide services to their customers
- Provide a combination of these and similar data center applications
Data center transformation
- Standardization/consolidation: Reducing the number of data centers and avoiding server sprawl often includes replacing aging data center equipment, and is aided by standardization.
- Virtualization: Lowers capital and operational expenses, reduces energy consumption. Virtualized desktops can be hosted in data centers and rented out on a subscription basis. Investment bank Lazard Capital Markets estimated in 2008 that 48 percent of enterprise operations will be virtualized by 2012. Gartner views virtualization as a catalyst for modernization.
- Automating: Automating tasks such as provisioning, configuration, patching, release management and compliance is needed, not just when facing fewer skilled IT workers.
- Securing: Protection of virtual systems is integrated with existing security of physical infrastructures.
Machine room
Aside from air-conditioning, there must be monitoring equipment, one type of which is to detect water prior to flood-level situations. One company, for several decades, has had share-of-mind: Water Alert. The company, as of 2018, has 2 competing manufacturers and 3 competing distributors.A [|raised floor] standards guide named GR-2930 was developed by Telcordia Technologies, a subsidiary of Ericsson.
Although the first raised floor computer room was made by IBM in 1956, and they've "been around since the 1960s," it was the 1970s that made it more common for computer centers to thereby allow cool air to circulate more efficiently.
The first purpose of the raised floor was to allow access for wiring.
Lights out
The "lights-out" data center, also known as a darkened or a dark data center, is a data center that, ideally, has all but eliminated the need for direct access by personnel, except under extraordinary circumstances. Because of the lack of need for staff to enter the data center, it can be operated without lighting. All of the devices are accessed and managed by remote systems, with automation programs used to perform unattended operations. In addition to the energy savings, reduction in staffing costs and the ability to locate the site further from population centers, implementing a lights-out data center reduces the threat of malicious attacks upon the infrastructure.Data center levels and tiers
The two organizations in the United States that publish data center standards are the Telecommunications Industry Association and the Uptime Institute.Telecommunications Industry Association
The Telecommunications Industry Association's TIA-942 standard for data centers, published in 2005 and updated four times since, defined four infrastructure levels.- Level 1 - basically a server room, following basic guidelines
- Level 4 - designed to host the most mission critical computer systems, with fully redundant subsystems, the ability to continuously operate for an indefinite period of time during primary power outages.
Uptime Institute – Data center Tier Classification Standard
- Tier I: is described as BASIC CAPACITY and must include a UPS
- Tier II: is described as REDUNDANT CAPACITY and adds redundant power and cooling
- Tier III: is described as CONCURRENTLY MAINTAINABLE and insures that ANY component can be taken out of service without affecting production
- Tier IV: is described as FAULT TOLERANT allowing any production capacity to be insulated from ANY type of failure.
Data center design
- a 65-story data center has already been proposed
- the number of data centers as of 2016 had grown beyond 3 million USA-wide, and more than triple that number worldwide
- size - one room of a building, one or more floors, or an entire building, and can hold 1,000 or more servers
- space, power, cooling, and costs in the data center.
- Mechanical engineering infrastructure - heating, ventilation and air conditioning ; humidification and dehumidification equipment; pressurization.
- Electrical engineering infrastructure design - utility service planning; distribution, switching and bypass from power sources; uninterruptible power source systems; and more.
Design criteria and trade-offs
- Availability expectations: Cost of avoiding downtime should not exceed the cost of downtime itself
- Site selection: Location factors include proximity to power grids, telecommunications infrastructure, networking services, transportation lines and emergency services. Others are flight paths, neighboring uses, geological risks and climate.
- * Often available power is hardest to change.
High availability
Modularity and flexibility
Modularity and flexibility are key elements in allowing for a data center to grow and change over time. Data center modules are pre-engineered, standardized building blocks that can be easily configured and moved as needed.A modular data center may consist of data center equipment contained within shipping containers or similar portable containers. Components of the data center can be prefabricated and standardized which facilitates moving if needed.
Environmental control
and humidity are controlled via:- Air conditioning
- indirect cooling, such as using outside air, Indirect Evaporative Cooling units, and also using sea water.
Electrical power
To prevent single points of failure, all elements of the electrical systems, including backup systems, are typically fully duplicated, and critical servers are connected to both the "A-side" and "B-side" power feeds. This arrangement is often made to achieve N+1 redundancy in the systems. Static transfer switches are sometimes used to ensure instantaneous switchover from one supply to the other in the event of a power failure.
Low-voltage cable routing
Options include:- Data cabling can be routed through overhead cable trays
- Raised floor cabling, for security reasons and to avoid the addition of cooling systems above the racks.
- Smaller/less expensive data centers without raised flooring may use anti-static tiles for a flooring surface.
Air flow
Aisle containment
Cold aisle containment is done by exposing the rear of equipment racks, while the fronts of the servers are enclosed with doors and covers..
Computer cabinets are often organized for containment of hot/cold aisles. Ducting prevents cool and exhaust air from mixing. Rows of cabinets are paired to face each other so that cool air can reach equipment air intakes and warm air can be returned to the chillers without mixing.
Alternatively, a range of underfloor panels can create efficient cold air pathways directed to the raised floor vented tiles. Either the cold aisle or the hot aisle can be contained.
Another alternative is fitting cabinets with vertical exhaust ducts Hot exhaust exits can direct the air into a plenum above a drop ceiling and back to the cooling units or to outside vents. With this configuration, traditional hot/cold aisle configuration is not a requirement.
Fire protection
Data centers feature fire protection systems, including passive and Active Design elements, as well as implementation of fire prevention programs in operations. Smoke detectors are usually installed to provide early warning of a fire at its incipient stage.Two water-based options are:
- sprinkler
- mist
- No water - some of the benefits of using chemical suppression.
Security
Logging access is required by some data protection regulations; some organizations tightly link this to access control systems. Multiple log entries can occur at the main entrance, entrances to internal rooms, and at equipment cabinets. Access control at cabinets can be integrated with intelligent power distribution units, so that locks are networked through the same appliance.
Energy use
Energy use is a central issue for data centers. Power draw ranges from a few kW for a rack of servers in a closet to several tens of MW for large facilities. Some facilities have power densities more than 100 times that of a typical office building. For higher power density facilities, electricity costs are a dominant operating expense and account for over 10% of the total cost of ownership of a data center.Power costs for 2012 often exceeded the cost of the original capital investment. Greenpeace estimated worldwide data center power consumption for 2012 as about 382 billion kWh. Global data centers used roughly 416 TWh in 2016, nearly 40% more than the entire United Kingdom; USA DC consumption was 90 billion kWh.
Greenhouse gas emissions
In 2007 the entire information and communication technologies or ICT sector was estimated to be responsible for roughly 2% of global carbon emissions with data centers accounting for 14% of the ICT footprint. The US EPA estimates that servers and data centers are responsible for up to 1.5% of the total US electricity consumption, or roughly.5% of US GHG emissions, for 2007. Given a business as usual scenario greenhouse gas emissions from data centers is projected to more than double from 2007 levels by 2020.In an 18-month investigation by scholars at Rice University's Baker Institute for Public Policy in Houston and the Institute for Sustainable and Applied Infodynamics in Singapore, data center-related emissions will more than triple by 2020.
Energy efficiency and overhead
The most commonly used energy efficiency metric of data center energy efficiency is power usage effectiveness, calculated as the ratio of total power entering the data center divided by the power used by IT equipment.It measures the percentage of power used by overhead. The average USA data center has a PUE of 2.0, meaning two watts of total power for every watt delivered to IT equipment. State-of-the-art is estimated to be roughly 1.2. Google publishes quarterly efficiency from data centers in operation.
The U.S. Environmental Protection Agency has an Energy Star rating for standalone or large data centers. To qualify for the ecolabel, a data center must be within the top quartile of energy efficiency of all reported facilities. The Energy Efficiency Improvement Act of 2015 requires federal facilities — including data centers — to operate more efficiently. California's title 24 of the California Code of Regulations mandates that every newly constructed data center must have some form of airflow containment in place to optimize energy efficiency.
European Union also has a similar initiative: EU Code of Conduct for Data Centres.
Energy use analysis and projects
The focus of measuring and analyzing energy use goes beyond what's used by IT equipment; facility support hardware such as chillers and fans also use energy.In 2011 server racks in data centers were designed for more than 25 kW and the typical server was estimated to waste about 30% of the electricity it consumed. The energy demand for information storage systems was also rising. A high availability data center was estimated to have a 1 mega watt demand and consume $20,000,000 in electricity over its lifetime, with cooling representing 35% to 45% of the data center's total cost of ownership. Calculations showed that in two years the cost of powering and cooling a server could be equal to the cost of purchasing the server hardware. Research in 2018 has shown that substantial amount of energy could still be conserved by optimizing IT refresh rates and increasing server utilization.
In 2011 Facebook, Rackspace and others founded the Open Compute Project to develop and publish open standards for greener data center computing technologies. As part of the project Facebook published the designs of its server, which it had built for its first dedicated data center in Prineville. Making servers taller left space for more effective heat sinks and enabled the use of fans that moved more air with less energy. By not buying commercial off-the-shelf servers, energy consumption due to unnecessary expansion slots on the motherboard and unneeded components, such as a graphics card, was also saved. In 2016 Google joined the project and published the designs of its 48V DC shallow data center rack. This design had long been part of Google data centers. By eliminating the multiple transformers usually deployed in data centers, Google had achieved a 30% increase in energy efficiency. In 2017 sales for data center hardware built to OCP designs topped $1.2 billion and are expected to reach $6 billion by 2021.
Power and cooling analysis
Power is the largest recurring cost to the user of a data center. Cooling it at or below wastes money and energy. Furthermore, overcooling equipment in environments with a high relative humidity can expose equipment to a high amount of moisture that facilitates the growth of salt deposits on conductive filaments in the circuitry.A power and cooling analysis, also referred to as a thermal assessment, measures the relative temperatures in specific areas as well as the capacity of the cooling systems to handle specific ambient temperatures. A power and cooling analysis can help to identify hot spots, over-cooled areas that can handle greater power use density, the breakpoint of equipment loading, the effectiveness of a raised-floor strategy, and optimal equipment positioning to balance temperatures across the data center. Power cooling density is a measure of how much square footage the center can cool at maximum capacity. The cooling of data centers is the second largest power consumer after servers. The cooling energy varies from 10% of the total energy consumption in the most efficient data centers and goes up to 45% in standard air-cooled data centers.
Energy efficiency analysis
An energy efficiency analysis measures the energy use of data center IT and facilities equipment. A typical energy efficiency analysis measures factors such as a data center's power use effectiveness against industry standards, identifies mechanical and electrical sources of inefficiency, and identifies air-management metrics. However, the limitation of most current metrics and approaches is that they do not include IT in the analysis. Case studies have shown that by addressing energy efficiency holistically in a data center, major efficiencies can be achieved that are not possible otherwise.Computational fluid dynamics (CFD) analysis
This type of analysis uses sophisticated tools and techniques to understand the unique thermal conditions present in each data center—predicting the temperature, airflow, and pressure behavior of a data center to assess performance and energy consumption, using numerical modeling. By predicting the effects of these environmental conditions, CFD analysis in the data center can be used to predict the impact of high-density racks mixed with low-density racks and the onward impact on cooling resources, poor infrastructure management practices and AC failure or AC shutdown for scheduled maintenance.Thermal zone mapping
Thermal zone mapping uses sensors and computer modeling to create a three-dimensional image of the hot and cool zones in a data center.This information can help to identify optimal positioning of data center equipment. For example, critical servers might be placed in a cool zone that is serviced by redundant AC units.
Green data centers
Data centers use a lot of power, consumed by two main usages: the power required to run the actual equipment and then the power required to cool the equipment. Power-efficiency reduces the first category.Cooling cost reduction from natural ways includes location decisions: When the focus is not being near good fiber connectivity, power grid connections and people-concentrations to manage the equipment, a data center can be miles away from the users. 'Mass' data centers like Google or Facebook don't need to be near population centers. Arctic locations can use outside air, which provides cooling, are getting more popular.
Renewable electricity sources are another plus. Thus countries with favorable conditions, such as: Canada, Finland, Sweden, Norway and Switzerland, are trying to attract cloud computing data centers.
Bitcoin mining is increasingly being seen as a potential way to build data centers at the site of renewable energy production. Curtailed and clipped energy can be used to secure transactions on the Bitcoin blockchain providing another revenue stream to renewable energy producers.
Energy reuse
It is very difficult to reuse the heat which comes from air cooled data centers. For this reason, data center infrastructures are more often equipped with heat pumps. An alternative to heat pumps is the adoption of liquid cooling throughout a data center. Different liquid cooling techniques are mixed and matched to allow for a fully liquid cooled infrastructure which captures all heat in water. Different liquid technologies are categorized in 3 main groups, Indirect liquid cooling, Direct liquid cooling and Total liquid cooling. This combination of technologies allows the creation of a thermal cascade as part of temperature chaining scenarios to create high temperature water outputs from the data center.Dynamic infrastructure
Dynamic Infrastructure provides the ability to intelligently, automatically and securely move workloads within a data center anytime, anywhere, for migrations, provisioning, to enhance performance, or building co-location facilities. It also facilitates performing routine maintenance on either physical or virtual systems all while minimizing interruption.Side benefits include
- reducing cost
- facilitating business continuity and high availability
- enabling cloud and grid computing.
Network infrastructure
Some of the servers at the data center are used for running the basic Internet and intranet services needed by internal users in the organization, e.g., e-mail servers, proxy servers, and DNS servers.
Network security elements are also usually deployed: firewalls, VPN gateways, intrusion detection systems, and so on. Also common are monitoring systems for the network and some of the applications. Additional off site monitoring systems are also typical, in case of a failure of communications inside the data center.
Software/data backup
Non-mutually exclusive options for backup are:- Onsite
- Offsite
Offsite backup storage
Data backup techniques include having an encrypted copy of the data offsite. Methods used for transporting data are:- having the customer write the data to a physical medium, such as magnetic tape, and then transporting the tape elsewhere.
- directly transferring the data to another site during the backup, using appropriate links
- uploading the data "into the cloud"
Modular data center