C data types


In the C programming language, data types constitute the semantics and characteristics of storage of data elements. They are expressed in the language syntax in form of declarations for memory locations or variables. Data types also determine the types of operations or methods of processing of data elements.
The C language provides basic arithmetic types, such as integer and real number types, and syntax to build array and compound types. Headers for the C standard library, to be used via include directives, contain definitions of support types, that have additional properties, such as providing storage with an exact size, independent of the language implementation on specific hardware platforms.

Basic types

Main types

The C language provides the four basic arithmetic type specifiers char, int, float and double, and the modifiers signed, unsigned, short, and long. The following table lists the permissible combinations in specifying a large set of storage size-specific declarations.
TypeExplanationFormat specifier
charSmallest addressable unit of the machine that can contain basic character set. It is an integer type. Actual type can be either signed or unsigned. It contains CHAR_BIT bits.%c
signed charOf the same size as char, but guaranteed to be signed. Capable of containing at least the range;%c
unsigned charOf the same size as char, but guaranteed to be unsigned. Contains at least the range.%c
short
short int
signed short
signed short int
Short signed integer type. Capable of containing at least the range; thus, it is at least 16 bits in size. The negative value is −32767 due to the one's-complement and sign-magnitude representations allowed by the standard, though the two's-complement representation is much more common.%hi
unsigned short
unsigned short int
Short unsigned integer type. Contains at least the range;%hu
int
signed
signed int
Basic signed integer type. Capable of containing at least the range; thus, it is at least 16 bits in size.%i or %d
unsigned
unsigned int
Basic unsigned integer type. Contains at least the range;%u
long
long int
signed long
signed long int
Long signed integer type. Capable of containing at least the range; thus, it is at least 32 bits in size.%li
unsigned long
unsigned long int
Long unsigned integer type. Capable of containing at least the range;%lu
long long
long long int
signed long long
signed long long int
Long long signed integer type. Capable of containing at least the range; thus, it is at least 64 bits in size. Specified since the C99 version of the standard.%lli
unsigned long long
unsigned long long int
Long long unsigned integer type. Contains at least the range; Specified since the C99 version of the standard.%llu
floatReal floating-point type, usually referred to as a single-precision floating-point type. Actual properties unspecified, however on most systems this is the IEEE 754 single-precision binary floating-point format. This format is required by the optional Annex F "IEC 60559 floating-point arithmetic".Converting from text:
doubleReal floating-point type, usually referred to as a double-precision floating-point type. Actual properties unspecified, however on most systems this is the IEEE 754 double-precision binary floating-point format. This format is required by the optional Annex F "IEC 60559 floating-point arithmetic".
long doubleReal floating-point type, usually mapped to an extended precision floating-point number format. Actual properties unspecified. It can be either x86 extended-precision floating-point format, the non-IEEE "double-double", IEEE 754 quadruple-precision floating-point format, or the same as double. See the article on long double for details.%Lf %LF
%Lg %LG
%Le %LE
%La %LA

The actual size of the integer types varies by implementation. The standard requires only size relations between the data types and minimum sizes for each data type:
The relation requirements are that the long long is not smaller than long, which is not smaller than int, which is not smaller than short. As char's size is always the minimum supported data type, no other data types can be smaller.
The minimum size for char is 8 bits, the minimum size for short and int is 16 bits, for long it is 32 bits and long long must contain at least 64 bits.
The type int should be the integer type that the target processor is most efficiently working with. This allows great flexibility: for example, all types can be 64-bit. However, several different integer width schemes are popular. Because the data model defines how different programs communicate, a uniform data model is used within a given operating system application interface.
In practice, char is usually 8 bits in size and short is usually 16 bits in size. This holds true for platforms as diverse as 1990s SunOS 4 Unix, Microsoft MS-DOS, modern Linux, and Microchip MCC18 for embedded 8-bit PIC microcontrollers. POSIX requires char to be exactly 8 bits in size.
Various rules in the C standard make unsigned char the basic type used for arrays suitable to store arbitrary non-bit-field objects: its lack of padding bits and trap representations, the definition of object representation, and the possibility of aliasing.
The actual size and behavior of floating-point types also vary by implementation. The only guarantee is that long double is not smaller than double, which is not smaller than float. Usually, the 32-bit and 64-bit IEEE 754 binary floating-point formats are used.
The C99 standard includes new real floating-point types float_t and double_t, defined in <math.h>. They correspond to the types used for the intermediate results of floating-point expressions when FLT_EVAL_METHOD is 0, 1, or 2. These types may be wider than long double.
C99 also added complex types: float _Complex, double _Complex, long double _Complex.

Boolean type

added a boolean type _Bool. Additionally, the <stdbool.h> header defines bool as a convenient alias for this type, and also provides macros for true and false. _Bool functions similarly to a normal integer type, with one exception: any assignments to a _Bool that are not 0 are stored as 1. This behavior exists to avoid integer overflows in implicit narrowing conversions. For example, in the following code:

unsigned char b = 256;
if

Variable b evaluates to false if unsigned char has a size of 8 bits. This is because the value 256 does not fit in the data type, which results in the lower 8 bits of it being used, resulting in a zero value. However, changing the type causes the previous code to behave normally:

_Bool b = 256;
if

The type _Bool also ensures true values always compare equal to each other:

_Bool a = 1, b = 2;
if

Size and pointer difference types

The C language specification includes the s size_t and ptrdiff_t to represent memory-related quantities. Their size is defined according to the target processor's arithmetic capabilities, not the memory capabilities, such as available address space. Both of these types are defined in the header.
size_t is an unsigned integer type used to represent the size of any object in the particular implementation. The operator yields a value of the type size_t. The maximum size of size_t is provided via SIZE_MAX, a macro constant which is defined in the header. size_t is guaranteed to be at least 16 bits wide. Additionally, POSIX includes ssize_t, which is a signed integer type of the same width as size_t.
ptrdiff_t is a signed integer type used to represent the difference between pointers. It is guaranteed to be valid only against pointers of the same type; subtraction of pointers consisting of different types is implementation-defined.

Interface to the properties of the basic types

Information about the actual properties, such as size, of the basic arithmetic types, is provided via macro constants in two headers: header defines macros for integer types and header defines macros for floating-point types. The actual values depend on the implementation.

Properties of integer types

The C99 standard includes definitions of several new integer types to enhance the portability of programs. The already available basic integer types were deemed insufficient, because their actual sizes are implementation defined and may vary across different systems. The new types are especially useful in embedded environments where hardware usually supports only several types and that support varies between different environments. All new types are defined in header and also are available at header. The types can be grouped into the following categories:
The following table summarizes the types and the interface to acquire the implementation details :

Printf and scanf format specifiers

The header provides features that enhance the functionality of the types defined in the header. It defines macros for printf format string and scanf format string specifiers corresponding to the types defined in and several functions for working with the intmax_t and uintmax_t types. This header was added in C99.
;Printf format string
The macros are in the format PRI'. Here ' defines the output formatting and is one of d, x, o, u and i. ' defines the type of the argument and is one of n, FASTn, LEASTn, PTR, MAX, where n corresponds to the number of bits in the argument.
;Scanf format string
The macros are in the format SCN
'. Here ' defines the output formatting and is one of d, x, o, u and i. ' defines the type of the argument and is one of n, FASTn, LEASTn, PTR, MAX, where n corresponds to the number of bits in the argument.
;Functions

Additional floating-point types

Similarly to the fixed-width integer types, ISO/IEC TS 18661 specifies floating-point types for IEEE 754 interchange and extended formats in binary and decimal:
Structures aggregate the storage of multiple data items, of potentially differing data types, into one memory block referenced by a single variable. The following example declares the data type struct birthday which contains the name and birthday of a person. The structure definition is followed by a declaration of the variable John that allocates the needed storage.

struct birthday ;
struct birthday John;

The memory layout of a structure is a language implementation issue for each platform, with a few restrictions. The memory address of the first member must be the same as the address of structure itself. Structures may be initialized or assigned to using compound literals. A function may directly return a structure, although this is often not efficient at run-time. Since C99, a structure may also end with a flexible array member.
A structure containing a pointer to a structure of its own type is commonly used to build linked data structures:

struct node ;

Arrays

For every type T, except void and function types, there exist the types "array of N elements of type T". An array is a collection of values, all of the same type, stored contiguously in memory. An array of size N is indexed by integers from 0 up to and including N−1. Here is a brief example:

int cat; // array of 10 elements, each of type int

Arrays can be initialized with a compound initializer, but not assigned. Arrays are passed to functions by passing a pointer to the first element. Multidimensional arrays are defined as "array of array …", and all except the outermost dimension must have compile-time constant size:

int a; // array of 10 elements, each of type 'array of 8 int elements'

Pointers

Every data type T has a corresponding type pointer to T. A pointer is a data type that contains the address of a storage location of a variable of a particular type. They are declared with the asterisk type declarator following the basic storage type and preceding the variable name. Whitespace before or after the asterisk is optional.

char *square;
long *circle;
int *oval;

Pointers may also be declared for pointer data types, thus creating multiple indirect pointers, such as and, including pointers to array types. The latter are less common than an array of pointers, and their syntax may be confusing:

char *pc; // array of 10 elements of 'pointer to char'
char ; // pointer to a 10-element array of char

The element pc requires ten blocks of memory of the size of pointer to char, but element pa is only one pointer, and the data it refers to is an array of ten bytes.

Unions

A union type is a special construct that permits access to the same memory block by using a choice of differing type descriptions. For example, a union of data types may be declared to permit reading the same data either as an integer, a float, or any other user declared type:

union u;

The total size of u is the size of u.s — which happens to be the sum of the sizes of u.s.u and u.s.d — since s is larger than both i and f. When assigning something to u.i, some parts of u.f may be preserved if u.i is smaller than u.f.
Reading from a union member is not the same as casting since the value of the member is not converted, but merely read.

Function pointers

Function pointers allow referencing functions with a particular signature. For example, to store the address of the standard function abs in the variable my_int_f:

int = &abs;
// the & operator can be omitted, but makes clear that the "address of" abs is used here

Function pointers are invoked by name just like normal function calls. Function pointers are separate from pointers and void pointers.

Type qualifiers

The aforementioned types can be characterized further by type qualifiers, yielding a qualified type. and C11, there are four type qualifiers in standard C: const, volatile, restrict and _Atomic the latter has a private name to avoid clashing with user names, but the more ordinary name atomic can be used if the header is included. Of these, const is by far the best-known and most used, appearing in the standard library and encountered in any significant use of the C language, which must satisfy const-correctness. The other qualifiers are used for low-level programming, and while widely used there, are rarely used by typical programmers.