Const (computer programming)
In the C, C++, D, JavaScript and Julia programming languages, const is a type qualifier: a keyword applied to a data type that indicates that the data is read only. While this can be used to declare constants,
const
in the C family of languages differs from similar constructs in other languages in being part of the type, and thus has complicated behavior when combined with pointers, references, composite data types, and type-checking.Introduction
When applied in an object declaration, it indicates that the object is a constant: its value may not be changed, unlike a variable. This basic use – to declare constants – has parallels in many other languages.However, unlike in other languages, in the C family of languages the
const
is part of the type, not part of the object. For example, in C, declares an object x
of int const
type – the const
is part of the type, as if it were parsed " x" – while in Ada, declares a constant X
of INTEGER
type: the constant
is part of the object, but not part of the type.This has two subtle results. Firstly,
const
can be applied to parts of a more complex type – for example, int const * const x;
declares a constant pointer to a constant integer, while int const * x;
declares a variable pointer to a constant integer, and int * const x;
declares a constant pointer to a variable integer. Secondly, because const
is part of the type, it must match as part of type-checking. For example, the following code is invalid:void f;
//...
int const i;
f;
because the argument to
f
must be a variable integer, but i
is a constant integer. This matching is a form of program correctness, and is known as const-correctness. This allows a form of programming by contract, where functions specify as part of their type signature whether they modify their arguments or not, and whether their return value is modifiable or not. This type-checking is primarily of interest in pointers and references – not basic value types like integers – but also for composite data types or templated types such as containers. It is concealed by the fact that the const
can often be omitted, due to type coercion and C being call-by-value.Consequences
The idea of const-ness does not imply that the variable as it is stored in computer memory is unwritable. Rather,const
-ness is a compile-time construct that indicates what a programmer should do, not necessarily what they can do. Note, however, that in the case of predefined data, C const
is often unwritable.Distinction from constants
While a constant does not change its value while the program is running, an object declaredconst
may indeed change its value while the program is running. A common example are read only registers within embedded systems like the current state of a digital input. The data registers for digital inputs are often declared as const
and volatile
. The content of these registers may change without the program doing anything but you shall not write to them either.Other uses
In addition, a member-function can be declared asconst
. In this case, the this
pointer inside such a function is of type object_type const *
rather than merely of type object_type *
. This means that non-const functions for this object cannot be called from inside such a function, nor can member variables be modified. In C++, a member variable can be declared as mutable
, indicating that this restriction does not apply to it. In some cases, this can be useful, for example with caching, reference counting, and data synchronization. In these cases, the logical meaning of the object is unchanged, but the object is not physically constant since its bitwise representation may change.Syntax
In C, C++, and D, all data types, including those defined by the user, can be declaredconst
, and const-correctness dictates that all variables or objects should be declared as such unless they need to be modified. Such proactive use of const
makes values "easier to understand, track, and reason about," and it thus increases the readability and comprehensibility of code and makes working in teams and maintaining code simpler because it communicates information about a value's intended use. This can help the compiler as well as the developer when reasoning about code. It can also enable an optimizing compiler to generate more efficient code.Simple data types
For simple non-pointer data types, applying theconst
qualifier is straightforward. It can go on either side of some types for historical reasons. On some implementations, using const
twice generates a warning but not an error.Pointers and references
For pointer and reference types, the meaning ofconst
is more complicated – either the pointer itself, or the value being pointed to, or both, can be const
. Further, the syntax can be confusing. A pointer can be declared as a const
pointer to writable value, or a writable pointer to a const
value, or const
pointer to const
value. A const
pointer cannot be reassigned to point to a different object from the one it is initially assigned, but it can be used to modify the value that it points to. Reference variables in C++ are an alternate syntax for const
pointers. A pointer to a const
object, on the other hand, can be reassigned to point to another memory location, but it cannot be used to modify the memory that it is pointing to. A const
pointer to a const
object can also be declared and can neither be used to modify the pointee nor be reassigned to point to another object. The following code illustrates these subtleties:void Foo
C convention
Following usual C convention for declarations, declaration follows use, and the*
in a pointer is written on the pointer, indicating dereferencing. For example, in the declaration int *ptr
, the dereferenced form *ptr
is an int
, while the reference form ptr
is a pointer to an int
. Thus const
modifies the name to its right. The C++ convention is instead to associate the *
with the type, as in int* ptr,
and read the const
as modifying the type to the left. int const * ptrToConst
can thus be read as "*ptrToConst
is a int const
", or "ptrToConst
is a int const *
". Thus:int *ptr; // *ptr is an int value
int const *ptrToConst; // *ptrToConst is a constant
int * const constPtr; // constPtr is a constant
int const * const constPtrToConst; // constPtrToConst is a constant
// as is *constPtrToConst
C++ convention
Following C++ convention of analyzing the type, not the value, a rule of thumb is to read the declaration from right to left. Thus, everything to the left of the star can be identified as the pointee type and everything to the right of the star are the pointer properties. For instance, in our example above,int const *
can be read as a writable pointer that refers to a non-writable integer, and int * const
can be read as a non-writable pointer that refers to a writable integer.A more generic rule that helps you understand complex declarations and definitions works like this:
- find the identifier whose declaration you want to understand
- read as far as possible to the right
- back up to where you began, and read backwards to the left
- when you've reached the beginning of the declaration you're done. If not, continue at step 2, beyond the closing parenthesis that was matched last.
When reading to the left, it is important that you read the elements from right to left. So an
int const *
becomes a pointer to a const int and not a const pointer to an int.In some cases C/C++ allows the
const
keyword to be placed to the left of the type. Here are some examples:const int *ptrToConst; //identical to: int const *ptrToConst,
const int *const constPtrToConst; //identical to: int const *const constPtrToConst
Although C/C++ allows such definitions, the compiler still reads the definitions according to the abovementioned procedure: from right to left. But putting
const
before what must be constant quickly introduces mismatches between what you intend to write and what the compiler decides you wrote. Consider pointers to pointers:int **ptr; // a pointer to a pointer to ints
int const **ptr // a pointer to a pointer to constant int value
//
int *const *ptr // a pointer to a const pointer to int values
//
int **const ptr // a constant pointer to pointers to ints
//
int const **const ptr // a constant pointer to pointers to constant int values
As a final note regarding pointer definitions: always write the pointer symbol as much as possible to the right. Attaching the pointer symbol to the type is tricky, as it strongly suggests a pointer type, which isn't the case. Here are some examples:
int* a; /* write: */ int *a; // a is a pointer to an int
int* a, b; // CONFUSING
/* write: */ int *a, b; // a is a pointer to an int,
// but b is a mere int
int* a, *b; // UGLY: both a and b are pointers to ints
/* write: */ int *a, *b;
Bjarne Stroustrup's FAQ recommends only declaring one variable per line if using the C++ convention, to avoid this issue.
The same considerations apply to defining references and rvalue references:
int var = 22;
int const &refToConst = var; // OK
int const& ref2 = var, ref3 = var; // CONFUSING:
// ref2 is a reference, but ref3 isn't:
// ref3 is a constant int initialized with
// var's value
int &const constRef = var; // ERROR: as references can't change anyway.
// C++:
int&& rref = int, value = 10; // CONFUSING:
// rref is an rvalue reference, but value is
// a mere int.
/* write: */ int &&rref = int, value = 10;
More complicated declarations are encountered when using multidimensional arrays and references to pointers. Although it is sometimes argued that such declarations are confusing and error-prone and that they therefore should be avoided or be replaced by higher-level structures, the procedure described at the top of this section can always be used without introducing ambiguities or confusion.
Parameters and variables
const
can be declared both on function parameters and on variables. The interpretation varies between uses. A const
static variable is a constant, and may be used for data like mathematical constants, such as double const PI = 3.14159
– realistically longer, or overall compile-time parameters. A const
automatic variable means that single assignment is happening, though a different value may be used each time, such as int const x_squared = x * x
. A const
parameter in pass-by-reference means that the referenced value is not modified – it is part of the contract – while a const
parameter in pass-by-value does not add anything to the interface, but indicates that internally, the function does not modify the local copy of the parameter. For this reason, some favor using const
in parameters only for pass-by-reference, where it changes the contract, but not for pass-by-value, where it exposes the implementation.C++
Methods
In order to take advantage of the design by contract approach for user-defined types, which can have methods as well as member data, the programmer may tag instance methods asconst
if they don't modify the object's data members.Applying the
const
qualifier to instance methods thus is an essential feature for const-correctness, and is not available in many other object-oriented languages such as Java and C# or in Microsoft's C++/CLI or Managed Extensions for C++.While
const
methods can be called by const
and non-const
objects alike, non-const
methods can only be invoked by non-const
objects.The
const
modifier on an instance method applies to the object pointed to by the "this
" pointer, which is an implicit argument passed to all instance methods.Thus having
const
methods is a way to apply const-correctness to the implicit "this
" pointer argument just like other arguments.This example illustrates:
class C
void Foo
In the above code, the implicit "
this
" pointer to Set
has the type "C *const
"; whereas the "this
" pointer to Get
has type "C const *const
", indicating that the method cannot modify its object through the "this
" pointer.Often the programmer will supply both a
const
and a non-const
method with the same name in a class to accommodate both types of callers. Consider:class MyArray
void Foo
The
const
-ness of the calling object determines which version of MyArray::Get
will be invoked and thus whether or not the caller is given a reference with which he can manipulate or only observe the private data in the object.The two methods technically have different signatures because their "
this
" pointers have different types, allowing the compiler to choose the right one.Loopholes to const-correctness
There are several loopholes to pure const-correctness in C and C++. They exist primarily for compatibility with existing code.The first, which applies only to C++, is the use of
const_cast
, which allows the programmer to strip the const
qualifier, making any object modifiable.The necessity of stripping the qualifier arises when using existing code and libraries that cannot be modified but which are not const-correct. For instance, consider this code:
// Prototype for a function which we cannot change but which
// we know does not modify the pointee passed in.
void LibraryFunc;
void CallLibraryFunc
However, any attempt to modify an object that is itself declared
const
by means of a const cast results in undefined behavior according to the ISO C++ Standard.In the example above, if
ptr
references a global, local, or member variable declared as const
, or an object allocated on the heap via new int const
, the code is only correct if LibraryFunc
really does not modify the value pointed to by ptr
.The C language has a need of a loophole because a certain situation exists. Variables with static storage duration are allowed to be defined with an initial value. However, the initializer can use only constants like string constants and other literals, and is not allowed to use non-constant elements like variable names, whether the initializer elements are declared
const
or not, or whether the static duration variable is being declared const
or not. There is a non-portable way to initialize a const
variable that has static storage duration. By carefully constructing a typecast on the left hand side of a later assignment, a const
variable can be written to, effectively stripping away the const
attribute and 'initializing' it with non-constant elements like other const
variables and such. Writing into a const
variable this way may work as intended, but it causes undefined behavior and seriously contradicts const-correctness:size_t const bufferSize = 8*1024;
size_t const userTextBufferSize; //initial value depends on const bufferSize, can't be initialized here
...
int setupUserTextBox
Another loophole applies both to C and C++. Specifically, the languages dictate that member pointers and references are "shallow" with respect to the
const
-ness of their owners — that is, a containing object that is const
has all const
members except that member pointees are still mutable. To illustrate, consider this C++ code:struct S
void Foo
Although the object
s
passed to Foo
is constant, which makes all of its members constant, the pointee accessible through s.ptr
is still modifiable, though this may not be desirable from the standpoint of const
-correctness because s
might solely own the pointee.For this reason, Meyers argues that the default for member pointers and references should be "deep"
const
-ness, which could be overridden by a mutable
qualifier when the pointee is not owned by the container, but this strategy would create compatibility issues with existing code.Thus, for historical reasons, this loophole remains open in C and C++.
The latter loophole can be closed by using a class to hide the pointer behind a
const
-correct interface, but such classes either do not support the usual copy semantics from a const
object or allow other loopholes by permitting the stripping of const
-ness through inadvertent or intentional copying.Finally, several functions in the C standard library violate const-correctness, as they accept a
const
pointer to a character string and return a non-const
pointer to a part of the same string. strtol
and strchr
are among these functions.Some implementations of the C++ standard library, such as Microsoft's try to close this loophole by providing two overloaded versions of some functions: a "
const
" version and a "non-const
" version.Problems
The use of the type system to express constancy leads to various complexities and problems, and has accordingly been criticized and not adopted outside the narrow C family of C, C++, and D. Java and C#, which are heavily influenced by C and C++, both explicitly rejectedconst
-style type qualifiers, instead expressing constancy by keywords that apply to the identifier. Even within C and C++, the use of const
varies significantly, with some projects and organizations using it consistently, and others avoiding it.strchr
problem
The const
type qualifier causes difficulties when the logic of a function is agnostic to whether its input is constant or not, but returns a value which should be of the same qualified type as an input. In other words, for these functions, if the input is constant, the return value should be as well, but if the input is variable, the return value should be as well. Because the type signature of these functions differs, it requires two functions with the same logic – a form of generic programming.This problem arises even for simple functions in the C standard library, notably
strchr
; this observation is credited by Ritchie to Tom Plum in the mid 1980s. The strchr
function locates a character in a string; formally, it returns a pointer to the first occurrence of the character c
in the string s
, and in classic C its prototype is:char *strchr;
The
strchr
function does not modify the input string, but the return value is often used by the caller to modify the string, such as:if
*p = ' ';
Thus on the one hand the input string can be
const
, and if the input string is const
the return value should be as well – most simply because it might return exactly the input pointer, if the first character is a match – but on the other hand the return value should not be const
if the original string was not const
, since the caller may wish to use the pointer to modify the original string.In C++ this is done via function overloading, typically implemented via a template, resulting in two functions, so that the return value has the same
const
-qualified type as the input:char* strchr;
char const* strchr;
These can in turn be defined by a template:
template
T* strchr
In D this is handled via the
inout
keyword, which acts as a wildcard for const, immutable, or unqualified, yielding:inout* strchr;
However, in C neither of these is possible since C does not have function overloading, and instead, this is handled by having a single function where the input is constant but the output is writable:
char *strchr;
This allows idiomatic C code but does strip the const qualifier if the input actually was const-qualified, violating type safety. This solution was proposed by Ritchie and subsequently adopted. This difference is one of the failures of compatibility of C and C++.
D
In Version 2 of the D programming language, two keywords relating to const exist. Theimmutable
keyword denotes data that cannot be modified through any reference.The
const
keyword denotes a non-mutable view of mutable data.Unlike C++
const
, D const
and immutable
are "deep" or transitive, and anything reachable through a const
or immutable
object is const
or immutable
respectively.Example of const vs. immutable in D
int foo = new int; // foo is mutable.
const int bar = foo; // bar is a const view of mutable data.
immutable int baz = foo; // Error: all views of immutable data must be immutable.
immutable int nums = new immutable; // No mutable reference to nums may be created.
const int constNums = nums; // Works. immutable is implicitly convertible to const.
int mutableNums = nums; // Error: Cannot create a mutable view of immutable data.
Example of transitive or deep const in D
class Foo
immutable Foo foo = new immutable;
foo.next.num = 5; // Won't compile. foo.next is of type immutable.
// foo.next.num is of type immutable.
History
const
was introduced by Bjarne Stroustrup in C with Classes, the predecessor to C++, in 1981, and was originally called readonly
. As to motivation, Stroustrup writes:The first use, as a scoped and typed alternative to macros, was analogously fulfilled for function-like macros via the
inline
keyword. Constant pointers, and the * const
notation, were suggested by Dennis Ritchie and so adopted.const
was then adopted in C as part of standardization, and appears in C89 along with the other type qualifier, volatile
. A further qualifier, noalias
, was suggested at the December 1987 meeting of the X3J11 committee, but was rejected; its goal was ultimately fulfilled by the restrict
keyword in C99. Ritchie was not very supportive of these additions, arguing that they did not "carry their weight", but ultimately did not argue for their removal from the standard.D subsequently inherited
const
from C++, where it is known as a type constructor and added two further type constructors, immutable
and inout
, to handle related use cases.Other languages
Other languages do not follow C/C++ in having constancy part of the type, though they often have superficially similar constructs and may use theconst
keyword. Typically this is only used for constants.C# has a
const
keyword, but with radically different and simpler semantics: it means a compile-time constant, and is not part of the type.Nim has a
const
keyword similar to that of C#: it also declares a compile-time constant rather than forming part of the type. However, in Nim, a constant can be declared from any expression that can be evaluated at compile time. In C#, only C# built-in types can be declared as const
; user-defined types, including classes, structs, and arrays, cannot be const
.Java does not have
const
– it instead has final
, which can be applied to local "variable" declarations and applies to the identifier, not the type. It has a different object-oriented use for object members, which is the origin of the name.The Java language specification regards
const
as a reserved keyword – i.e., one that cannot be used as variable identifier – but assigns no semantics to it: it is a reserved word but not a keyword. It is thought that the reservation of the keyword occurred to allow for an extension of the Java language to include C++-style const
methods and pointer to const
type. An enhancement request ticket for implementing const
correctness exists in the Java Community Process, but was closed in 2005 on the basis that it was impossible to implement in a backwards-compatible fashion.The contemporary Ada 83 independently had the notion of a constant object and a
constant
keyword, with input parameters and loop parameters being implicitly constant. Here the constant
is a property of the object, not of the type.JavaScript has a
const
declaration that defines a block-scoped variable that cannot be reassigned nor redeclared. It defines a read-only reference to a variable that cannot be redefined, but in some situations the value of the variable itself may potentially change, such as if the variable refers to an object and a property of it is altered.