Cython is a programming language that aims to be a superset of the Python programming language, designed to give C-like performance with code that is written mostly in Python with optional additional C-inspired syntax. Cython is a compiled language that is typically used to generate CPython extension modules. Annotated Python-like code is compiled to C or C++ and then automatically wrapped in interface code, producing extension modules that regular Python code can load with the import statement, but with significantly less computational overhead at run time. Cython also facilitates wrapping independent C or C++ code into Python-importable modules. Cython is written in Python and C and works on Windows, macOS, and Linux, producing source files compatible with CPython 2.6, 2.7, and 3.3 through 3.8. Cython 3.0.0 is in development.
Design
Cython works by producing a standard Python module. However, the behavior differs from standard Python in that the module code, originally written in Python, is translated into C. While the resulting code is fast, it makes many calls into the CPython interpreter and CPython standard libraries to perform actual work. Choosing this arrangement saved considerably on Cython's development time, but it means that modules depend on the Python interpreter and standard library.
Although most of the code is C-based, a small stub loader written in interpreted Python is usually required. This is not a major problem, because the Python interpreter is present anyway.
Cython has a foreign function interface for invoking C/C++ routines and the ability to declare the static type of subroutine parameters and results, local variables, and class attributes.
A Cython program that implements the same algorithm as a corresponding Python program may consume fewer computing resources, such as memory and processing cycles, due to differences between the CPython and Cython execution models. A basic Python program is loaded and executed by the CPython virtual machine, so both the runtime and the program itself consume computing resources. A Cython program is compiled to C code, which is further compiled to machine code, so the virtual machine is used only briefly when the program is loaded.
Cython employs:
Performance depends both on what C code is generated by Cython and how that code is compiled by the C compiler.
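To make the interpreter overhead described in this section concrete, the following sketch uses the standard-library dis module to list the bytecode that the CPython virtual machine executes for a simple loop. The function name total is hypothetical, and the exact instructions differ between CPython versions; this only illustrates the interpreted side of the comparison, not Cython's generated C.

```python
import dis


def total(n):
    """Sum the integers below n with an ordinary Python loop."""
    a = 0
    for i in range(n):
        a += i
    return a


# Each entry is one bytecode instruction that the CPython virtual machine
# decodes and dispatches while the function runs.
instructions = list(dis.get_instructions(total))
opnames = [ins.opname for ins in instructions]

print(len(instructions), "bytecode instructions")
print(sorted(set(opnames)))
```

Every iteration of the loop passes through several of these instructions, which is the kind of per-step cost that compiling to C is meant to reduce.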
History
Cython is a derivative of the Pyrex language and supports more features and optimizations than Pyrex. Cython was forked from Pyrex in 2007 by developers of the Sage computer algebra package, because they were unhappy with Pyrex's limitations and could not get patches accepted by Pyrex's maintainer Greg Ewing, who envisioned a much smaller scope for his tool than the Sage developers had in mind. They named their fork SageX. When they found that people were downloading Sage just to get SageX, and that developers of other packages were also maintaining forks of Pyrex, SageX was split off from the Sage project and merged with cython-lxml to become Cython.
Cython files have a .pyx extension. At its most basic, Cython code looks exactly like Python code. However, whereas standard Python is dynamically typed, Cython allows types to be declared optionally, which improves performance and lets loops be converted into C loops where possible. For example:

    def primes(int kmax):  # The argument will be converted to int or raise a TypeError.
        cdef int n, k, i  # These variables are declared with C types.
        cdef int p[1000]  # Another C type
        result = []  # A Python type
        if kmax > 1000:
            kmax = 1000
        k = 0
        n = 2
        while k < kmax:
            i = 0
            while i < k and n % p[i] != 0:
                i = i + 1
            if i == k:
                p[k] = n
                k = k + 1
                result.append(n)
            n = n + 1
        return result
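For comparison, here is a minimal sketch of the same trial-division algorithm in plain Python, with the C declarations removed. The name primes_py and the use of a Python list in place of the C array are assumptions made for illustration. Because the code contains no Cython-specific syntax, it runs under the regular interpreter and is also valid Cython input.

```python
def primes_py(kmax):
    """Return the first kmax primes (capped at 1000, like the typed version)."""
    p = []  # primes found so far; plays the role of the C array
    if kmax > 1000:
        kmax = 1000
    n = 2
    while len(p) < kmax:
        i = 0
        # Advance past every known prime that does not divide n.
        while i < len(p) and n % p[i] != 0:
            i += 1
        if i == len(p):  # no known prime divides n, so n is prime
            p.append(n)
        n += 1
    return p


print(primes_py(10))  # [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]
```

The typed version differs only in its declarations: the loop counters and the array become C variables, which is what allows the inner loops to be translated into C loops.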
Example
A sample hello world program for Cython is more complex than in most languages because it interfaces with the Python C API and the setuptools extension building facility. At least three files are required for a basic project:
A setup.py file to invoke the setuptools build process that generates the extension module
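As a rough sketch of such a project layout, the script below writes three small files into a directory: a Cython source file, the setup.py build script, and an ordinary Python program that imports the compiled module. The file names hello.pyx and launch.py, the function say_hello, and the helper write_project are all hypothetical; only setuptools' setup and Cython's cythonize are real library calls, and they appear here as file contents rather than being executed.

```python
import tempfile
from pathlib import Path

# Hypothetical file names and contents for a minimal project.
PROJECT_FILES = {
    # Cython source: plain Python here, compiled into an extension module.
    "hello.pyx": (
        "def say_hello():\n"
        "    print('Hello, world!')\n"
    ),
    # Build script: cythonize() translates hello.pyx to C and sets up the extension.
    "setup.py": (
        "from setuptools import setup\n"
        "from Cython.Build import cythonize\n"
        "\n"
        "setup(ext_modules=cythonize('hello.pyx'))\n"
    ),
    # Ordinary Python program that imports the compiled module.
    "launch.py": (
        "import hello\n"
        "\n"
        "hello.say_hello()\n"
    ),
}


def write_project(target_dir):
    """Write the three files into target_dir and return their paths."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    paths = []
    for name, text in PROJECT_FILES.items():
        path = target / name
        path.write_text(text)
        paths.append(path)
    return paths


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        for path in write_project(tmp):
            print(path.name)
```

With Cython and a C compiler installed, running python setup.py build_ext --inplace in that directory would typically build the extension module, after which python launch.py should print the greeting.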
A more straightforward way to start with Cython is through command-line IPython:

    In [1]: %load_ext Cython

    In [2]: %%cython
       ...: def f(n):
       ...:     a = 0
       ...:     for i in range(n):
       ...:         a += i
       ...:     return a
       ...:
       ...: cpdef g(int n):
       ...:     cdef long a = 0
       ...:     cdef int i
       ...:     for i in range(n):
       ...:         a += i
       ...:     return a
       ...:

    In [3]: %timeit f(1000000)
    10 loops, best of 3: 26.5 ms per loop

    In [4]: %timeit g(1000000)
    1000 loops, best of 3: 279 µs per loop
In this session, the typed g runs about 95 times faster than the pure-Python f (26.5 ms versus 279 µs per loop). More details are available on the official quickstart page.
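Outside IPython, the pure-Python baseline can be measured with the standard-library timeit module. The following is a minimal sketch: the workload size N is a hypothetical value chosen to keep the run short, and the timings it prints depend on the machine, so they are not comparable to the figures quoted above.

```python
import timeit


def f(n):
    """Pure-Python loop: sum the integers below n."""
    a = 0
    for i in range(n):
        a += i
    return a


# Hypothetical workload size; adjust it to the machine at hand.
N = 100_000

# Run the call 10 times per trial and keep the fastest of 3 trials.
best = min(timeit.repeat(lambda: f(N), number=10, repeat=3)) / 10
print(f"f({N}): {best * 1e3:.2f} ms per call")
```

Timing a compiled counterpart the same way would require building it first, so only the interpreted function is measured here.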
Uses
Cython is particularly popular among scientific users of Python, where it has "the perfect audience" according to Python creator Guido van Rossum. Of particular note:
Some high-traffic websites such as Quora use Cython.
Cython's domain is not limited to just numerical computing. For example, the lxml XML toolkit is written mostly in Cython, and like its predecessor Pyrex, Cython is used to provide Python bindings for many C and C++ libraries such as the messaging library ZeroMQ. Cython can also be used to develop parallel programs for multi-core processor machines; this feature makes use of the OpenMP library.