Most ints are not floats

John

from John D. Cook on 2025-06-27 13:26 (#6Y98A)

All integers are real numbers, but most computer representations of integers do not equal computer representations of real numbers.

To make the statement above precise, we have to be more specific about what we mean by computer integers and floating point numbers. I'll use int32 and int64 to refer to 32-bit and 64-bit signed integers. I'll use float32 and float64 to refer to IEEE 754 single precision and double precision numbers, what C calls float and double.

Most int32 numbers cannot be represented exactly by a float32. All int32 numbers can be represented approximately as float32 numbers, but usually not exactly. The previous statements remain true if you replace 32" everywhere by 64."

32 bit details

The int32 data type represents integers -2³¹ through 2³¹ - 1. The float32 data type represents numbers of the form

1.f * 2^e

where 1 bit represents the sign, 23 bits represent f, and 8 bits represent e.

The numbern = 2²⁴ can be represented by setting the fractional part f to 0 and setting the exponente to 24. But the number n + 1 cannot be represented as a float32 because its last bit would fall off the end of f:

2²⁴ + 1 = (1 + 2^-24) 2²⁴ = 1.000000000000000000000001_two * 2²⁴

The bits in f fill up representing the 23 zeros after the decimal place. There's no 24th bit to store the final 1.

For each value ofe, there are 2²³ possible values off. So foreach of e = 24, 25, 26, ..., 31 there are 2²³ representable integers, for a total of 8 * 2²³.

So of the 2³¹ non-negative integers that can be represented by an int32 data type, only 9 * 2²³ can also be represented exactly as a float32 data type. This means about 3.5% of 32-bit integers can be represented exactly by a 32-bit float.

64 bit details

The calculations for 64-bit integers and 64-bit floating point numbers are analogous. Numbers represented by float64 data types have the form

1.f * 2^e

where nowf has 52 bits ande has 11.

Of the 2⁶³non-negative integers that can be represented by an int64 data type, only 11 * 2⁵²can also be represented exactly as a float64 data type. This means about 0.5% of 64-bit integers can be represented exactly by a 64-bit float.

A note on Python

Python's integers have unlimited range, while its floating point numbers correspond to float64. So there are two reasons an integer might not be representable as a float: it may be larger than the largest float, or it may require more than 53 bits of precision.

The post Most ints are not floats first appeared on John D. Cook.

Source	RSS or Atom Feed
Feed Location	http://feeds.feedburner.com/TheEndeavour?format=xml
Feed Title	John D. Cook
Feed Link	https://www.johndcook.com/blog