(http://www.jhauser.us/arithmetic/SoftFloat.html) with RISCV specialization and cmake build
25 KiB
Berkeley SoftFloat Release 3e: Source Documentation
John R. Hauser
2018 January 20
Contents
1. Introduction 2. Limitations 3. Acknowledgments and License 4. SoftFloat Package Directory Structure 5. Issues for Porting SoftFloat to a New Target 5.1. Standard Headers <stdbool.h>
and<stdint.h>
5.2. Specializing Floating-Point Behavior 5.3. Macros for Build Options 5.4. Adapting a Template Target Directory 5.5. Target-Specific Optimization of Primitive Functions 6. Testing SoftFloat 7. Providing SoftFloat as a Common Library for Applications 8. Contact Information
1. Introduction
This document gives information needed for compiling and/or porting Berkeley
SoftFloat, a library of C functions implementing binary floating-point
conforming to the IEEE Standard for Floating-Point Arithmetic.
For basic documentation about SoftFloat refer to
SoftFloat.html
.
The source code for SoftFloat is intended to be relatively machine-independent
and should be compilable with any ISO-Standard C compiler that also supports
64-bit integers.
SoftFloat has been successfully compiled with the GNU C Compiler
(gcc
) for several platforms.
Release 3 of SoftFloat was a complete rewrite relative to
Release 2 or earlier.
Changes to the interface of SoftFloat functions are documented in
SoftFloat.html
.
The current version of SoftFloat is Release 3e.
2. Limitations
SoftFloat assumes the computer has an addressable byte size of either 8 or 16 bits. (Nearly all computers in use today have 8-bit bytes.)
SoftFloat is written in C and is designed to work with other C code. The C compiler used must conform at a minimum to the 1989 ANSI standard for the C language (same as the 1990 ISO standard) and must in addition support basic arithmetic on 64-bit integers. Earlier releases of SoftFloat included implementations of 32-bit single-precision and 64-bit double-precision floating-point that did not require 64-bit integers, but this option is not supported starting with Release 3. Since 1999, ISO standards for C have mandated compiler support for 64-bit integers. A compiler conforming to the 1999 C Standard or later is recommended but not strictly required.
C Standard header files <stdbool.h>
and
<stdint.h>
are required for defining standard Boolean and
integer types.
If these headers are not supplied with the C compiler, minimal substitutes must
be provided.
SoftFloat’s dependence on these headers is detailed later in
section 5.1, Standard Headers <stdbool.h>
and <stdint.h>
.
3. Acknowledgments and License
The SoftFloat package was written by me, John R. Hauser. Release 3 of SoftFloat was a completely new implementation supplanting earlier releases. The project to create Release 3 (now through 3e) was done in the employ of the University of California, Berkeley, within the Department of Electrical Engineering and Computer Sciences, first for the Parallel Computing Laboratory (Par Lab) and then for the ASPIRE Lab. The work was officially overseen by Prof. Krste Asanovic, with funding provided by these sources:
Par Lab: Microsoft (Award #024263), Intel (Award #024894), and U.C. Discovery (Award #DIG07-10227), with additional support from Par Lab affiliates Nokia, NVIDIA, Oracle, and Samsung. ASPIRE Lab: DARPA PERFECT program (Award #HR0011-12-2-0016), with additional support from ASPIRE industrial sponsor Intel and ASPIRE affiliates Google, Nokia, NVIDIA, Oracle, and Samsung.
The following applies to the whole of SoftFloat Release 3e as well as to each source file individually.
Copyright 2011, 2012, 2013, 2014, 2015, 2016, 2017, 2018 The Regents of the University of California. All rights reserved.
Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:
-
Redistributions of source code must retain the above copyright notice, this list of conditions, and the following disclaimer.
-
Redistributions in binary form must reproduce the above copyright notice, this list of conditions, and the following disclaimer in the documentation and/or other materials provided with the distribution.
-
Neither the name of the University nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission.
THIS SOFTWARE IS PROVIDED BY THE REGENTS AND CONTRIBUTORS “AS IS”, AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE, ARE DISCLAIMED. IN NO EVENT SHALL THE REGENTS OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
4. SoftFloat Package Directory Structure
Because SoftFloat is targeted to multiple platforms, its source code is slightly scattered between target-specific and target-independent directories and files. The supplied directory structure is as follows:
The majority of the SoftFloat sources are provided in thedoc source include 8086 8086-SSE ARM-VFPv2 ARM-VFPv2-defaultNaN build template-FAST_INT64 template-not-FAST_INT64 Linux-386-GCC Linux-386-SSE2-GCC Linux-x86_64-GCC Linux-ARM-VFPv2-GCC Win32-MinGW Win32-SSE2-MinGW Win64-MinGW-w64
source
directory.
The include
subdirectory contains several header files
(unsurprisingly), while the other subdirectories of source
contain
source files that specialize the floating-point behavior to match particular
processor families:
If other specializations are attempted, these would be expected to be other subdirectories of
8086
- Intel’s older, 8087-derived floating-point, extended to all supported floating-point types
8086-SSE
- Intel’s x86 processors with Streaming SIMD Extensions (SSE) and later compatible extensions, having 8087 behavior for 80-bit double-extended-precision (
extFloat80_t
) and SSE behavior for other floating-point typesARM-VFPv2
- ARM’s VFPv2 or later floating-point, with NaN payload propagation
ARM-VFPv2-defaultNaN
- ARM’s VFPv2 or later floating-point, with the “default NaN” option
source
alongside the ones listed above.
Specialization is covered later, in section 5.2, Specializing
Floating-Point Behavior.
The build
directory is intended to contain a subdirectory for each
target platform for which a build of the SoftFloat library may be created.
For each build target, the target’s subdirectory is where all derived
object files and the completed SoftFloat library (typically
softfloat.a
or libsoftfloat.a
) are created.
The two template
subdirectories are not actual build targets but
contain sample files for creating new target directories.
(The meaning of FAST_INT64
will be explained later.)
Ignoring the template
directories, the supplied target directories
are intended to follow a naming system of
<execution-environment>-<compiler>
.
For the example targets,
<execution-environment>
is
Linux-386
, Linux-386-SSE2
,
Linux-x86_64
,
Linux-ARM-VFPv2
, Win32
,
Win32-SSE2
, or Win64
, and
<compiler>
is GCC
,
MinGW
, or MinGW-w64
.
All of the supplied target directories are merely examples that may or may not be correct for compiling on any particular system. Despite requests, there are currently no plans to include and maintain in the SoftFloat package the build files needed for a great many users’ compilation environments, which can span a huge range of operating systems, compilers, and other tools.
As supplied, each target directory contains two files:
The providedMakefile platform.h
Makefile
is written for GNU make
.
A build of SoftFloat for the specific target is begun by executing the
make
command with the target directory as the current directory.
A completely different build tool can be used if an appropriate
Makefile
equivalent is created.
The platform.h
header file exists to provide a location for
additional C declarations specific to the build target.
Every C source file of SoftFloat contains a #include
for
platform.h
.
In many cases, the contents of platform.h
can be as simple as one
or two lines of code.
At the other extreme, to get maximal performance from SoftFloat, it may be
desirable to include in header platform.h
(directly or via
#include
) declarations for numerous target-specific optimizations.
Such possibilities are discussed in the next section, Issues for Porting
SoftFloat to a New Target.
If the target’s compiler or library has bugs or other shortcomings,
workarounds for these issues may also be possible with target-specific
declarations in platform.h
, avoiding the need to modify the main
SoftFloat sources.
5. Issues for Porting SoftFloat to a New Target
5.1. Standard Headers <stdbool.h>
and <stdint.h>
The SoftFloat sources make use of standard headers
<stdbool.h>
and <stdint.h>
, which have
been part of the ISO C Standard Library since 1999.
With any recent compiler, these standard headers are likely to be supported,
even if the compiler does not claim complete conformance to the latest ISO C
Standard.
For older or nonstandard compilers, substitutes for
<stdbool.h>
and <stdint.h>
may need to be
created.
SoftFloat depends on these names from <stdbool.h>
:
and on these names frombool true false
<stdint.h>
:
uint16_t uint32_t uint64_t int32_t int64_t UINT64_C INT64_C uint_least8_t uint_fast8_t uint_fast16_t uint_fast32_t uint_fast64_t int_fast8_t int_fast16_t int_fast32_t int_fast64_t
5.2. Specializing Floating-Point Behavior
The IEEE Floating-Point Standard allows for some flexibility in a conforming
implementation, particularly concerning NaNs.
The SoftFloat source
directory is supplied with some
specialization subdirectories containing possible definitions for this
implementation-specific behavior.
For example, the 8086
and 8086-SSE
subdirectories have source files that specialize SoftFloat’s behavior to
match that of Intel’s x86 line of processors.
The files in a specialization subdirectory must determine:
- whether tininess for underflow is detected before or after rounding by default;
- how signaling NaNs are distinguished from quiet NaNs;
- what (if anything) special happens when exceptions are raised;
- the default generated quiet NaNs;
- how NaNs are propagated from function inputs to output; and
- the integer results returned when conversions to integer type raise the invalid exception.
As provided, the build process for a target expects to involve exactly
one specialization directory that defines all of these
implementation-specific details for the target.
A specialization directory such as 8086
is expected to contain a
header file called specialize.h
, together with whatever other
source files are needed to complete the specialization.
A new build target may use an existing specialization, such as the ones
provided by the 8086
and 8086-SSE
subdirectories.
If a build target needs a new specialization, different from any existing ones,
it is recommended that a new specialization directory be created for this
purpose.
The specialize.h
header file from any of the provided
specialization subdirectories can be used as a model for what definitions are
needed.
5.3. Macros for Build Options
The SoftFloat source files adapt the floating-point implementation according to several C preprocessor macros:
LITTLEENDIAN
- Must be defined for little-endian machines; must not be defined for big-endian machines.
INLINE
- Specifies the sequence of tokens used to indicate that a C function should be inlined. If macro
INLINE_LEVEL
is defined with a value of 1 or higher, this macro must be defined; otherwise, this macro is ignored and need not be defined. For compilers that conform to the C Standard’s rules for inline functions, this macro can be defined as the single keywordinline
. For other compilers that follow a convention pre-dating the standardization ofinline
, this macro may need to be defined toextern
inline
.THREAD_LOCAL
- Can be defined to a sequence of tokens that, when appearing at the start of a variable declaration, indicates to the C compiler that the variable is per-thread, meaning that each execution thread gets its own separate instance of the variable. This macro is used in header
softfloat.h
in the declarations of variablessoftfloat_roundingMode
,softfloat_detectTininess
,extF80_roundingPrecision
, andsoftfloat_exceptionFlags
. If macroTHREAD_LOCAL
is left undefined, these variables will default to being ordinary global variables. Depending on the compiler, possible valid definitions of this macro include_Thread_local
and__thread
.
SOFTFLOAT_ROUND_ODD
- Can be defined to enable support for optional rounding mode
softfloat_round_odd
.
INLINE_LEVEL
- Can be defined to an integer to determine the degree of inlining requested of the compiler. Larger numbers request that more inlining be done. If this macro is not defined or is defined to a value less than 1 (zero or negative), no inlining is requested. The maximum effective value is no higher than 5. Defining this macro to a value greater than 5 is the same as defining it to 5.
SOFTFLOAT_FAST_INT64
- Can be defined to indicate that the build target’s implementation of 64-bit arithmetic is efficient. For newer 64-bit processors, this macro should usually be defined. For very small microprocessors whose buses and registers are 8-bit or 16-bit in size, this macro should usually not be defined. Whether this macro should be defined for a 32-bit processor may depend on the target machine and the applications that will use SoftFloat.
SOFTFLOAT_FAST_DIV32TO16
- Can be defined to indicate that the target’s division operator in C (written as
/
) is reasonably efficient for dividing a 32-bit unsigned integer by a 16-bit unsigned integer. Setting this macro may affect the performance of functionf16_div
.SOFTFLOAT_FAST_DIV64TO32
- Can be defined to indicate that the target’s division operator in C (written as
/
) is reasonably efficient for dividing a 64-bit unsigned integer by a 32-bit unsigned integer. Setting this macro may affect the performance of division, remainder, and square root operations other thanf16_div
.
Following the usual custom for C, for most of these macros (all
except INLINE
, THREAD_LOCAL
, and
INLINE_LEVEL
), the content of any definition is irrelevant;
what matters is a macro’s effect on #ifdef
directives.
It is recommended that any definitions of macros LITTLEENDIAN
,
INLINE
, and THREAD_LOCAL
be made in a build
target’s platform.h
header file, because these macros are
expected to be determined inflexibly by the target machine and compiler.
The other five macros select options and control optimization, and thus might
be better located in the target’s Makefile (or its equivalent).
5.4. Adapting a Template Target Directory
In the build
directory, two template
subdirectories
provide models for new target directories.
Two different templates exist because different functions are needed in the
SoftFloat library depending on whether macro SOFTFLOAT_FAST_INT64
is defined.
If macro SOFTFLOAT_FAST_INT64
will be defined,
template-FAST_INT64
is the template to use;
otherwise, template-not-FAST_INT64
is the appropriate
template.
A new target directory can be created by copying the correct template directory
and editing the files inside.
To avoid confusion, it would be wise to refrain from editing the files within a
template directory directly.
5.5. Target-Specific Optimization of Primitive Functions
Header file primitives.h
(in directory
source/include
) declares macros and functions for numerous
underlying arithmetic operations upon which many of SoftFloat’s
floating-point functions are ultimately built.
The SoftFloat sources include implementations of all of these functions/macros,
written as standard C code, so a complete and correct SoftFloat library can be
created using only the supplied code for all functions.
However, for many targets, SoftFloat’s performance can be improved by
substituting target-specific implementations of some of the functions/macros
declared in primitives.h
.
For example, primitives.h
declares a function called
softfloat_countLeadingZeros32
that takes an unsigned
32-bit integer as an argument and returns the number of the
integer’s most-significant bits that are zeros.
While the SoftFloat sources include an implementation of this function written
in standard C, many processors can perform this same function
directly in only one or two machine instructions.
An alternative, target-specific implementation that maps to those instructions
is likely to be more efficient than the generic C code from the SoftFloat
package.
A build target can replace the supplied version of any function or macro of
primitives.h
by defining a macro with the same name in the
target’s platform.h
header file.
For this purpose, it may be helpful for platform.h
to
#include
header file primitiveTypes.h
, which defines
types used for arguments and results of functions declared in
primitives.h
.
When a desired replacement implementation is a function, not a macro, it is
sufficient for platform.h
to include the line
where#define <function-name> <function-name>
<function-name>
is the name of the
function.
This technically defines <function-name>
as a macro, but one that resolves to the same name, which may then be a
function.
(A preprocessor that conforms to the C Standard is required to limit recursive
macro expansion from being applied more than once.)
The supplied header file opts-GCC.h
(in directory
source/include
) provides an example of target-specific
optimization for the GCC compiler.
Each GCC target example in the build
directory has
#include "opts-GCC.h"
in its platform.h
header file.
Before opts-GCC.h
is included, the following macros must be
defined (or not) to control which features are invoked:
On some machines, these improvements are observed to increase the speeds of
SOFTFLOAT_BUILTIN_CLZ
- If defined, SoftFloat’s internal ‘
countLeadingZeros
’ functions use intrinsics__builtin_clz
and__builtin_clzll
.SOFTFLOAT_INTRINSIC_INT128
- If defined, SoftFloat makes use of GCC’s nonstandard 128-bit integer type
__int128
.
f64_mul
and f128_mul
by around 20 to 25%, although
other functions receive less dramatic boosts, or none at all.
Results can vary greatly across different platforms.
6. Testing SoftFloat
SoftFloat can be tested using the testsoftfloat
program by the
same author.
This program is part of the Berkeley TestFloat package available at the Web
page
http://www.jhauser.us/arithmetic/TestFloat.html
.
The TestFloat package also has a program called timesoftfloat
that
measures the speed of SoftFloat’s floating-point functions.
7. Providing SoftFloat as a Common Library for Applications
Header file softfloat.h
defines the SoftFloat interface as seen by
clients.
If the SoftFloat library will be made a common library for programs on a
system, the supplied softfloat.h
has a couple of deficiencies for
this purpose:
-
As supplied,
softfloat.h
depends on another header,softfloat_types.h
, that is not intended for public use but which must also be visible to the programmer’s compiler. -
More troubling, at the time
softfloat.h
is included in a C source file, macrosSOFTFLOAT_FAST_INT64
andTHREAD_LOCAL
must be defined, or not defined, consistent with how these macro were defined when the SoftFloat library was built.
#include
header
file softfloat.h
, it is recommended that a custom, self-contained
version of this header file be created that eliminates these issues.
8. Contact Information
At the time of this writing, the most up-to-date information about SoftFloat
and the latest release can be found at the Web page
http://www.jhauser.us/arithmetic/SoftFloat.html
.