Initial commit of version 0.1

dbry · Sep 11, 2022 · 1a3f1e9 · 1a3f1e9
commit 1a3f1e9
Show file tree

Hide file tree

Showing 7 changed files with 1,612 additions and 0 deletions.
diff --git a/README.md b/README.md
@@ -0,0 +1,113 @@
+## AUDIO-RESAMPLER
+
+Audio Resampling Engine & Command-Line Tool
+
+Copyright (c) 2022 David Bryant.
+
+All Rights Reserved.
+
+Distributed under the [BSD Software License](https://github.com/dbry/audio-resampler/blob/master/license.txt).
+
+## What is this?
+
+This is a simple audio resampler, written entirely in C and specifically targeting embedded systems. It
+provides fine control over both the CPU load and memory footprint so it can be easily adapted to a wide
+range of hardware (e.g., ESP32 to high-end ARM). It is also well suited for ASRC (asynchronous sample rate
+converter) applications because it provides a function to query the exact phase position of the resampler
+(which is required in the feedback loop of an ASRC).
+
+The package includes a command-line program (**ART**) to experiment with the resampler and serve as example
+code for the engine API. The resampling and filtering code works with only 32-bit float audio data, however
+the command-line program includes examples of how to convert to and from integer audio samples, including
+the use of highpass TPDF dither and 1st-order noise shaping.
+
+**ART** works with Microsoft WAV files and includes four quality presets that set the number and size of
+the sinc filters and serve as a starting point for experimentation:
+
+Preset|Number of sinc filters|Number of taps per filter|RAM use
+------|----------------------|-------------------------|-----------
+-1    |       16             |            16           | 1 Kbyte
+-2    |       64             |            64           | 16 Kbytes
+-3    |      256             |           256           | 256 Kbytes
+-4    |     1024             |          1024           | 4 Mbytes
+
+Preset **-3** is the default and is a reasonable compromise for high-quality resampling on a PC. Presets **-1**
+and **-2** are more suited for realtime use on embedded systems and **-4** represents the highest quality
+available for this tool.
+
+**ART** supports 8-bit, 16-bit, and 24-bit integer samples, as well as 32-bit floating-point samples. Any
+number of channels are supported. Normally the output bitdepth is set to the same as the input file, however
+this can be forced to one of the other supported bitdepths with the **-o** option.
+
+Both the resampling and filter engines are endian-safe and the command-line program is endian-aware
+(although, of course, WAV files are always little-endian).
+
+## Technical Description
+
+The resampler uses windowed sinc interpolation filters. A configurable number of filters are generated on
+initialization representing subdivisions of the unit circle. Normally, the output sample value is calculated
+by convolving the input samples (the required history is stored in the resampler) with the two sinc filters
+on either side of the desired phase angle, and then linear interpolating to get the precise result. If it is
+known that there will always be an exact sinc filter (or there's enough room for many filters) then the
+interpolation can be skipped and only the nearest sinc filter is used (controlled with the **-n** option
+in the CLI).
+
+The sinc filters are generated with either Hann or Blackman-Harris (4 term) windowing functions. The
+Blackman-Harris is usually the best choice (and the default in the CLI) because it has very good stopband
+(side-lobe) rejection. However, in some situations (e.g., short filters) the Hann window might be better
+because it has a sharper transition (this is controlled in the CLI with the **-b** and **-h** options).
+
+For upsampling or simple resampling operations the sinc filters preserve the original waveform (i.e., they
+are purely interpolative). However, for downsampling applications this is not sufficient because aliasing
+occurs if content frequency moves above the Nyquist frequency of the new sampling rate. For these cases
+the sinc filters are constructed with an implicit lowpass. This is also available for upsampling if
+desired to eliminate frequencies close to the Nyquist frequency of the input and reduce aliasing
+further at the expense of some HF loss (e.g., it might be desirable to set a lowpass of 20 kHz when
+resampling up from or down to 44.1 kHz even though it's not strictly required). The lowpass option is
+enabled with the **-l** option in the CLI).
+
+It is sometimes desirable to reduce aliasing further with lowpass filters either before downsampling
+or after upsampling as this can be more efficient than increasing the length of the sinc filters. This is
+enabled with the **-p** option in the CLI and implements a cascaded pair of 2nd-order biquads. Note that
+unlike the sinc filters, these filters are not linear-phase and will introduce group delay.
+
+## Building
+
+To build the command-line tool (**ART**) on Linux or OS-X:
+
+> $ gcc -Ofast art.c resampler.c biquad.c -lm -o art
+
+The "help" display from the command-line app:
+
+```
+ Usage:     ART [-options] infile.wav outfile.wav
+
+ Options:  -1|2|3|4    = quality presets, default = 3
+           -r<Hz>      = resample to specified rate
+           -g<dB>      = apply gain (default = 0 dB)
+           -s<degrees> = add specified phase shift (+/-360 degrees)
+           -l<Hz>      = optional lower lowpass frequency
+           -f<num>     = number of sinc filters (2-1024)
+           -t<num>     = number of sinc taps (4-1024, multiples of 4)
+           -o<bits>    = change output file bitdepth (8|16|24|32)
+           -n          = use nearest filter (don't interpolate)
+           -b          = Blackman-Harris windowing (best stopband)
+           -h          = Hann windowing (fastest transition)
+           -p          = pre/post filtering (cascaded biquads)
+           -q          = quiet mode (display errors only)
+           -v          = verbose (display lots of info)
+           -y          = overwrite outfile if it exists
+
+ Web:       Visit www.github.com/dbry/audio-resampler for latest version and info
+```
+
+## Caveats
+
+- The resampling engine is a single C file, with another C file for the biquad filters. Don't expect
+the quality and performance of more advanced libraries, but also don't expect much difficulty integrating
+it. The simplicity and flexibility of this code might make it appealing for many applications, especially
+on limited-resource systems.
+- In the command-line program, unknown RIFF chunk types are correctly parsed on input files, but are
+*not* passed to the output file, and pipes are not supported.
+- The command-line program is not very restrictive about the option parameters, so it's very easy to
+get bad results or even crashes with crazy input.