This post is hopefully the first of many more to come demonstrating my efforts to port as many advanced projects written in C/C++ to WASI as possible. What is WASI? WASI is a system interface to run WebAssembly outside the web. If you would like to learn more, I cannot recommend enough the fantastic blog post by Lin Clark Standardising WASI. You should definitely go and check it out!
We’ll start our journey with a fantastic, open-source text-to-speech program called flite. Without further ado, let’s dig in!
Firstly, make sure you’ve got all the right tools for the job. In this case, I mean
the WASI SDK which you can get from CraneStation/wasi-sdk.
The SDK features clang-8
which is capable of targetting wasm32-unknown-wasi
triple plus
it also features the WASI sysroot. So head over to the website, and get the SDK.
When porting any C program to WASI, there are a few things to watch out for. Firstly,
WASI currently doesn’t support setjmp
nor sockets so the source code will need to modified
accordingly (see here
for missing functionality in WASI). In case of flite
, it’s somewhat easier
as there are two macros DIE_ON_ERROR
and CST_NO_SOCKETS
which pretty much handle most it for us.
Secondly, remember to correctly set the paths to the compiler, llvm-ar
, llvm-ranlib
, etc. That is, don’t trust
configure
to get it right (since WASI is still experimental, the tools won’t be included
as OS packages any time soon). Otherwise, you are almost guaranteed to experience missing symbols
during linking or other weird errors.
Finally, if the project is using configure
, you’ll need to take extra care at including WASI as a
supported host. A brilliant, short-and-simple version
suggested by Frank Denis
is using grep
and sed
:
$ grep -q -F -- '-wasi' config.sub || sed -i -e 's/-nacl\*)/-nacl*|-wasi)/' config.sub
I’ve modified the source of flite
to make it easier to port it to WASI. The source code can be cloned
from kubkon/flite. Furthermore, if you’re interested in the changes
I’ve had to introduce to the original project, simply compare the fork against the original in Github.
Thus, let’s clone it
$ git clone https://github.com/kubkon/flite
At this point, you need to have the WASI SDK installed. If you don’t have it yet, head over to
CraneStation/wasi-sdk and install it. For the rest
of this blog post, I’ll assume that you have the SDK installed in /opt/wasi-sdk
.
Let’s spin configure
and make
then!
$ ./configure --host=wasm32-unknown-wasi CC="/opt/wasi-sdk/bin/clang --sysroot=/opt/wasi-sdk/share/sysroot" AR=/opt/wasi-sdk/bin/llvm-ar RANLIB=/opt/wasi-sdk/bin/llvm-ranlib
$ make
If everything went well, you should now have bin/flite
program compiled to WASI, and you should be able
to run it in any WASI compatible runtime such as CraneStation/wasmtime:
$ wasmtime bin/flite -- --help
flite: a small simple speech synthesizer
Carnegie Mellon University, Copyright (c) 1999-2016, all rights reserved
version: flite-2.2-current Sep 2018 (http://cmuflite.org)
usage: flite TEXT/FILE [WAVEFILE]
Converts text in TEXTFILE to a waveform in WAVEFILE
If text contains a space the it is treated as a literal
textstring and spoken, and not as a file name
if WAVEFILE is unspecified or "play" the result is
played on the current systems audio device. If WAVEFILE
is "none" the waveform is discarded (good for benchmarking)
Other options must appear before these options
--version Output flite version number
--help Output usage string
-o WAVEFILE Explicitly set output filename
-f TEXTFILE Explicitly set input filename
-t TEXT Explicitly set input textstring
-p PHONES Explicitly set input textstring and synthesize as phones
--set F=V Set feature (guesses type)
-s F=V Set feature (guesses type)
--seti F=V Set int feature
--setf F=V Set float feature
--sets F=V Set string feature
-ssml Read input text/file in ssml mode
-b Benchmark mode
-l Loop endlessly
-voice NAME Use voice NAME (NAME can be pathname/url to flitevox file)
-voicedir NAME Directory containing (clunit) voice data
-lv List voices available
-add_lex FILENAME add lex addenda from FILENAME
-pw Print words
-ps Print segments
-psdur Print segments and their durations (end-time)
-pr RelName Print relation RelName
-voicedump FILENAME Dump selected (cg) voice to FILENAME
-v Verbose mode
$ wasmtime --dir=. bin/flite -- test.txt test.wav
$ file test.wav
test.wav: RIFF (little-endian) data, WAVE audio, Microsoft PCM, 16 bit, mono 8000 Hz
I hope this was useful for you! If you have any questions, comments or suggestions, feel free to drop me a line!