The usage of Sally is documented in a classic manual page (man page), which covers command-line options, configuration files, and the different modes of operation.
Sally is developed in plain C. Its functionality for embedding strings is organized into modules that are documented with Doxygen annotations. Although Sally is not intended to be used directly as a library, the programming reference may help when integrating Sally with other software frameworks.
Sally programming reference
Reference for modules and data structures
(Automatically generated from source code)
The following technical articles detail the background of the embedding implemented in Sally, from the design and extraction of string features to the computation of distance and kernel functions for strings:
Sally: A Tool for Embedding Strings in Vector Spaces
Konrad Rieck, Christian Wressnegger, and Alexander Bikadorov.
Journal of Machine Learning Research (JMLR), 13(Nov):3247-3251, 2012.
Linear-time Computation of Similarity Measures for Sequential Data
Konrad Rieck and Pavel Laskov.
Journal of Machine Learning Research (JMLR), 9(Jan):23-48, 2008.