HOME >> PROJECTS >> THESAGE >> DOCUMENTATION

TheSage Documentation

Requirements, Features, Limitations, Examples

Documentation is always a work in progress as it is only useful when users are actually able to benefit from it.

We have compiled here the information that we believe will be of practical use. If you find that something merits being included or discussed in more detail, contact us at info@sequencepublishing.com and we will adjust this document as appropriate.



Table of Contents





TheSage's English Dictionary and Thesaurus





1. What is TheSage? [^]

It is best to think of TheSage as two different systems: a knowledge database and a multi-tool interface. The knowledge database consists of a tightly integrated English dictionary and thesaurus.

The dictionary comprises:

The thesaurus encompasses:

The interface allows the user to extract information from the knowledge database in a variety of ways. This is accomplished by means of the following tools:





2. System requirements [^]

TheSage has been developed for the Windows XP/Vista operating systems (versions 1.4.0 and below work in 95/98/ME/2K/XP/Vista). It is highly recommended that the latest OS Service Packs be installed.

TheSage requires downloading a setup file. Once installed, TheSage occupies approximately 20 MB of hard drive space. TheSage has no special RAM requirements and has a small memory footprint.



2.1 Installing [^]

To install TheSage, execute the setup program and follow the instructions.

Once installed, TheSage come be invoked from the Start menu, the Desktop, the Quick Launch toolbar, or directly from its folder in the hard drive.



2.1.1 Portable install [^]

The installer comes with a 'portable' option.

TheSage is 100% portable in and of itself. If the contents of the folder where TheSage is installed are copied elsewhere (a USB flash drive, for example), TheSage will work perfectly. In other words, the 'portable' option in the installer is simply provided for convenience.



2.1.2 Silent install[^]

Run the setup package from the command line with the parameter /S (case sensitive).



2.2 Uninstalling [^]

TheSage's package includes an uninstaller utility (except in 'portable' installations). To run the uninstaller, navigate through the Start Menu or through the hard drive to TheSage's folder. Alternatively, it is possible to uninstall TheSage through the 'Add/Remove Programs' applet located in the 'Control Panel'.



3. GUI elements [^]

The screenshot above shows the main elements of the GUI. These are:





4. Tools [^]

The interface provides a number of tools with which to access TheSage's knowledge database. All tools can be customized in terms of functionality and appearance via the Options dialog.



4.1 Lookup tool [^]

To look up a word or words in TheSage's Dictionary, type it (or them) and then click on 'LookUp'. Alternatively, type the word(s) and then simply press ENTER.

For example: chatoyant, african violet, prestigious, natural law, atomic weight, titanosaur, legerdemain,...

If the word is found in TheSage's index, a new tab will be created displaying this word's information as shown in the screenshot above.

The screenshot shows what is called a lookup tree. The color buttons on each node of the tree (numbers 2, 3, 4, 5 in the screenshot) are referred to as semaphores and serve to toggle (open/close) the display of nested levels.

The information in the tree is conceptually divided into lemma, part(s) of speech, definition(s), thesaurus, and example(s).

Note that any and all displayed words are clickable, automatically triggering a new lookup.

To the right of the lookup tree is the lookup toolbar (number 8 in the screenshot above) which provides quick access to several useful tasks. From top to bottom, these are:





4.2 Wildcard tool [^]

To select index entries that match a wildcard pattern, type the pattern and click on 'Wildcard'. Alternatively, type '=' followed by the pattern and press ENTER.

Patterns are formed by a combination of letters/numbers and special characters (see below).

Special characters:

Examples:

Note that any and all displayed words are clickable, automatically triggering a lookup.



4.3 Anagram tool [^]

To find all anagrams of a character string in TheSage's index, type the string and click on 'Anagram'. Alternatively, type '>' followed by the string and press ENTER.

The anagram search returns only those words that are found in the dictionary, that is, valid English words. It does not return all possible anagrams of a character string and, importantly, will not return inflected forms if these are not in TheSage's index.

Examples:

Note, once more, that any and all displayed words are clickable, automatically triggering a lookup.



4.4 Definition and example concordancer tools [^]

Located in 'Other searches' in the navigation control (number 12 in the screenshot above), the concordancer allows a user to search for words inside definitions and examples. It is sometimes referred to as 'reverse search'.

The definition concordancer searches through definitions and the example concordancer through examples. Each tool can be configured independently in terms of functionality via the 'Concordancer tool' pages in the Options dialog. The appearance of results for both concordancers can also be customized.

To use the definition concordancer, type a word (a single word) in the lookup search box and press the button labeled 'Definitions' (alternatively, type '!' followed by the word and press ENTER). TheSage will parse the definitions in the database and return those that contain the word.

To use the example concordancer, type a word (a single word) in the lookup search box and press the button labeled 'Examples' (alternatively, type '!!' followed by the word and press ENTER). TheSage will parse the examples in the database and return those that contain the word.

Note that the result tabs for the concordancer use tooltips to display information not visible otherwise.



4.5 Semantic similarity tool [^]

Located in 'Other searches' in the navigation control (number 12 in the screenshot above), the semantic similarity tool allows a user to create a custom thesaurus. In the 'Semantic similarity tool' pages in the Options dialog, a user can decide what semantic relationships constitute 'similarity' and how these relationships rank.

To use the similarity tool, type a word (or words) in the lookup search box and press the button labeled 'Similarity'. Alternatively, type '^' followed by the word (or words) and press ENTER.



4.6 Online search tool [^]

Located in 'Other searches' in the navigation control (number 12 in the screenshot above), the online search tool allows a user to search online directly from TheSage. The built-in online searches include dictionary lookups (i.e. Google define), encyclopedia (i.e. Wikipedia), translation (i.e. Google translate), etc. Note that new web addresses can be added via the 'Online search tool' page in the Options dialog.

To use the online search tool, type a word (or words) in the lookup search box and press the button labeled 'Go online' (alternatively, type '@' followed by the word and press ENTER). If the internal browser is used, a new results tab will be created that will include one sub-tab per web adress. If the default (external) browser is used, TheSage will send the web adresses to that browser.



4.7 'As you type' tool [^]

Located in the navigation control (number 12 in the screenshot above), the 'As you type' tool allows a user to conduct a real-time search of TheSage's database index.

To use the 'As you type' tool, start typing in the entry field. A list of entries in the index will be displayed below the entry field. Note that a double-click on any of these entries will trigger a lookup.



4.8 Phonetic wildcard search tool [^]

Located in the navigation control (number 12 in the screenshot above), the phonetic wildcard search tool allows a user to carry out relatively sophisticated pattern searches of the phonetic transcriptions (pronunciations) in the database. Results are displayed by means of tabs, each label showing the pattern employed. Tabs can be closed by middle-clicking on them. A right-click on any of the tabs brings up a context menu. Note that clicking on any of the orthographic/phonetic pairs in the result lists triggers a look up.

TheSage makes use of previously installed fonts that contain phonetic gliphs. In particular, any one of the following fonts needs to be installed in the target system for TheSage to display phonetic fonts: Arial Unicode MS, Lucida Sans Unicode, Charis SIL, Doulos SIL, Thryomanes, Gentium. The first two are generally installed on all Windows OS's. However, if it is the case that none of these fonts are found, TheSage will report the problem and use SAMPA (see table below) for display.

All six fonts can be downloaded, free of charge, from the internet. They are free for personal use but may be restricted for other uses (check the respective licenses if relevant). For your convenience, we supply here links to their location but, be warned, the fleeting nature of internet content may render these links obsolete at any moment:

Assuming several fonts are installed, switching between fonts can be done at any moment via the 'Phonetic search tool' pages in the Options dialog.

TheSage uses the SAMPA encoding to input phonetic characters. That is, typing 'E' will display 'ɛ', typing '@' will display 'ə', typing '{' will display 'æ', typing 'D' will display 'ð', typing 'N' will display 'ŋ', and so on.

Note that a right-click in the entry field of the phonetic wildcard search combobox will invoke a context menu listing all SAMPA characters for ease of reference and insertion.

The following table has been adapted from the SAMPA page.


Keyboard Symbol Description
Vowels
A ɑ Open back unrounded, Cardinal 5, Eng. start
{ æ Near-open front unrounded, Eng. trap
6 ɐ Open schwa, Ger. besser
Q ɒ Open back rounded, Eng. lot
E ɛ Open-mid front unrounded, C3, Fr. même
@ ə Schwa, Eng. banana
3 ɜ Long mid central, Eng. nurse
I ɪ Lax close front unrounded, Eng. kit
O ɔ Open-mid back rounded, Eng. thought
2 ø Close-mid front rounded, Fr. deux
9 œ Open-mid front rounded, Fr. neuf
& ɶ Open front rounded
U ʊ Lax close back rounded, Eng. foot
} ʉ Close central rounded, Swedish sju
V ʌ Open-mid back unrounded, Eng. strut
Y ʏ Lax [y], Ger. hübsch
Consonants
B β Voiced bilabial fricative, Sp. cabo
C ç Voiceless palatal fricative, Ger. ich
D ð Voiced dental fricative, Eng. then
G ɣ Voiced velar fricative, Sp. fuego
L ʎ Palatal lateral, It. famiglia
J ɲ Palatal nasal, Sp. año
N ŋ Velar nasal, Eng. thing
R ʁ Voiced uvular fricative or trill, Fr. roi
S ʃ Voiceless palatoalveolar fricative, Eng. ship
T θ Voiceless dental fricative, Eng. thin
H ɥ Labial-palatal semivowel, Fr. huit
Z ʒ Voiced palatoalveolar fricative, Eng. measure
Length and Stress
: ː Length mark
" ˈ Primary stress
% ˌ Secondary stress


TheSage checks for the validity of a pattern before conducting a search and, therefore, understanding the syntax of patterns is important. Wilcard patterns can contain the following elements in addition to phonetic characters/symbols:



 Wildcard  Description
* Matches any number of (including none) consonant and vowel symbols.
? Matches any one consonant or vowel symbol.
$ Matches any one consonant symbol.
# Matches any one vowel symbol.
(ŋɛθə) Matches any one of the symbols inside the parenthesis ('ŋ', 'ɛ', 'θ', or 'ə', in this example).
(?-ŋɛθə) Matches any one consonant or vowel symbol except those listed ('ŋ', 'ɛ', 'θ', or 'ə', in this example).
($-ŋθ) Matches any one consonant symbol except those listed ('ŋ' and 'θ', in this example).
(#-ɛə) Matches any one vowel symbol except those listed ('ɛ' and 'ə', in this example).


The following examples are meant to illustrate phonetic search patterns.


 Pattern  Description  Would match words like
p#t Words that start with /p/, end with /t/, and have any single vowel as nucleus. pet, pot, put.
$?#?# Words of exactly five phonemes in length that start with a consonant, followed by a vowel or consonant, followed by a vowel, followed by a vowel or consonant, and end in a vowel. baby, beauty, daily.
*##$##* Words that contain the sequence vowel-vowel-consonant-vowel-vowel in any position. association, daylight, highway.
(pb)(#-əæ)(td) Words that start with either /p/ or /b/, end with either /t/ or /d/, and have any single vowel as nucleus except /ə/ and /æ/. beat, bed, put.
*ðo* Words that contain the sequence /ðo/ in any position. although, though, without.
ə*ə(rn) Words that start with schwa and end with schwa plus either /r/ or /n/. admirer, opinion, unsure.
str*(tθ) Words that start with 'str' and end with either /t/ or /θ/. straight, street, strict.
ɔ(lf)*(?-r) Words of at least three phonemes in length that start with /ɔ/ followed by either /l/ or /f/ and end with any single consonant or vowel except /r/. almost, already, office.
*dʒ##* Words that contain, in any position, the sequence /dʒ/ followed by at least two vowels. joint, joke, rejoice.
(?-pbtdkg)#(ɪʊ)*(nl)d Words of at least five phonemes in length that start with any vowel or consonant except /p/, /b/, /t/, /d/, /k/, and /g/ followed by any vowel pairing with either /ɪ/ or /ʊ/, and end in either /n/ or /l/ pairing with a /d/. find, hold, round.

Note that a double-click on any of the results will trigger a lookup.



4.9 'Not found' tool [^]

When a word is not found in TheSage (that is, when a lookup fails), a tab is created in the results tab control (number 7 in the screenshot above) that displays two lists. The first list contains those words in TheSage's index that sound like or are spelled similarly to the one looked for. The second list shows TheSage's index in ascending alphabetical order highlighting the entry that matches the most amount of leading characters. In short, the 'Not found' tool acts as a spell-/sounds-like assistant.

The online search is also available in each of these tabs.



4.10 Speech tool [^]

If the Speech SDK is installed in the user's computer, TheSage can be turned into a speaking dictionary. Click on the phonetic transcription of a lookup to hear how it is pronounced.

Moreover, TheSage can read part or all of a lookup by clicking on the speech icon in lookup toolbar (number 8 in the screenshot above). How much or little is read is determined in the 'Speech tool' page in the Options dialog.

The quality of the voice depends on preinstalled software. If the default voices that come with Windows are not sufficient, there exist third-party solutions. The 'Speech tool' page in the Options dialog allows the user to select which voice to use.



4.11 History tool [^]

TheSage keeps a record of all words (lookups and other searches) carried out.

There are two ways to access the History. First, there is a dropdown list in the Lookup search box (number 1 in the screenshot above) that shows only those entries typed in the current session. Second, there is a panel in the Navigation control (number 12 in the screenshot above) that shows all entries from the current as well as previous sessions.

There are a number of ways to customized the History tool. See the 'History tool' pages in the Options dialog.



5. Other features [^]

In order to make TheSage as useful as possible, supplementary features have been implemented.



5.1 Word capture [^]

TheSage can look up words directly from almost any program (IE, Word, Firefox, Outlook, Thunderbird,...). This is accomplished by selecting the relevant word in the appropriate program and by pressing TheSage's Hotkey. TheSage will then read the word and carry out a lookup automatically.

A hotkey is a combination of keys that, when pressed simultaneoulsy, invoke a program. By default, TheSage's hotkey is CTRL + SHIFT + A but users can chose whatever combination they prefer via the 'Word capture' page in the Options dialog.



5.2 System tray menu [^]

A right-click on TheSage's icon in the System Tray brings up a menu where the following are available (top to bottom):





5.3 Color choosers [^]

In order to ease the selection of custom colors, the 'color chooser' controls (in the Options dialog) implement a magnifier-dropper (RIGHT-CLICK or SHIFT+P) in addition to the standard Windows color dialog (LEFT-CLICK or SHIFT+C). The magnifier moves with the mouse cursor (or arrow keys) and can be used to inspect any location on the screen. The magnifier zooms in/out by pressing SHIFT + UP/DOWN, and increases/decreases its size by pressing SHIFT + RIGHT/LEFT. Once over the desired pixel, LEFT click on it to select its color or RIGHT click to dismiss the magnifier.

Moreover, the color chooser controls remember the default color (shown in a small top-right window) which can be restored by a middle-click (or SHIFT+R) on the control.



6. Updates and contact [^]

TheSage's project has accomplished all its core objectives and the package is fully functional. Nonetheless, updates will be made available as the project evolves.

Visit www.SequencePublishing.com for up-to-date information regarding TheSage.



Feedback Form

We value feedback. Feel free to contact us via the form below (or send us an email if you prefer) with comments and questions.


Name:
Email:
Comments:




7. License agreement [^]

READ CAREFULLY ALL TERMS OF THIS LICENSE AGREEMENT BEFORE USING THIS SOFTWARE

IF YOU DO NOT AGREE WITH THE TERMS OF THIS LICENSE AGREEMENT, YOU ARE NOT AUTHORIZED TO USE THIS SOFTWARE

Permission is hereby granted to anyone to freely use TheSage's English Dictionary and Thesaurus (hereafter referred to as 'this software') for any purpose with the exception of including it in a product, in which case both permission and acknowledgment are required.

To the maximum extent permitted by applicable law, the software and documentation are provided "as is". Franc Morales and Leah Gilner (hereafter referred to as 'the authors') disclaim all other warranties and conditions, either express or implied, including, but not limited to, implied warranties of merchantability, fitness for a particular purpose, conformance with description, title and non-infringement of third party rights. In no event shall the authors be liable for any indirect, incidental, consequential, special or exemplary damages or lost profits whatsoever (including, without limitation, damages for loss of business profits, business interruption, loss of business information, or any other pecuniary loss) arising out of the use or inability to use the software product, even if the authors have been advised of the possibility of such damages.

The authors allow you to distribute this software if all of the following conditions are met: you are not charging any money for it, the distribution files are kept together and unmodified, and the authors' permission is obtained before distribution. You may give this software package to friends or colleagues, burn it onto cd-rom's (or other media) and upload it to free/shareware sites as long as the original package remains unmodified.

All rights to this software (including any images or text incorporated into this software) are owned by the authors.

You may not disassemble or reverse engineer any part of this software.

You may not rent or lease this software.

The authors are not required to make available technical support for this software. The authors may, from time to time, revise or update this software. In so doing, the authors incur no obligation to furnish such revision or updates to you.

This license agreement will immediately and automatically terminate without notice if you fail to comply with any one of the terms and conditions cited. Upon termination of this license agreement, you agree to promptly remove the software from your system.

TheSage's English Dictionary and Thesaurus. Copyright 2001-2009 by Franc Morales and Leah Gilner. All rights reserved.

WordNet 2.0 Copyright 2003 by Princeton University. All rights reserved.
WordNet 3.0 Copyright 2006 by Princeton University.

THIS SOFTWARE AND DATABASE IS PROVIDED "AS IS" AND PRINCETON UNIVERSITY MAKES NO REPRESENTATIONS OR WARRANTIES, EXPRESS OR IMPLIED. BY WAY OF EXAMPLE, BUT NOT LIMITATION, PRINCETON UNIVERSITY MAKES NO REPRESENTATIONS OR WARRANTIES OF MERCHANT-ABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE OR THAT THE USE OF THE LICENSED SOFTWARE, DATABASE OR DOCUMENTATION WILL NOT INFRINGE ANY THIRD PARTY PATENTS, COPYRIGHTS, TRADEMARKS OR OTHER RIGHTS.