This is a script for a workshop on using Voyant for the CWRC community.
1.0 Introduction
- The workshop leaders will introduce themselves:
- Stéfan Sinclair, McGill University
- Susan Brown, University of Guelph
- Overview
Voyant is currently a beta release by Stéfan Sinclair and Geoffrey Rockwell. It was previously called “Voyeur” so do not be confused if that name is used. Voyant is the next generation in a series of text analysis tools that include HyperPo and TAPoRware. It provides tables and graphs related to word use across a single document or a collection. Voyant adds, among other things, the ability to handle much larger files than the previous tools could. - Outline
In this workshop we will:- First, look at how to use a single Voyant tool, Cirrus, with a small corpus of Austen texts.
- Then learn how to use the normal “skin” (multi-tool interface) of Voyant with a single text.
- Finally, show how to load your own text into Voyant.
- Now make sure you can connect to the wireless.
- Help
If you need help, connect to Hermeneuti.ca and explore the resources there. Here are some useful links:- The CWRC workshop outline is at – http://cwrc.cs.ualberta.ca/index.php/General:October_2011_Toronto_workshop
- This workshop outline is at – http://hermeneuti.ca/workshops/cwrc2
- Voyant/Voyeur Tools Introduction – http://cwrc.cs.ualberta.ca/index.php/General:Voyeur
- Quick Guide to Voyant/Voyeur – http://hermeneuti.ca/voyeur/users
- Voyant Tools – http://voyeurtools.org (backup version – http://beta.voyant-tools.org)
- Individual Voyant tool descriptions and links – http://hermeneuti.ca/voyeur/tools
2.0 Using a single Voyant Tool: Cirrus
Voyant Tools has a number of different tools that can be composed into skins or used individually. We will start with just one tool called Cirrus that can then spawn other tools. We will try it with Jane Austen’s Persuasion.
Cirrus (Austen’s Persuasion): http://voyeurtools.org/tool/Cirrus/?corpus=JaneAusten&docIndex=5&stopList=stop.en.taporware.txt&toolFlow=simple (backup)
The Cirrus tool shows you a word cloud of high frequency words. Some questions to ask yourself:
- What words did you expect? What words are missing? What words are interesting?
- How does the tool arrange words and choose colours? Is there any correspondence between size and frequency?
Here are some more Cirrus visualizations to consider:
These types of word clouds are prevalent from academia to advertising – they quickly provide an intriguing representation of a text, as demonstrated by this example of studying gendered languages in toy advertising. But they’re ability to rapidly convey a picture with words comes at the cost of information reduction, and some are highly critical of word clouds as hermeneutical tools. What do you think?
Try It: Try clicking on a word. It will launch a second tab or window with a list of the texts in the corpus with the frequency of the word you clicked on.
Try It: Now try double-clicking on one of the texts. This should launch another tab or window with a Key Word In Context (KWIC) of the word in that text.
3.0 Using a Reading Skin
Voyant Tools can also be composed into “skins” that combine tools as panels so that they can be used interactively. Here is the same Austen corpus in a simple skin:
http://voyeurtools.org/?corpus=JaneAusten&stopList=stop.en.taporware.txt (backup)
In this skin clicking in one window will often (but not always) update other windows. Try the following:
- Triggering: Click on words in the Cirrus word cloud. Then click on a text in the Word Trends and play with the KWIC.
- Changing Settings: Try changing the settings for the Cirrus by clicking on the small gear icon. Try playing with the Word Trends
- Showing and Hiding Panels: Try showing and hiding panels using the small up and down arrows in the upper-right of the panels.
When in doubt just restart the session by hitting refresh.
4.0 Using Voyant on You Own Text
Voyant Tools can be used on your own text or corpus. To do that you go to the simple URL for the tool:
Voyant: http://voyeurtools.org
Just the Cirrus tool in Voyant: http://voyeurtools.org/tool/Cirrus/
Backup version: http://beta.voyant-tools.org/
You will get panel that asks you for a text. You can provide:
- One or more URLs to texts on the web
- Upload a text or a zipped collection of texts
- Upload plain text, HTML, or XML texts
- Upload a PDF (and Voyant will try to extract the text)
Voyant is forgiving, but there are none-the-less bugs.
Note that you can create a persistent URL for your corpus – that way your link can be shared or bookmarked and you won’t need to reload the texts into Voyant. Click the save icon in the blue bar at the top and the first URL will be the link for your Voyant corpus.
5.0 Other Stuff
- Other Voyant Tools:
- Austen in Bubblelines
- Persuasion in Bubbles
- Persuasion in Knots
- Austen in the Tool Browser
- Other Voyant Skins:
- Austen in the original Voyeur skin (more analysis, less reading)
- Austen in the Scatter Skin for correspondence analysis (which sounds worse than it is:)
- Austen in Desktop Skin
- Other Tools