Download the ICE-GB R1 Sample Corpus
This Sample is from Release 1 of ICE-GB and is supplied with Version 3.0 of ICECUP.
Note that this package has been superseded by ICECUP 3.1 and ICE-GB Release 2. The new download package is available here.
Questions:
Whats in the package? | What
do I need to run ICECUP? | What can ICECUP
do? | Will ICECUP continue to be developed?
| Feedback | Download now
Other Frequently Asked Questions (including solutions to many download problems)
The ICE-GB Sample Corpus is available for download NOW.
It comes complete with 10 texts selected by Gerry Nelson from the ICE-GB Corpus, and the state-of-the-art ICECUP III software written by Sean Wallis.
The Sample Corpus comes in two flavours:
- The Minimum sampler without Help (for easy downloading).
- The Complete sampler, with the main Help file and Getting Started tutorial.
WHAT IS IN THE SAMPLE CORPUS PACKAGE?
- Ten texts (over 20,000 words), fully parsed and annotated, exactly as they are in ICE-GB.
- The latest release of ICECUP III. This is a full working version of the software (see below).
- Example Fuzzy Tree Fragments.
- Option: Full help files (around 3Mb extra).
The sample contains the following ten texts, shown in the last column. You can view these texts and their classification when you download and install the software. The complete ICE structure is visible from ICECUPs Corpus Map.
Spoken Texts (300) | Dialogues (180) | Private (100) | face-to-face conversations (90) |
S1A-010 S1A-094 |
Public (80) | classroom lessons (20) |
|||
Monologues (100) | Unscripted (70) | spontaneous commentaries (20) |
S2A-011 | |
Scripted (30) | broadcast talks (20) non-broadcast speeches (10) |
S2B-026 | ||
Mixed (20) | broadcast news (20) |
S2B-002 | ||
Written Texts (200) | Non-printed (50) | Non-professional writing (20) | untimed student essays (10) student examination scripts (10) |
W1A-001 |
Correspondence (30) | social letters (15) business letters (15) |
W1B-001 | ||
Printed (150) | Academic writing (40) | humanities (10) social sciences (10) natural sciences (10) technology (10) |
W2A-005 | |
Non-academic writing (40) | humanities (10) social sciences (10) natural sciences (10) technology (10) |
|||
Reportage (20) | press news reports (20) | W2C-009 | ||
Instructional writing (20) | administrative / regulatory (10) skills / hobbies (10) |
W2D-018 | ||
Persuasive writing (10) | press editorials (10) | |||
Creative writing (20) | novels / stories (20) |
WHAT IS NOT INCLUDED?
Release 1.0 of ICE-GB is supplied on CD-ROM. ICE-GB contains five hundred texts of spoken and written contemporary British English. To obtain the other 490 texts, you must order the CD-ROM. If you want to do this, click here.
WHAT DO I NEED TO RUN ICECUP III?
ICECUP runs on PCs under Windows 3.1 and above. It has been tested exhaustively on 3.1, 95 and 98. Owing to the nature of the program, we recommend a fast processor and a fast hard disk, although these are not essential. ICECUP will run on any stand-alone or networked PC from a 386 running Windows 3.1 with 8MB upwards.
Sampler system requirements There are two install packages, with hard disk capacity requirements as follows:
We have tested the software extensively on platforms that we have access to, and we have witnessed it working OK on others. The most up-to-date list is shown below.
Feedback: Please tell us if you try to run ICECUP on platforms other than PCs running Windows 95/98/ME or Windows 3.1. We want to know about your software problems, to help you and other end users. Since we are supplying software free, and as is (see the licence agreement) with a Sample Corpus taken from ICE-GB, we think that it is only fair that you give us some feedback. If ICECUP runs brilliantly on Windows XP or crashes dismally on a Mac, tell us. Email us at the Survey (s.wallis@ucl.ac.uk) so that (a) we can try to solve any outstanding problems, and (b) tell others about them. System requirements for the ICE-GB corpus (CD-ROM) These differ from the above only in terms of hard disk space. You need 83Mb to install the entire corpus. Note that you can also run searches off the CD without installing anything. The software is identical to that supplied with the sample corpus. You can therefore try before you buy. If in doubt, install the sample corpus and software before ordering the CD. |
WHAT CAN ICECUP DO?
ICECUP is a corpus exploration tool designed for syntactically parsed corpora. It allows you to experiment with, and explore the corpus. ICECUP has a number of facilities to enable you to do this.
- The corpus map depicts the structure of the corpus and its texts from the top down.
- The text browser allows you to browse the text and the results of queries, reveal and hide annotation, and perform concordancing.
- The tree viewer allows you to see the full parse analysis of the corpus.
But thats only the beginning...
There are a number of sophisticated query systems for searching the corpus, including
- Markup queries
- Exact and inexact grammatical node queries
- Text fragment queries
- Fuzzy Tree Fragment queries
- Sociolinguistic variable queries
- Random sampling
These queries may be combined using Drag and Drop logic.
WILL ICECUP CONTINUE TO BE DEVELOPED?
Yes. ICECUP is being developed under the auspices of the ESRC Corpus Queries project. We will continue to develop ICECUP at least until the end of January 1999 and new versions of ICECUP, with the sampler, will be freely available from this site. We suggest that you bookmark this page and watch this space.
This means that if you buy the ICE-GB CD-ROM now, you will be able to upgrade to later versions of ICECUP at cost price. We will email users of the CD-ROM to let them know when a new version is available.
Version 3.0 of ICECUP is available to download from this section of the website. The new ICECUP 3.1 is available in a beta form from here.
FEEDBACK
As mentioned above, we would like feedback on technical problems and successes encountered running ICECUP on platforms that we have not been able to test ourselves.
PREPARE TO DOWNLOAD!
Download ICECUP 3.0 and the ICE-GB Sample Corpus by clicking here.
This page last modified 14 May, 2020 by Survey Web Administrator.