SMART

Tutorial for beginners.

Written & maintained by Hans Paijmans

paai@kub.nl

Work in progress! Not finished! Use 'reload' often!

Last changes: first week august 1996.

Please start with reading the instructions in the frame below!

Alternatively download the complete tutorial as hands.tgz.



Introduction

The goal of this part of the course is to give enough information, that the student can conduct experiments with the SMART information Retrieval system.
During this session we will demonstrate the use of the system by annotated screendumps rather than by allowing direct contact with the system. There are a number of reasons for this.

  1. SMART is on our university only installed on my small Intel PC with a 80486/66 processor and 16 Megabyte of RAM. Such a machine obviously cannot pull fifty users at the same time.
  2. The user interface of SMART is not really friendly for the beginner. Also, much of its functionality is closely tied up with the powerful Unix operating system, whereas most users are used to the pink and cozy, but heavily mutilated MS-Windows environment.
  3. The documentation of SMART is almost non-existent and very unaccessible.

Please note that this tutorial is by no means complete and may contain various errors!

Organization of the course

Selftest

Occasionally a question will appear in this tutorial. Test how, eh..., Smart you are by trying to answer those questions without immediately sneaking a look at the solution. Try one now!

Does Smart run on a PC?



The creation of a very simple Smart database.

The spec-files

Making the indices



Simple Retrieval with Smart

The interactive mode

Formatting



Advanced actions

So far we only had the 'inv.nnn' and 'inv.nnn.var' files that stored the keywords together with the term frequency. This is about what the 'normal' IR systems of today do and if this was all, we would not need to bother with the quirks of Smart documentation.

Field control

Advanced keyword weighting

Using the atc-weights

Other advanced commands



Smart, Unix and HTML

SMART is very much an Unix program in that it accepts input from stdin and writes to stdout. This makes it possible to use the program for complicated or specialized needs in a way that would have been impossible in an MS-Windows environment. Indeed all the examples that originally come with Smart are written in the Unix cshell script language, which is an added hurdle for non-Unix adepts.

Unix Lives!

Smart and HTML



Other programs

For anyone interested in the analysis of text and documents in the Unix environment it should be useful to realize that there exist quite an number of programs that can be used together with Smart or that in a different way analyse texts. I collected a spate of them (that run under Linux) as Paai's Text Utilities



Where to get it & other pointers

SMART itself is available as C-source ftp://ftp.cs.cornell.edu/pub/smart/smart.11.0.tar.z . It was written for Unix on a Sun station, but after some minor tweaking it compiles and runs beautifully on a common Intel-PC (if you use the freeware Unix-clone Linux in stead of the ubiquitous MS-DOS or MS-Windows). The Linux binaries can be downloaded from ftp:pi0959.kub.nl/pub/smart/smart_linux.tgz, but please refer to the original sources for legal limitations.

There exists a mailinglist: 'smart-people@CS.Cornell.EDU' for problems, but don't expect to be helped with newbee-questions. For a HTML-version of the extant documentation see my WWW-site( the use of Smart). Some links from this page will also refer to these original files.