Imaging in Bulk for the Internet

Michael Greenhalgh
Professor of Art History
Australian National University,
Canberra ACT 2600
Email: Michael.Greenhalgh@anu.edu.au
URL: http://rubens.anu.edu.au

1 Introduction: History and problems of the site

I began preparing images at video resolution (768x525) in the autumn of 1993, and my server ( http://rubens.anu.edu.au) went live on 4 January 1994. From the beginning I have been convinced that what the Web lacks is high-quality, authoritatively documented content. It was clear from the beginning that the attractiveness of Web browsers was a poisoned chalice and that most presentations were superficial and of no use for anything serious to bear comparison with a book or a scholarly paper.

I have not changed my view over the intervening two-and-a-half years, because the growth of the Web has not changed its basic complexion. The Web now has some excellent sites, but the majority are trivial, poorly maintained, often self-indulgent and low in nourishment value. I have aimed to make ArtServe interesting both to the student of Art History, and to the general viewer interested in art and architecture in the Mediterranean.

ArtServe is accessed from all over the world some 36,000 times per day on average (that is, well over 3 gigabytes of material per week). Access statistics are available at http://rubens/current.html .

The server has some notable deficiencies, many of which are to do with a dearth of staff to help in its upkeep, others of which have to do with the nature of Web servers. Staffing problems underline a basic difficulty with serious servers - namely that, the bigger they get, the more complicated they get, in a progression which is far from linear. This problem is highlighted by the manifest deficiencies of current server technology. Briefly, any organisation (and its concomitant, searching) must be arranged using external programs: it is not imposed - as it should be - by the server. Maintenance of large sites - and Artserve currently holds some 19,000 images - therefore becomes more difficult than it need be and users do not get the full benefit from them because they cannot necessarily find what they seek.

'Content' for an Art Historian means images, mapped at the very least by a datafile, and preferably with a searchable forms interface to make life easier for the user. This paper discusses the equipment used to take the server from the initial 2,800 images to the current 19,000 - well over five gigabytes of data - and the programs used to regiment the images to make them smoothly accessible to the server. It is a straightforward account, with illustrations to pretty things up a bit, and an Appendix which details the various programs written to process the images and make them available in a couth fashion to the server.

1.1 The Processes Involved

To be able to display digital images in bulk, the following processes are necessary:

  1. Digitising the 'real world' photograph, object, slide or negative;
  2. Conversion to suitable formats (JPEG and GIF);
  3. Sizing the images appropriately;
  4. Organising the images into directories; and
  5. Grouping the images into HTML pages, or suites of pages;

There are various possibilities along the way. The laserdisk provides a convenient half-way house, storing analog frames at video resolution, which the Sony PHV-A7E Photo Video camera presents to it as a video or S-VHS stream for capture on disk. The DEC Alpha computer has a digital capture card in it, but direct presentation of the frames to it would be very tedious. Using the two devices described here, a 'hit' rate of well over 200 images per hour can be attained, and these are then digitised in batch (into JPEG images) whilst the operator does something else.

Direct digital devices such as the Nikon E2 digital camera and the Nikon scanner present opportunities in that they offer four times video resolution (in the case of the camera) and much higher resolutions in the case of the scanner. Such resolutions present problems at the same time because the speed of the Web, and of people's connections, means that such large images cannot conveniently be viewed without custom-built software, called zoom, which we are currently developing. As for speed, the Nikon E2 (in my cheaper configuration) manages an image every couple of seconds (a sports variety, with more memory, can do continuous frames); written to PCMCIA cards (typically 30 images per 10Mb card, depending on quality desired), the images are of excellent quality. The camera is certainly capable of bulk-processing, but I do not use it on a copystand, preferring the Sony device detailed below. As for the scanner, the hopper holds about 50 (thin) mounted slides, and the speed of processing depends on the memory the machine has available.

A very flexible device is the Sony video camera with framebuffer, which can be used not only for photographs and prints (for example), but also for negatives and slides, with light transmitted through a lightbox. The lens (a Fujinon S16x6.7 BERM-18) allows such close-focusing that one-third the area of a 35mm slide can be captured as a full frame.

2 Hardware

Rubens began as a Digital Ultrix machine, and then got upgraded to a Digital Alpha. Here it would have stayed but, rapidly running out of SCSI connections, it proved cheaper to change machines, leaving the Alpha for development work. Rubens is now a Pentium 133 with 64Mb of memory, a dual SCSI chain, 2 CDROM drives, and 20Gb of storage.

Imaging hardware has evolved over the past three years. Beginning with a S-VHS video camera feeding into a 24-bit board on an Amiga, the suite is now as follows:

3 Software

Now that rubens runs under Linux, no commercial software is used anywhere in the operation, save for the W95 interface to the scanner. All the rest is freeware or shareware, or written in-house, usually in perl.

The basic image-processing utilities available under flavours of Unix - xv, xli, ImageMagick, and the pbmtools - all proved most useful. The software written in-house is described in an Appendix to this paper.

4 Conclusion
Imaging: the next five years

Several trends are discernible in the imaging world, and they are related to the increasing speed of networks and modems, to the high quality and lower cost of technological innovations, and hence to peoples' enhanced expectations.

Three years ago, any digital image appeared a marvel. Today, when a reasonable-quality digital camera (offering video resolution) costs $1,000, expectations have risen so that bigger and better images are making their appearance. Three years ago, my site was devoted to images exclusively of video resolution. Now, however, with the purchase of devices such as the Nikon E2 digital camera, and the Sony DXC-930P (with framebuffer), images of four-times and five-times video resolution respectively can be generated.

Storing such images in JPEG format is little problem given the cost of hard disks, but serving them presents problems because their size is usually still too large for most people. Users like big images, but they need ways of sampling sections and of having the image delivered to them (not necessarily involving a Web browser). Hence the development of our zoom program - see below, in the Appendix.

But 'bigger and better', whilst representing the progression in imaging, has not been reflected in Web technology, and herein lies the rub for the development of large sites. The current generation of servers and clients is easy to install and generally works first time. But the technology was never designed to deal with large quantities of material, perhaps in different media (text, sound, images, postscript files, video) so that we have seen the development of a host of add-ons which attempt to mitigate perceived deficiencies. Programs like wwwstat and pwebstats look after the reporting of log files, whilst Harvest/Glimpse allows us to pretend that the heaps of material underneath the Home Page really are in something approaching order, rather than chaos. CGI scripts help to impose various layers of order as well.

Professor Maurer and his team at the University of Graz have developed HyperG (now to be called HyperWave) as a 'second generation' server and set of clients (for various platforms.

Apart from the obvious problem with URLs (which should obviously become Universal resource Names), they point out problems in the current generation with scalability, with searching and with a basic lack of structure. HyperWave deals with these matters and also offers bidirectional links (hence link consistency - no more dead links), a read-write database (their description of the server), a full-text server with inverted indexes, a document dache server, and our old friend from databases - namely, referential integrity.

Appendix:
Software for Image-Processing in Bulk

Throughout the work with ArtServe, we have kept in mind the need to provide users in Art History (and any other discipline which uses a lot of images backed by a database) with effective, robust and reasonably simple software - although it should be made clear that, as the Web changes, so the features and type of the programs needed to populate it will also change.

Software

The guiding principles adopted for our software were as follows:

The programs divide into several groups:

Writing Image Datafiles

When dealing with large numbers of images, writing the datafile which describes them can be a bugbear. All the datafiles used on the ArtServe server are in plain ASCII format - there is no traditional 'database handler'.

The simplest method for writing such files is to do it directly into a text editor (normally vi, and cut and paste when (as is often the case) the bulk of records needs to be repeated - as, for example, when doing a series of records of the West facade of Chartres. The disadvantage of this method is that it is not particularly fast, and other arrangements need to be made for viewing the images to be catalogued, in their correct sequence.

This matter has now been addressed with a program using a Web interface, and with the (temporary) name of Bus-Queue which soon got changed to Image-Queue. First, the images are captured, and processed to provide thumbnail GIFs to sit alongside the larger JPEGs. Under my system, all images are filed in numerical directories, which makes various kinds of processing much easier.

The database records are entered using a forms interface to the Web browser, and this can be tailored using setup screens. Two types of entry are possible - namely, type-in boxes and pull-down menus. The labels and lengths of the former are set up in advance, as are the pull down menus, the latter being particularly useful when a restricted set of values is required, rather than any whimsy from the person doing the entering. No parameters are fixed in the rock, and all may be changed when (as is most likely) one realises the need to increase a type-in length here or make an addition to a menu there. An additional setup feature allows the user to specify how to treat the next record, obvious possibilities being to retain the parameters of the previous record and, in the case of some numerical fields, up the number by one.

But the program is called Image-Queue because of the way it presents the thumbnail images so the user can see what is going on. The image to be databased appears in the middle of the window, with the next four across the bottom of the window. (To examine the image in detail, click on it to bring up the large JPEG.) Once databased, the tally augments by one, new record appears, and the first thumbnail goes top left on the window. The next one joins it, and so on. Thus, after completing four new records, there are nine thumbnail images visible - the four already completed, the one currently being done, and the next four in the sequence. They shuffle from bottom right to top left - hence the analogy with a bus queue.

Datafiles and the Display of Databased Images

Text Files

Image Processing

Database Access

Electronic Lectures and Tutorials

How do such programs aid the provision and dissemination of learning materials for students, and their use in lectures, seminars and tutorials?

The traditional way of working with images is to use 35mm slides, presented (often two by two, using two projectors) in lecture or tutorial room. The 35mm slide replaced the 6x6 'lantern slide' gradually from the late 1950s, arguably with some loss in quality of the image, albeit with the addition of colour. 35mm slides are compact and portable, but of limited life-span (roughly, the more they are projected, the quicker they degrade) and expensive to purchase, repair and refile.

The University of British Columbia tried to get away from the disadvantages of slides in the mid-1980s, by 'enregistering' all student images onto a laserdisk. This gave images of video quality (roughly 760 by 525 pixels), and each display station needed a laserdisk player and a monitor in order to function.

A move toward computer storage of images and their display over the network is probably inevitable because it gets round the essential stupidity and wastefulness, in this networked age, of each group of users having a collection of images which duplicates every else's collection all around the world.

This is not to say that such procedures are necessarily cheaper than 'manual' work with 35mm slides; just as laserdisk 'automation' requires a player for each workstation, so the display of digitised images across the network requires the use of a video projector, and of a sufficiently speedy computer (and network) to feed each projector. The three-gun projectors offer the best quality, but are very expensive.

In order for work with digital images to be a practical proposition, programs need developing which will allow lecturers and students to manipulate and regiment images for presentations, private study and revision. It is essential that such tasks require the minimum of computer knowledge; hence, using the World Wide Web seemed the logical vehicle, given its workability on all kinds of computers. And because we have developed a system of using small thumbnail images (stand-ins for their larger brothers, which are a mouse-click away), such work can comfortably be done over a modem (14.4K or faster) if necessary.

The programs involved display complete records of the images in certain of our units; allow the interrogation of our databases of student images (some 10,000 images in all); aid the construction of visual presentations resulting from such interrogations; and include a quiz program of varying difficulty to allow student self-testing of image material.

Although the Web and HTML have some limitations, the generality of the technology is a great advantage, as is the ease with which users can write HTML pages which are the common feature of every program to file or floppy disk, or print them out (including images).

The various programs we have developed, thanks to the programming skills of David Blackman, are as follows: