Electronic Publishing: Communication in a Scholarly Environment

Terry Noreault & Bradley C. Watson
OCLC Online Computer Library Center, Inc.
Email: noreault@oclc.org
            watsonb@oclc.org

Abstract

It is now certainly a cliché to claim that the Internet is dramatically transforming scholarly publishing. The most obvious change is in the printing and distribution of scholarship. It is now within the reach, both technically and financially, of any scholar through the World Wide Web (WWW or Web) to make high quality 'print' available to tens of millions of potential readers. This new medium also allows the inclusion of multimedia (color, sound, video, and interactive 'applets'). However, perhaps the most fundamental change is yet to come. The new communication and collaboration technologies on the Web provide the potential to change the entire process of scholarly communication, not just the production and distribution phases. Specifically, it is now time to rethink the review and certification of scholarship and the recognition of the scholar's contribution in terms of the capabilities inherent in the Web technology.

1 Introduction

Michael J. O'Donnell in his article Electronic Journals: Scholarly Invariants in a Changing Medium makes the statement:

There is no compelling reason to change the structure of the refereeing process to adapt to an electronic medium, but there are some more highly interactive, and more open, forms of certification that might be tried (O'Donnell).

It is our hypothesis that such experimentation as O'Donnell speaks of so dismissively is critical to the future of electronic scholarly publishing. But the proof will be in the testing, so we are presenting a suggestion for one such interactive, more open form of certification. We believe that in time, in an evolutionary process, experimentation with such systems will prove that there are, in fact, compelling reasons for taking advantage of the new technology represented by the Web in the process of certifying the quality of scholarly contributions. To understand why we say this, it is important to know what scholarly journals are, and what role they play in the lives of scholars.

Scholarly journals are, themselves, a technology for delivering results of scholarly activity to the world. As such, they incorporate a wide variety of other technologies: language, paper, print, computers, graphic representation modes, colour representation and application, and many more. From one view, the history of scholarly journals can be seen as the history of an ongoing attempt to automate the delivery of information to as wide an audience as possible. From the first scholarly print-based journal published three centuries ago to the latest, electronic-based journal published on the Web, scholarly journal publishers have routinely looked to adopt the latest technology advances in order to lessen the costs while enhancing the value of their publications and increasing their audiences. In practice, this has meant the elimination of manual effort at every feasible point in the publication process.

During the last forty years, the application of technology to scholarly journals has focused principally on the physical production process, though the distribution aspects have not been ignored. Indeed, our organisation, OCLC Online Computer Library Center, Inc., along with its wholly owned subsidiary, Information Dimensions, Inc., has developed a product targeted towards publishers, OCLC STEPS System for Total Electronic Publishing Services (STEPS) using the latest in automation technologies applicable to the production and distribution processes. Other such systems from other vendors are also coming online.

But what has been left out of the reinventing of the fabrication of scholarly journals has been the process that actually procures the contents of journals: the scholarly articles. Generally, the procurement process, usually known in the trade as the refereeing process, is a highly manual affair, involving the interaction of numerous individuals, in a variety of ways, including the transfer of documents, reviews of documents, comments on documents and revised documents via a myriad of forms and means. Doing this manually has high costs involved, both in terms of time and resources expended and increased opportunities for error per article finally accepted for publication. Simply making a straight conversion or adaptation of the process from manual to automated will eliminate a significant amount of costs. But with time and experimentation, there is more to be gained by making changes that go beyond simple adaptation.

Before the Internet, before the World Wide Web, there was no reasonable way to automate the refereeing process. Now that they do exist, change can occur, but not only in order 'to adapt to an electronic medium' but because the electronic medium makes it possible to effect meaningful change. What the medium makes possible in our view is firstly, a more cost-effective process for the publisher. This will be done the old fashioned way by eliminating as much as possible the actual human effort expended in selecting deserving articles via automation. Secondly, the medium affords an opportunity to measure the value of individual scholars' contributions for promotion and tenure decisions via automation. These possibilities are actually intertwined, as will be shown later.

Before addressing the specifics of our proposal, we will first briefly review the present state of automation in publishing, as well as the current electronic journal environment.

2 Automation in the Publishing Process - Current Status

Automation has come incrementally to the publishing industry. The first major subsystem to be automated was the setting of type. This was done in several phases to the point where we now no longer have metal type and presses at all. Instead, typefaces are just digital representations of symbols that are manipulated via computer into a program for each text to be printed. This program is downloaded electronically to a printer (usually laser-driven) that prints the text based on the instructions embedded in the program. Systems for doing this have been available, even at the individual workstation/desktop level, for more than a decade and have made a wide mark on the whole publishing industry. In scholarly publishing, they have allowed a 'cottage industry' approach to the task of journal production, making it affordable for very small organisations, such as university departments, even individual scholars, to become players in the field.

Once the typesetting/photocomposition portion of the process was automated, and the copyediting was automated via word processors, the next step was to automate the production workflow environment. Software systems for keeping track of, and scheduling events for, articles and issues of journals have been around for a decade or more. These packages do not, of themselves, cause the events to happen, such as starting the photocomposition engine at the appropriate time with the given articles for a issue. But they do allow for an online, interactive accounting of all the various statuses of all the components being worked on at any given instance of time. They save a great deal of manual effort. Once all these subsystems were automated, the next step in the automation evolution of publishing was the integration of all these subsystems into one system.

Jolanda L. von Hagen of Springer-Verlag, at a seminar at Bond University in May 1992, laid a picture of the traditional publishing workflow alongside a picture of that same workflow when Xyvision's version of an integrated publishing system is used. The traditional workflow, which included using various automated tools such as word processors, graphics packages and photocomposition/typesetting engines, contained twenty-five processes. Under the Xyvision system, this reduces to a mere six (von Hagen). A similar reduction can be realised with OCLC STEPS, a fully integrated electronic publishing environment that allows for the importing, authoring, editing, composition, typesetting and distribution of texts via multiple mediums based on an SGML-aware database engine and any of several applications packages integrated as a shell around the database. Such systems represent the state-of-the-art in automation of the physical production of journals, as well as their distribution, in multiple formats.

With a STEPS-like system, it is possible to enter articles into a database, either through capture and conversion from formats other than SGML, or through importing them directly from SGML authoring tools, copyedit them, then schedule them for a given issue of a journal through an automated workflow manager and, at the appointed time, out will come the fully composed and typeset journal ready for printing on paper, or placing on a CDROM, or mounting on the Web or other online environment. Depending on the complexity of the documents involved and the distribution medium, it is possible to have no human intervention beyond the scheduling for publication step. Very complex documents such as those with heavy mathematics or intricate tables would probably need some intervention in the photocomposition stage, but this is only on an as needed basis. Online journals can be loaded automatically from STEPS-like systems after they have been composed, while for CDROM or paper journals, some human intervention is necessary to create and distribute the individual copies, though automated tools can certainly make this task very simple.

STEPS-like systems are aimed at publishers of many journals and monographs, such as academic presses, professional societies and commercial scholarly publishers, and are still not deployed even amongst this market in any appreciable way. The technology that has impacted scholarly communication more forcefully are the desktop publishing systems noted briefly above. These are systems designed to run on an individual workstation with an attached laser printer and/or World Wide Web Server connection. More powerful than their cousins, the What-You-See-Is-What-You-Get (WYSIWYG) word processors and desktop publishing programs give individuals the capability of fully composing and typesetting whole documents and either printing them or mounting them on online systems with all the photocomposition/typesetting quality that the larger publishers have traditionally had at their disposal.

Thus, individual scholars can for instance, layout their own texts in a high-quality manner, then publish (that is, distribute) them through a WWW or Gopher Server to the world of the Web/Internet. Desktop publishing tied to online delivery systems like the Web can make any person capable of publishing their own work. The question becomes, why would a scholar do such a thing? The answer lies in what motivates scholars to publish. This factor will be explored more fully later in the paper.

International technology standards, both de jure and de facto, are the basis for making such highly integrated, fully automated systems not just possible, but practical. Without standards such as ANSI Standard Character Information Interchange (ASCII), Standard Generalised Markup Language (SGML), Hypertext Markup Language (HTML), Adobe Postscript and Portable Document Format(PDF), Hypertext Transfer Protocol (HTTP), Rich Text Format (RTF), Transmission Control Protocol/Internet Protocol (TCP/IP) and dozens, if not hundreds, of others, then the automation we have today in the publishing world could not exist. The question that needs to be asked is what standards do we need that will make further automation possible and practical? Indeed, what is there left to be automated? These questions, and some possible answers, will be further developed later.

3 Electronic Journals - Their Process of Becoming

For an in-depth view of the past, present and future of electronic journals, we recommend both F. W. Lancaster's The Evolution of Electronic Publishing and Thomas B. Hickey's Present and Future Capabilities of the Online Journal which appeared in the Spring 1995 edition of Library Trends. Indeed, the entire volume was dedicated to the theme of Networked Scholarly Publishing and as such it contains many excellent articles. In this space, we will briefly discuss how both articles help establish the point that there is an evolutionary character to the process of turning to electronic publications, not a revolutionary one.

Lancaster isolates four primary stages in the evolutionary process of electronic publications:

  1. Computers used to generate print-on-paper publications.
  2. Electronic-based texts distributed that are exact equivalents of
    printed versions of the same texts.
  3. Electronic-based texts distributed that only exist electronically,
    but without taking special advantage of the medium - they look
    like paper text would look like.
  4. Electronic-based texts that are totally new take real advantage of
    the electronic media's nature.

As Lancaster points out, examples of all four co-exist in the here and now but logically they are manifestations of separate evolutionary stages. What is clear from Lancaster's presentation is that the electronic publication world did not start out with electronic only, totally new journals that from day one took full advantage of all the capabilities afforded by an electronic medium of creation and distribution. The movement has been gradual and deliberate (Lancaster).

Hickey, taking a slightly different tack, gives a detailed examination of the capabilities of three electronic forms of publication in terms of their relative advantages and disadvantages. Hickey's classification of forms, simple text, page image and structured text do not exactly match Lancaster's stages; still he clearly shows an evolutionary movement from straight-forward presentation, to more complex, to still more complex. As with Lancaster's stages, Hickey's three forms co-exist in time, with examples of each currently residing on the Internet/Web (Hickey).

We believe concurrent existence of separate stages and forms of electronic publications will continue for some time, probably into the next century. However, it is the stage four journals, most likely utilising Hickey's structured text form, that will be best suited to evolve the kinds of capability we are presenting here - an automated peer review process integrated into the presentation of the journal articles.

4 Publishing Economics In Terms Of Several Key Players

Authors, publishers and consumers are the three main player groups in the world of scholarly publishing. In this section, we will look at how the economics of scholarly publishing are viewed from the perspective of each. For the consumer group, we will substitute the institution of the university library since that is the central collector, keeper and disseminator of scholarly publications for the end user community of most universities.

4.1 Economics of Scholarly Publishing for Scholars as Authors

While economic incentives are not the only driving force for scholars to publish, they are certainly important. Scholars as authors have a personal economic interest in scholarly publishing in the sense that their potential for earning more income is directly affected by the publications they manage to have accredited to their account. Specifically, raises, promotions and tenure granting decisions for university scholars are largely based on evaluations of their impact on their field of study as measured by the number and quality of their publications. The quality of a given publication is measured indirectly in terms of the reputation of the journals that accept and publish the works.

Additionally, there is an economic value associated with each scholar's reputation amongst the scholar's peers for doing high quality scholarship. Like the outcomes of the committees on raises, promotions and tenure, scholarly reputations are largely based on the quality and number of publications by a given scholar. Quality, in this sense, is measured by the number of peers who actually read and appreciate the publications of a given scholar and do them the honour of citing their work in a positive manner in their own publications. The higher the reputation of the citing scholar, the more weight the cited scholar receives in terms of their own reputation. This is a very informal economic exchange with no central accounting office keeping score, unlike with the promotion and tenure committees. But it is important and is effective because it can lead to leadership roles in the larger community such as being asked to serve as a reviewer or editor for the important journals in the field. This in turn can lead to positive evaluations by the university committees that control raises, promotions, and tenure grants.

Thus, scholars have a vested interest in there existing as many possible high-quality publication venues - that is, journals for most scholars, monograph publishers for the rest, as can be viably supported. It does not hurt if there are also a good number of less prestigious, but definitely not disreputable, venues available as well, since no scholar always hits a home run every time. Therefore, scholars have a vested interest in keeping the costs of journal production as low as possible so that more journals can be produced with existing funds. As we all know, the piece of the scholarly funding pie available for publishing is not getting any bigger, so the goal is to cut it into as many smaller pieces as possible. This is one possible reason for the experimentation by small groups and individuals with desktop publishing systems and Internet/Web publishing. This approach has many advantages such as timeliness of publication, direct control of the presentation values and opening up the potential audience by large factors.

4.2 Economics of Scholarly Publishing for Publishers

The economic goal of any publisher is to keep the costs of publishing down to a bare minimum while keeping the presentation quality as high as possible. Both affect the bottom line. Lower production costs means more of the money available for other uses, while higher quality leads to more and higher return sales. People are often willing to pay more for higher quality production values. For example, a journal in a field where high quality colour photographs are essential to conveying the scholar's work will get higher fees and will sell more product, if it has higher quality photographs then the competitors'. Conversely, such a journal might not survive at all if it can not reproduce such photographs. This is true whether the publisher is attempting to make a profit for the journal owners or to just stay within a spending limitation set by whoever furnished the money to publish, such as a university department or professional society. Automation has traditionally been the route to lowering costs of production in publishing. It has also been the means for achieving higher quality output.

All of which is to say that a journal, in order to survive, must keep production costs down while keeping quality high, both in terms of content and presentation of content. As discussed earlier, current automation systems go a long way towards keeping down significantly the cost of the physical production of journals, whether paper or electronic, and their distribution. They also increase the presentation quality. However, the cost of acquiring the content has not yet been addressed by automation in any significant way. This is not an insignificant cost to publishers who have to shoulder all the work of forming and maintaining the network of peer reviewers and editors that keep the flow of high quality papers into the publisher's journals at a sufficiently high level in order to maintain market share. Automating this process could further lower the costs of publishers, freeing more capital to fund more journals, or raise the quality of existing ones. It could also lower the costs of entry into the publishing field, bringing more competition. This is a possible result that the following player group in this triad of players can only be very happy about.

4.3 Economics of Scholarly Publishing for Consumers

Consumers of scholarly publications are represented in the marketplace largely by university libraries whose budgets include large, though not infinite, sums dedicated to the acquisition of new materials, both serials and monographs. Because of the huge rise in cost of scholarly journals in recent years, more and more of the acquisition budget is being dedicated to serials at the expense of monographs. This is because most scholarly fields rely more on journals then on monographs for their publication venues. This has put the humanities and the arts at somewhat of a disadvantage overall, but the sciences are not too happy either since the other effect of high serial costs has been the elimination of subscriptions to serials considered to be of marginal value. This leads ultimately to the demise of such journals, a situation that publishers, authors and consumers are none too keen about.

Libraries are being hard pressed to keep up with the rising costs, which means that library patrons are being shorted in terms of what they want - large numbers of high quality scholarly journals readily available to them. It also means that in the long run, publishers will be publishing fewer journals, leading to fewer venues for scholars to publish their work in, leading to more difficulty in attaining their career objectives in terms of raises, promotions and acquiring tenure. The good news is that the Web offers an opportunity to change the parameters of this currently zero sum game. By automating the entire scholarly communication process via the World Wide Web model, the possibility exists for lowering the costs.

5 Next Phase of Automation in Scholarly Communication

The automation of scholarly communication via desktop publishing and electronic distribution has resulted in, and will continue to cause, significant changes for scholars. These changes have improved the quality of the final product, eased the burden on the creator, and provided whole new vehicles for distribution. Yet, despite the obvious powerfulness of what has already transpired in the way of changes, it is our assertion that their impact is just starting to be felt and that the full consequences haven't even begun to be registered in the scholarly community. The wave of change in scholarly publishing hasn't yet crested.

We believe the next phase of automation will address the need readers and authors have for peer reviewed scholarship and to further automate the publisher tasks. Tools developed in this phase will enable the mediation of the evaluation function which is so important to the progress of scholarship. This mediation will consist of reviewers providing feedback to authors so that work can be improved, authors receiving formal recognition of the quality of their work, and readers having automated filtering aids for identifying high quality scholarship. One consequence of these tools will be to automate some of the management of the collaborative process carried out by publishers, thus lowering their costs.

These technologies will certainly, at a minimum, enable scholars to receive timely comments from peers and readers to benefit from that evaluation of the material. But they can do more. Among other things, collaborative software technologies can

  1. allow authors to respond to reviewers comments by revising the articles, while keeping earlier versions available as well;
  2. maintain notes or evaluations by readers and reviewers with each article as attachments; and
  3. keep track of a measure of merit to be assigned to the article by peers. Below is a review of what such a system might look like.

5.1 Example System

To better explain this system, we will describe a possible implementation and how each of the stakeholders in the scholarly communication process would perceive the system . This is not to suggest that this is the only or best possible implementation, but rather to demonstrate the possible uses of the technology. In fact, the technology will enable a broad range of scholarly communication models which can be tuned to the individual needs of various disciplines.

5.1.1 Model

The fundamental component of this model is a subject-focused, electronic, Internet/Web accessible, archive of papers submitted by authors. The subject area for any given archive will be, as it is today, determined by the publisher. While one could imagine one giant archive that contains all the worlds' scholarship, it seems to us that the organisation that comes with subject-focused journals is just too valuable to sacrifice. An archive would point to the full text which may or may not be stored on the archive server. What will be on the server is a metadata record containing descriptive information about the article (for example, author, title, etc.) and subject classifiers to facilitate in the retrieval of the article by potential readers with specific information needs.

The metadata record will also contain status information about the article. This status will consist of whether or not the article has been reviewed, usage statistics (how many readers have accessed it), citation statistics, reviewer's ranking and reader evaluations.

5.1.2 Author

This approach to scholarly communication enables authors to rapidly publish scholarship. It enables the work to be reviewed and evaluated by formal, peer-based processes. Once the article is submitted, the author may receive comments from reviewers and/or readers and make changes to the article. At some point in time after the article has been available in the archive, if the paper is viewed by the editorial board as having sufficient merit, it will be labelled as an accepted paper, providing the first level of formal evaluation to the author. Other evaluation criteria will also be available to the author, such as number of citations to a given article.

5.1.3 Reader

This archive provides a very powerful tool to readers to scan the scholarship of the field. One very important attribute is that the filter provided by formal review and acceptance is still available to readers. However, at the same time, readers needing very broad coverage can choose to look at all submitted articles, using their own criteria for judging the value of the paper in light of their own needs.

In addition to the formal review, readers will be able to filter the database by highly referenced papers. That is, the reader can make sure that they only look at papers which have been referenced in other works in the archive. This provides another measure of quality that a reader can use to filter the archive during a search.

Perhaps the most exciting possibility is special interest communities to form within the population of all readers of an archive in order to evaluate articles based on the group's own set of criteria. In other words, small groups with specialised interests can provide different ratings of quality. These groups might be formally organised, like special interest groups within the scholarly and professional societies, or they might be informal, consisting of anonymous readers who share similar interest profiles. Thus, the ratings given to an article by its reader would be maintained anonymously by the archive, grouping the ratings for retrieval purposes by reader profiles. Thus, if a reader wanted to read articles that had been evaluated highly by other readers with a similar interest profile, the archive could supply all such articles by first selecting those readers with similar interest profiles and then retrieving those articles that were ranked highly by that group of similar readers.

5.1.4 Publisher

This kind of system should result in lower costs to the publisher in terms of the article acquisition process. While the publisher still has to define the subject focus of the archive, select reviewers, solicit new papers, provide assistance in copy editing, and promote the archive to potential readers, the automation of the review and acceptance cycle should realise a net gain in productivity for the publisher over current practice. It is important to note that this approach leaves many of the traditional publisher responsibilities in place.

6 Conclusion

The Internet and related technologies allow the reshaping of the scholarly communication processes. This will have significant impact on those processes, though much of the core will remain the same.

Bibliography

1
O'Donnell, Michael J. (1995) Electronic Journals: Scholarly Invariants in a Changing Medium, Journal of Scholarly Publishing, April, pp.183-199.

2
von Hagen, Jolanda L. (1992) The Electronic Journal: Is the Future with Us?, in The Electronic Journal: The Future of Serials-Based Information, Brian Cook (ed.), The Haworth Press, Inc., New York, pp. 3-16.

3
Lancaster, F. W. (1995) Evolution of Electronic Publishing, Library Trends, Spring, 43(4):518-527.

4
Hickey, Thomas B. (1995) Present and Future Capabilities of the Online Journal, Library Trends, Spring, 43(4):528-543.


Organised by: AUUG'96 & CSU Return to Conference Proceedings