A few months ago my publisher and I completed the final tasks in the production of a very large scientific textbook (31 chapters, 1300 pages, 600 figures, 2000 display equations) using TeX. This included preparation of both the source files and the final typeset pages using computer-based text-editing and typesetting facilities at Stanford University.
In response to suggestions from Eric Berg of Stanford and Ray Goucher of TUG, this note sets down a few of the lessons I learned about producing a book using TEX, including a few things I wish I'd thought about earlier in the project. These comments are focused primarily on organizational and procedural questions, rather than on technical problems involving TEX coding, macro packages. or computer hardware.
Book publishing is an old and highly refined art, with a highly developed and standardized set of procedures as to how an author's work goes from initial manuscript to bound books. Traditional book publishing is also an industry marked by a great deal of subcontracting. Book design, copy editing, preparation of art work. typesetting. printing and binding, even publication coordination are often subcontracted out by the primary publisher to freelancers or firms who supply those services to many different publishers. Many major publishers, in fact, may have never owned a printing press or even a typesetting machine.
This widespread use of subcontracting. along with the traditional character of the publishing industry, means that all the procedural steps from manuscript to final book -- the way the work flow is organized and scheduled, the way records are kept and objects are labelled. the proofreaders' and printers' marks that are used, the understandings as to who does what and in what order -- have become highly standardized and widely understood by all concerned. Technological developments which interrupt or short-circuit this understanding have a disturbing effect.
Widespread subcontracting of the production steps has also had a second, more subjective effect, in my opinion. It seems to me that many traditional publishers have not, at least in the past, developed a technologically oriented view of their industry. Over many years, when I attempted to converse with editors and publishers about word processors, text-editing programs, holograms, floppy disks, laser printers. and their impact on publishing technology, the publishing people seemed to me to focus only on things like printers' union demands, and whether it was better to send material to England or Asia for low-cost typesetting, They seemed to me to have little interest in technological developments in their industry. This disinterest in the technological aspects of publishing is of course now rapidly changing.
Publishing a book using TeX, especially when this involves direct author input of the source code, as well as author (or at least noncommercial) generation of the final typeset pages, obviously throws many of the traditional procedures and organizational arrangements of book publishing into confusion. To gain maximum advantage from book preparation in TeX while retaining traditional book quality, new and different organizational procedures will have to evolve. I would not want to say, from my limited experience to date, how these new procedures should develop. It seems clear, however, that this evolution has yet to be completed, or even to have gotten very well started.
In my own case, the source file for my book was prepared, and then expanded, revised. reshuffled. and rearranged through innumerable versions over a 10 year period, largely by myself and my secretary, Judy Clark, working on home and office terminals connected to various Stanford mainframe computers. Early versions started out formatted in Script on the campus IBM 360-370 mainframes. Things really began moving about 5 years ago when the source files up to that point were transferred first to a university and then a departmental DEC- 20 machine, and TeX became available on campus. Five or six years of steadily evolving class notes and draft output were generated and printed, first on a Benson Varian 9211 printer and then on one of the earliest Canon/Imagen laser printers (which is still running magnificently today). If we began again today, of course, we would probably carry out all of this manuscript preparation using TeX on our Macintoshes and obtaining draft output from a Laserwriter or something similar.
The final book, titled simply Lasers, is being published by University Science Books, headed by Bruce Armbruster in Mill Valley. California, and will be distributed overseas by Oxford University Press. The book design was done by Robert Ishi of Oakland, California, and the copy editing by Aidan Kelly of Alameda. California. The art work was drawn (using traditional hand drafting techniques) by John Choi, draftsman in the Stanford Chemistry Department. TeX macros to implement Bob Ishi's design, along with the massaging of the TeX source file into final shape, were largely done by Stanford TeXpert Laura Poplin, with assistance from Arthur Ogawa and Eric Berg. Final pages were typeset on the Stanford Computer Science Department's Autologic APS-p5 by Eric Berg, with counsel from Dave Fuchs. The art work was shot and pasted onto the typeset pages, and the book printed and bound, by the Maple-Vail Book Manufacturing Group. Publication coordination of all these people was accomplished, despite the confusions of a new and unfamiliar approach, by Bernie Scheier of Miller/Scheier Associates, Palo Alto.
Even when I began to use TeX in its earliest days (well before LaTeX), I knew enough to delineate the main structural elements of the book using a simple limited set of macros named "\chapter", "\section", "\subsection", "\sectionend", "\figure", "\problemsbegin", "\problem", "\problemsend", and so forth, putting as little native TeX code in the source file as possible. I wrote simple definitions of these macros myself for producing draft output, assuming that a hired TeXpert would modify and extend these macros when the final book design was developed. Any special symbols used in the laser field, such as the quantum-mechanical symbol "h-bar", along with any mathematical symbols for which I thought I might later want to change notation, were also represented by control sequences.
I might of course have looked for some standard TeX macro package to employ in doing this. Few if any TeX macro packages existed at the time I began, however. In addition, my experience with macro packages, in several different situations, has always been that any macro package powerful enough to provide sophisticated capabilities seems inevitably to acquire a complicated syntax. Once the syntax of the macro package becomes sufficiently complicated, I generally find that it becomes easier to learn the basic language and then name and define your own macros -- stealing as many clever wrinkles from other packages as you can -- rather than trying to learn the existing package syntax and make it do what you want.
One of the primary concepts in traditional publishing is of course the idea that the production of a book begins with an author's finished manuscript. Publishers assume there's going to be one. Copy editors assume they will have a stack of the author's typewritten manuscript pages, on which they will write mysterious editing marks in red, which will eventually be read by the printer and must never be erased, along with editorial corrections in blue, and questions to the author in green, which must be answered in the margins in soft black pencil. and the circled question marks crossed out in purple, and so on. This master copy of the manuscript becomes the primary vehicle of communication between all those involved in the production process.
The idea that there is no paper manuscript, only a mysterious (and changeable) source file in a computer somewhere, is very upsetting to people in publishing, at least to the traditionalists. In my initial dealings with many of the production people I had the strong impression that what they really wanted most was to get the manuscript out of the g-d d-m computer and into their hands, so that they could start working on it and marking it up in the way that God intended. (Despite this, the book's copy editor did eventually end up sitting at one of our terminals, doing much of his editing of the source file on line.)
Traditional publishers -- at least the ones I encountered -- also have a very strong reluctance to begin production work on a book until the author has turned in an absolutely complete, finished version of the manuscript and the associated art work. One can understand this viewpoint. It is probably based in part on well-founded experience that it always takes several times longer than the author promises for a manuscript to go from "virtually complete" to "really complete". Authors are probably also notorious for telling publishers that the first half of their manuscript is absolutely finished and the publisher can start production work on it, while the author cleans up just a few unfinished bits in the second half. The author then comes back later on with expensive last-minute changes in the first half of the manuscript after it is halfway through the production process, thereby throwing everything into confusion.
Setting the production process in motion also has cash flow implications for the publisher -- he must start paying the various subcontractors, with sales revenue still far in the future. Production people can also obviously work more efficiently, and follow schedules more effectively, if they have a complete and finished block of work handed to them all at once. Nonetheless, as I'll argue below, making plans as early as possible at least for how the production is to be accomplished is one step that can improve the efficiency and speed of the whole process.
To allow for maximum future flexibility, I had also adopted an approach in which every illustration in the book was given a TeX-style symbolic name -- e.g., "\powerflow" for a diagram of the power flow in a laser system. Each figure was located in the source file by a \figure macro with syntax like
The arguments give the figure's symbolic name, width and height, say whether the caption goes to the side or bottom of the artwork, and give the text for the caption. Encountering this macro in the source file of course bumps the current figure number up by one; assigns that number to the control sequence \powerflow: reserves suitable space in the TeX output (perhaps as a \topinsert) for the figure; and inserts a suitably formatted caption. I also wrote the symbolic name ("powerflow") in the upper right corner of the original art work for that figure. The figure was then referred to in the test by its symbolic name. e.g., "The flow of power between different elements in a laser is shown in Figure \powerflow.". With this approach I could obviously insert a new figure anywhere in the book. or move a whole section of the book with its imbedded figures to some other place in the manuscript. at any time before the final run of the source file. So long as I kept all the symbolic names unique, the figure captions, references and renumbering would all be taken care of.
The production people were unable to cope, however, with the idea of illustrations that were identified by names instead of numbers; and the copy editor and everyone else expressed great unhappiness at a manuscript which referred to "Figure \powerflow" rather than "Figure 8.11". Each piece of art work for a book is supposed to have a figure number, which is always written in red and circled in the upper right-hand corner of the illustration. The production coordinator wants to count these and make sure they're all there. The copy editor wants to see that the figure number written on the art work matches what is in the text. It was easiest eventually to go back, after the final manuscript was frozen, and hand-convert each symbolic name into the corresponding figure number, both on the art work copy and in the source file.
I certainly did not want to wrlte the final macro definitions and TeX coding myself to convert the book designer's specifications Into typeset output pages, nor did I want to massage the final page markup and run the phototypesetter myself. Hence it was necessary to find local free-lance TeXperts with publishing skills to carry out these tasks. We were fortunate in having available skilled help in the Stanford community to do this work. People with advanced ability in TeX, and especially people who combine high-level TEX skills with a genuine eye for quality book layout, are not yet thick on the ground, however. Minor complications also arose in getting the final typeset pages for the book prepared on the typsesetter belonging to the Computer Facilities group in the Stanford Computer Science Depart- ment. There were the usual purely technical problems of fonts and communication between machines, which were not too difficult to overcome. Beyond this, however, there were administrative difficulties of the sort that arise when commercial and academic worlds meet. Stanford's Computer Science Department views its typesetter as a typical academic facility: It's there and available to faculty, with some staff help available in using it, and its operating costs must be paid for more or less on a cost-of-use basis: but it is not a commercial enterprise.
The publication coordinator, however, concerned with schedules for getting the book printed, bound, and marketed, wanted to be able to make firm commitments to deliver the typeset pages to the printer on a scheduled date. Hence he wished to have a definite business commitment from the typesetting people that the work could be done by a certain date and for a certain fixed price. The Computer Science people, reasonably enough, were not In a position to give that kind of firm guarantee. Things did, of course, all work out in the end. Incidentally, after a small amount of looking we did not find any commercial typesetting firms, at least in the San Francisco Bay Area, that offered commercial-grade book-quality typesetting service starting from TeX source files.
What are some of the recommendations I would make, or steps I would do differently. if I had to do the whole job over again?
All in all, I would certainly produce any future book of any type using TeX, as compared to any other way. Important benefits for author, publisher, and readers that came from preparing this book in TeX included:
Finally, last but far from least, preparing this book in TeX gave me the opportunity to gain some appreciation of the remarkable intellectual achievement represented by the TeX language, and seemingly by everything else that Donald Knuth does.