A Symposium Celebrating the 25th Anniversary of the DENDRAL Project 1965-1990 December 15, 1990 Terman Auditorium Schedule: 8:30 - 9:00 Registration 9:00 Edward Feigenbaum 9:30 Dennis Smith 9:45 Raymond Carhart 10:00 Georgia Sutherland 10:15 Refreshment Break 10:30 Joshua Lederberg 11:00 Mark Stefik 11:15 Larry Masinter 11:30 Sridharan 11:45 Russell Altman 12:00 Suzanne Johnson 12:05 Box Lunch 1:15 Bruce Buchanan 1:45 Avron Barr 2:00 Peter Friedland 2:15 William Clancey 2:30 James Bennett 2:45 Thomas Dietterich 3:00 John Kunz 3:05 Robert Engelmore 3:15 Refreshment Break 3:45 Douglas Lenat 4:00 Gio Wiederhold 4:15 Peter Hart 4:30 Saul Amarel 4:45 Horace Flatt 4:50 Elliott Levinthal 4:55 Danny Bobrow 5:00 Adjourn 6:30 Cocktails and hors d'oeuvres at Jewish Community Center, 655 Arastradero Road 7:00 Banquet 8:00 Car] Djerassi: "Facts and Fiction" How DENDRAL Was Conceived and Born Joshua Lederberg, PhD Rockefeller University, New York 14 This will be a somewhat personal history of how I came to work with Ed Feigenbaum on DENDRAL, an exemplar of expert systems and of modeling problem-solving behavior. My recollections are based on a modest effort of historiography, but not a definitive survey of and search for all relevant documents. On the other hand, my recollections will give more of the flow of ideas and events as they happened than is customary in published papers in scientific journals—accounts so dry that Medawar lugubriously called them fraudulent {43} (compare Mer- ton & Zuckerman {44, 45, 61}). These authors point out that the stan- dard scientific publication is narrow-mindedly devoted to the context of justification. The DENDRAL effort (along with much of medical in- formatics) is dedicated to discovery: Should we use a different standard for its history? This is a first effort at historical research and informed consensus on the origins of DENDRAL,; and we all understand the limitations of a personal account—especially about what others were thinking at a given moment. Built into the phenomenon of history, as soon as enough time has passed to enable some detached judgment, the evi- dence becomes frail, and we become vulnerable to the myths we create. Understanding all of these limitations, I will no longer qualify every remark: It should be implicit that each is “to the best of my recollec- tion” or “as best as can be inferred from the fragmentary documentary record.” I will assume you are generally familiar with DENDRAL and will concentrate mainly on material not found in the published papers, es- pecially as there is a comprehensive synopsis of its postnatal produc- tions {41}. My story will focus on the period up to the recognition that what we were working on was a knowledge-based system (ca. 1971). Because computer science is not my primary profession, my rela- tionship to it has been more episodic; and I can more readily isolate how I came to take some part in it, at Stanford from 1962 to 1978, mainly in very close collaboration with Ed Feigenbaum, Bruce Bu- chanan, and a host of others. My central scientific commitments have been to molecular genetics, starting in 1945 when I was a 20-year-old medical student {38}. At Columbia and then at Yale, I worked on the "How DENDRAL Was Conceived and Born,” by Joshua Lederberg, is a chapter in A History of Medical Informatics, edited by Bruce J. Blum and Karen Duncan. © 1990 by ACM Press, a division of the Association for Computing Machinery, Inc. (ACM). Reprinted with permission of the publisher. How DENDRAL Was Conceived and Born 15 1. 1937-1943. Leibniz dream; Logic & Axiomatic Method: Studies in Columbia College 1941, 1953, 1962. Computer hardware: Desultory exposures 1947 ff. Information-theoretic formulations in genetics 1953 ff. Introspections about the history of bacterial genetics 1960. Instrumentation development for Mars exploration: NASA 1955, 1959, 1961, 1963. Meet Minsky, Djerassi, McCarthy, Feigenbaum am ke ON (In every biographic-historic account in science, one seeks an inter- play of personality, ideas, institutional setting, and other externali- ties.) FIGURE 1 Conceptual and experiential threads leading to the DENDRAL project genetics of bacteria, a specialty which converged with the function of DNA as genetic information. My first academic appointment was at the University of Wisconsin from 1947 to 1958; then I went to Stanford in 1959 to take part in the reconstruction of its School of Medicine (for- merly in San Francisco) at the Palo Alto campus. My intended role was to found a new Department of Genetics; ] had no plan to be working with computers. Fate dictated otherwise: I met Ed Feigenbaum in 1963. Then, promptly after he moved from Berkeley to the Stanford faculty in early 1965, we initiated the collaboration that became the DENDRAL project. These were hardly random events: I go back a few years to pick up the relevant premonitory strands, which I identify in Figure 1. It pro- vides a roadmap for the narrative that follows. (1) Starting in grade school, I had fantasies that echo Leibniz’ dream (see (13}) of a “universal calculus” for the “alphabet of human thought,” that all of knowledge might be so systematized that every fact could be tagged with a code. Compare Mortimer Adler’s Propaedia {1}. New York City in the late 1930s offered wonderful encouragement to self-improvement through education, in my own case at Stuyvesant High School and at the New York Public Library. There I was fascinated with the Dewey Decimal System, which was so helpful in locating the books. If I could but memorize that, it would be proxy for mastery of all the knowledge it classified. In those days, taxonomy dominated bio- logical teaching too. {I will not detain you now with the perils of mis- placed confidence in low-dimensional, or insight-free knowledge. They 16 The Keynote Addresses need to be remembered when we try to extract “knowledge” from an expert, measure how much we have, and so forth.) Although I was committed from a very early age to a career in experimental biology, while in college I was eager to have some under- standing of the epistemological roots of science, and I enrolled in sev- eral courses in logic and scientific method. At Columbia, I was fortunate to have some personal exposure to members of the philoso- phy department: Ernest Nagel, Justus Buchler, and James Gutmann. With their help, I read George Boole and Whitehead & Russell {58}, and tried to follow J.H. Woodger in his Axiomatic Method in Biology {59}— an effort to express what was then known of genetics and embryology in the formalisms of relational calculus. Our factual knowledge was sparse enough; but apart from that, I wondered if we really understood our assertions when they were expressed in the jargon of empirical biochemistry. Axiomatic reformulations of biology are only just now returning to the scene {3, 54, 57, 47}. They make the intellectual demand of coping both with the formal logic and the molecular biology. That background gives some flavor of the retrospection about method that has entered my thought sometimes during, often after, my experimental research projects. That would precondition me to look to Al as a way of expressing my philosophy of scientific method, a per- spective eloquently stated by Lindley Darden {66}. (2) My first encounter with a “computer” was in 1941, ina lab for high school students sponsored by IBM {23}. My own instrument was a microscope, but one of my fellows was making innovative improve- ments on a punch card sorter/tabulator. It was an impressive manifes- tation of an electro-mechanical automaton, one that could certainly calculate more reliably than I could. It looked like fun. After the war, there was some publicity about the electronic machines, which I read at the level of Scientific American or Science magazines. But my own next step was the IBM 602A, on which I practiced in Fred Gruenberger’s course at Wisconsin, in 1953, in order to get some concept of program- ming, albeit on a plugboard! One could do statistics on this machine, as did some of my colleagues in applied genetics, but I had no compara- ble excuse to play with it. (3) That postwar period also saw the elaboration of information- theoretic formulations of genetics. We were starting to say that genes encoded the information needed to specify protein structure (14, 51). This style of thought and expression became more explicit in the period after 1953 {25} with the recognition of the implications of the Watson- Crick molecular structure of DNA {22}. It would be backward for any- one in my field to ignore this way of looking at the biological world. Then, Marvin Minsky came to see me at Wisconsin in 1955 at the behest of some mutual friend to discuss automata. I am sure I had already heard of some of his own work. How DENDRAL Was Conceived and Born 17 (4) My own laboratory research was a very mixed bag of theoreti- cal formulation and empirical encounter. I had been extraordinarily lucky on several occasions—but I didn’t want to be a hostage to chance: Should there not be a more systematic strategy of problem for- mulation? And if one could do that, problem-solving might be a throw- away. Serious questions about the rational direction of science were invoked around an examination of why genetic recombination in bacte- ria had not been explored 40 years earlier {24, 60}. (5) Starting with the observation of Sputnik, and a conversation with J.B.S. Haldane in Calcutta in November 1957 {67}, I had set out to assure that fundamental biological science was properly represented in the programs of space research that were just emerging. I had met Hal- dane on a date that was both a lunar eclipse and the 40th anniversary of the Soviet October Revolution (almost precisely 30 years ago). He taunted me with the prospect that we might see a red star (a thermonu- clear explosion) on the moon during the eclipse. At best, that was a striking metaphor for the danger that scientific interests would be to- tally submerged by the international military and propaganda competi- tion. They have never gained first priority; they might have been totally excluded. My own efforts were merely advisory and critical until 1960, after NASA had organized a Life Sciences Research Office and asked me to establish an instrumentation laboratory at Stanford. With Elliott Levinthal’s able technical direction of the lab, we became actively in- volved in the conceptual design of approaches to test for life on Mars, at such time as there might be a mission. I know most of my colleagues thought that would be well into the 21st century, as we were a decade short of the lunar landings. But the possibility of finding another branch of evolution was of such compelling scientific interest, the stake was worth odds I knew were very long. Both the internal activities of the Instrumentation Research Labora- tory (IRL) and design discussions with the engineering managers of spaceflight missions (principally at Cal Tech’s JPL) brought us into inti- mate conversation with technology of automation, process control, communications, and computer management. Furthermore, mass spec- trometry soon emerged as a technology of choice for chemical analysis. It has enormous sensitivity, selectivity, and independence of prior bias as to the molecular species expected [33]. As we shall see, it also offered some special opportunities and challenges in computation. In 1961, I was also invited to serve on a President's Advisory Com- mittee (PSAC) panel on the management of scientific information. Our report {50} gives modest support to the implications of computer tech- nology, along with “reproducing and microphotographing equipment” for information storage and retrieval. However, I had become ac- quainted with Eugene Garfield, the inventor of Current Contents, and had helped him set up a trial run of the Science Citation Index in the 18 The Keynote Addresses field of genetics {36, 19}. That experience (with its overtones of the clas- sification of knowledge for purposes of retrieval) was an early success in the use of computers in support of scientific research. By now, I concluded that I would have to learn much more about computers, at a hands-on level. The opportunity was engendered by the evangelistic efforts of Al Bowker and George Forsythe to establish an intellectual and technical base for, and broaden interest in, comput- ers throughout the Stanford campus. In company with the develop- ment of a new division, then department, of Computer Science, and of a campus-wide computer center, elementary programming courses were organized. I enrolled in the BALGOL (Burroughs Algol) course given by Bob Oakford, over the summer of 1962. This had much of the flavor of a course in English for fresh immigrants, the class having a very broad distribution of age and of academic status, specialty, and sophistication. I quickly succumbed to the hacker syndrome (and have suffered episodic relapses over the last 25 years). This was reinforced by the relentless rectitude of the machine in rejecting my errors—always so obvious in retrospect. “Next time, next time I will master the #@l!"™ system!” Well, I did shortly become reasonably proficient (eventually, in a range from assembly to higher level languages) mostly out of de- termination not to be made a fool. In those days, we had a B220— which would match a fairly feeble PC today—as the first campus machine. Its operating system would accept decks of punched cards in serial batch mode, with output either from the printer or new punched cards. The usual turn-around time was about 12 hours. If you got to the computer room around midnight, you might get another pass by 2 AM. The democracy and night owl ambience of the batch system was a social mixer for several enthusiasts from wide-ranging disciplines. (1 particularly recall Tony Hearn, who was starting his symbolic algebra system, REDUCE, on the IBM 7090). The impedance of a one-pass per day turnaround certainly did filter out all but the most enthusiastic. You also spent a lot of energy trying to simulate the machine in your own thought, in contrast to the casual, experimental mode—"“Let's see if this works”—of today’s interactive systems. This mode has unques- tioned advantages; but it may weaken programming as a teaching dis- cipline for logical rigor (except insofar as pure, unremitting failure teaches mainly discouragement). Our first applications included some that are pertinent to medical informatics, but not to DENDRAL, in areas of genetic epidemiology {6}, including a contract to produce the child-spacing report on the 1960 census. Bob Tucker was instrumental in sustaining our effectiveness and sanity through that experience. When we discovered that “chil- dren” of some mothers were delivered at 3-month intervals, I again learned the familiar GIGO lesson, and a healthy skepticism for mass How DENDRAL Was Conceived and Born 19 data repositories. Massive numbers do not take the place of quality controls on individual data entries. Some other inquiries, e.g., of inter- correlations of season of birth and birth weight with postnatal out- comes, taught us the difficulties of removing all the confounding factors. The usual “socioeconomic status indicators” do not begin to exhaust the vagaries of stratification of human behavior. 1962 also marked the recruitment of John McCarthy to Stanford. We met around the computer room, and soon discovered we had a common friend in Marvin Minsky. I had read Marvin's article on steps toward artificial intelligence in the January 1961, special issue on com- puters of the Proceedings of the Institute of Radio Engineers (46), the first issue I received as a newly enrolled member (having joined at the urg- ing of Lloyd Berkner, chair of the Space Science Board). That article and McCarthy’s intellect and excitement gave me a sense of tangibility of the possibility of engaging in AI research. When he showed me his new DEC PDP-1 and its interactive CRT displays (namely, Spacewar) | reached the conviction that “computers were going to change the whole style of scientific investigation.” This was not going to happen with card-deck data entry. We soon conspired in various projects to try to enhance the inter- face of computers with medical science. The most ambitious of these was an effort to attract Marvin Minsky to join the faculty of Stanford Medical School, but unhappily for us he decided to stay at MIT. We also began to talk about bringing interactive computing, via time-shar- ing, to Stanford, along the lines of Project MAC, which John had helped to design at MIT. These discussions ultimately led to ACME and SUMEX, the first community-access time-shared systems at Stanford, as we discovered that the NIH was able to fund research resources for health research through its Biotechnology Resources Branch. McCarthy’s PDP-1 also led us to emulate it as a laboratory interface computer, and our IRL signed on as one of the test sites for the new LINC (laboratory instrumentation computer) whose development NIH was sponsoring. Lee Hundley and Nick Veizades provided the indis- pensable hardware engineering expertise to enable us to master this marvelous new machine. The LINC was, of course, the forerunner of the DEC PDP-8, and in turn, of the PC revolution. Meanwhile, the IRL was getting more actively involved in mass spectrometry. Carl Djerassi had come to the chemistry department in late 1959, and we had developed a close personal and professional as- sociation around his academic research as well as his continued re- search direction of the Syntex Corporation. (Upon the company’s relocation from Mexico City to new laboratories at Stanford Industrial Park in 1961, he asked me to advise on its establishment of the Syntex Institute of Molecular Biology.) He was an accomplished mass spectrometrist, and of course I leaned very heavily on him for the elab- 20 The Keynote Addresses oration of this technology for space applications. Conversely, he knew nothing about computers, and I was eager to find helpful applications in the zone of our common interest. The IRL began to work on using the LINC to manage the formidable data management problems of real-time gas-chromatography mass-spectrometry {52}. One central problem was the efficient translation of mass numbers to molecular formulas. As I reexamine that arithmetic play, it reveals some premonitions of the later work. So I will expand on it beyond the intrinsic worth of the solutions {29}. The mass spectrometer is an instrument that converts molecules of a sample material into ions that are accelerated and mea- sured one by one. Further, by a combination of magnetic and electro- static fields, each ion can be sorted by its mass number. For the initial ‘discussion, we will consider only the molecular ion, ignoring further processes of fragmentation. At low resolution, we take atomic masses as integers (H = 1; C = 12; N = 14; O = 16; etc.). If we find a mass number of 14, this might be composed of H(14), C + H(2), or N. H(14) is a monstrosity: We have valence rules (H = 1; C = 4; N = 3; O = 2) that limit how many atoms can be bonded to a given atom. The ambiguity already seen at m = 14 is of course greatly multiplied in real cases, like m = 3675, a number which reflects the bounds of current instrumenta- tion. Our first problem is to calculate all the compositional isomers consistent with a given mass number. At this level, it is a knapsack, or change-making problem: finding all the ways coins of different denom- ination can be combined to add up to a given sum. In nonnegative integers, this is a diophantine equation. Namely, we seek all the solu- tions (i.e., compositions in h, c, n, 0) of: h + 12c + 14n + 160 + ... =m. The brute force approach is a set of nested iterations, for (h = 1; h <= Z; h++4); for (n = lpn <= Zj nt++)... mM=h+14n +... and test the m’ sums for a match to m. One simplification is to augment m, m’’ = m + k == 0 mod 12. We then eliminate c and find solutions in h, n, o that satisfy (h + k) + 14n + 160 == 0 mod 12. I would be interested to learn of deeper analytic approaches to the problem. For on-line com- putation, one thinks of constraining Z, at least by the mass still unas- signed in each loop, to reasonable bounds. It transpired that the valence considerations also set constraints on possible values of h; and other tricks allowed still further pruning of the tree generated by the nest, greatly shortening the computation. How DENDRAL Was Conceived and Born 21 Prior aides to mass spectrometrists had been published tables (em- bracing 570 pages in print) that reported the compositions sorted by m, from about 1 to 500, with n and o no greater than 6 {4}. A full set of tables for m up to 1000 would take about 10,000 pages of fine print. In reality, the masses of individual nuclides are not integers (sub- ject to the so-called nuclide packing fraction), and we have H = 1.0078252 C = 12.0 (by definition) N = 14.003074 O = 15.994915 With a high resolution mass spectrometer, a given ion might be re- ported as 718.374 + .006. Hundreds of compositions would match 718 in integers. One should use the fractional mass (.374) as equally impor- tant information in limiting the search. We no longer have an equation in integers, owing to the instrumental error. Nevertheless, various arithmetic tricks were devised that took account of valence rules, plau- sibility of composition, the negative and positive packing fractions of O and N, and the abnormal proportional discrepancy of H, to keep the search down to a manageable scope. For paper and pencil work (in 1964) this was embodied in a hand- book of some 50 pages, in which one could quickly look up the “mass defect” of numbers classified by residues modulo 12 {26). Even that small book was later {35} obsoleted by an algorithm that depended on a one-page table with just 72 nonzero entries, and a few arithmetic steps easily done on a 4-function hand calculator. This algorithm has served well in the data system built for a GC-MS (gas-chromatograph/mass- spectrometer) designed around a MAT-711 MS 462} and the LINC com- puter. It has evidently been independently rediscovered in China {63}. By now, however, most machines are coupled with data processors that are oblivious to such economies. (And mass spectrometrists no longer give much thought to the arithmetic of this problem.) The main point is self-evident: Contextual information could be incorporated early into the combinatorics and reduce a blind generate- and-test search by very large factors. We turn now to the larger frame of chemical analysis. Molecular ions are important targets for mass spectrometry; in the ideal case they can give unambiguous compositional formulas. Of course, they tell nothing of the topological connectivity of the constituent atoms. To il- justrate with a trivial case, C(2) H(6) O has a mass of 46.041866 but this does not distinguish methy! ether (CH3-O-CH3) from ethanol (CH3- CH2-OH), a medically significant matter! Within the mass spectrome- ter, however, the molecular ion also breaks up into a set of fragments (according to reasonably well-understood rules). The spectrum is the The Keynote Addresses array of these fragments, revealed by their mass numbers. It is often an absolutely distinctive fingerprint, diagnostic of a specific structural iso- mer (as the molecular ion mass number is of the composition). The elementary problem of inferring composition from molecular mass now well-solved, could we take the next step: model the chemist’s inferential procedure in finding the structure from the spec- trum? How to represent organic molecular structures in graphs, and then their dissection into subgraph fragments, as occurs in the mass spec- trometer, became my task for 1963 to 1964. Emile Zuckerkandl, an asso- ciate of Linus Pauling, also visited my lab during this interval. We started some of the first statistical studies on amino acid sequences of proteins, looking for hints of nonrandom regularities within sequences, and unsuspected evolutionary relationships among different ones. This is a substantial industry today {40]; there were not enough published data in 1963 to offer more than a few tantalizing hints. (6) All this was then the ideological context of my meeting Ed Feigenbaum on April 6, 1963. This was a Saturday meeting that Karl Pribram had organized at the Center for Advanced Studies in Behav- ioral Sciences on computer models of thought. John McCarthy, Ken Colby, and several others were also present. I told Ed how I was grop- ing for ways to represent chemical structures; he was already on the lookout for problem areas in science to which to bring his background on mechanized problem-solving. We stayed in good contact; I have a signed copy of Computers and Thought dated 1/17/64 {15}. During 1964, I completed the preliminary graph-theoretical work on representation of organic molecular structures {30, 32, 28]. That had entailed going back to the elementary graph theory of the 19th century for canonical forms of tree structures {21}. Fortunately, George Polya had done some important work on generating functions in 1936 {49} and was most generous in his advice about that older literature. When it came to cyclic graphs, I had a particularly entertaining time, almost at the level of recreational mathematics (31, 34}. For a century after the conception of organic molecules as ensem- bles of connected atoms subject to structural isomerism (Berzelius, 1831 {48}, Crum, Brown, Butlerov, and Kekule in the 1860s (20}), no more than desultory attention was given to the formal mathematics of their representation as graphs, to the potentialities of a connection between Hamilton arcuits, convex polytopes, and organic molecules {53}. It is hard to account for such an egregious lapse, one possibly another can- didate for the label of a “postmature discovery” {60}. The topology was perhaps too elementary to engage the interest of serious mathemati- cians—but there are still intractable problems in the enumeration of cyclic graphs (after automorphisms!). Related issues, like the notorious map-coloring problem, illustrate the still primitive state of analytical FIGURE 2 Mappings of cyclic chemical structures onto trivalent graphs—convex polyhedra from (28) How DENDRAL Was Conceived and Born 23 VERTICES ° 2a aA 46 6A aA sc 104A (0B toc 100 10E BaOQRASCSRBYSEBO FIGURE CIRCLE BICYCLANE TETRAHEDRON TRICYCLANE PRISM cueeE 8l- PENTAGON TETRAHEDRON 6I- HEXAGON PENTAGONAL PRISM POLYGON FORM(S) O () EXAMPLE S & Bese R AP & Bo 8 @ BENZENE NAPHTHALENE TRIPHENYLENE ANTHRACENE PYRENE CUBANE BENZOPYRENE PERYLENE BENZOPERYLENE DIBENZO- CHRYSENE DIBENZO~- PYRENE ETHANEDIYLIOENE- CYCLOPENTA- PENTALENE approaches to the taxonomy of graphs. Cayley {12} made a stab (a falla- cious one) at the enumeration of the hydrocarbons; this was improved on by Henze and Blair in the 1930s {5}. In the mid-1960s, Balaban and his colleagues in Romania began their extensive investigations inde- pendently of the work at Stanford {2}. Chemistry has then developed a taxonomy of its own structures that has no coherent mathematical theme. It is full of colorful but trivial names that give no structural information: A few eccentricities like “windowpane” for four fused rectangles are a partial exception. A for- 24 The Keynote Addresses midable burden in learning chemistry is the enormous amount of rote memorization that is entailed in associating names like butane, cholest- ane, cytosine, melezitose, xanthopterin—there are tens of thousands of these—with graphic representations. One may think of these as the passwords for admission to the secret society; they do deter many a student, and they may also impair a critical analytical perspective about organic chemistry. These pictures also have formal names, but the nomenclature that gives the rules for their translation occupies a thick book, mostly the idiosyncratic cases. DENDRAL-64 is a set of reports to NASA (30, 32, 28) that outlines an approach to formal repre- sentation of chemical graph structures, and a generator of all possible ones. Acyclic structures (trees) were readily tractable. Cyclic ones can be dealt with, mainly with the help of a few tricks that rely on an empirical enumeration of the underlying vertex graphs—this is feasible within the bounds of practical chemistry—which is analytically unsatis- fying. It helped to learn about Hamilton Circuits of graphs (paths that touch each node just once) {27}, since the enumeration of these, and the elimination of automorphisms are greatly simplified. When it came to the implementation of DENDRAL for (typical) organic molecules with imbedded rings, Harold Brown, Larry Hjelme- land, and Larry Masinter provided the group-theoretic general mathe- matical solutions to these perplexities {10, 8, 7}. A few molecules have been constructed precisely because they defy some constraints of topo- logical simplicity—e.g., topological planarity, namely their connection graphs cannot be drawn on the plane without bonds crossing; as excep- tions they make history and can be dealt with as such {31, 55). The DENDRAL generator was then designed so that only one ca- nonical form of a possible automorphic proliferation is issued, greatly pruning the space of candidate graphs. This was the essential prerequi- site for an Al program that could manage the generator and confront it with information derived from the mass spectrum. But I had no idea how one would go about translating these structural concepts into a computer program, nor whether this would be computationally feasi- ble with available hardware. Even more telling, 1 had only secondhand access to the field of Al and barely knew how to relate these conjectures to the systematic approaches that were emerging (15). It was fortunate indeed that Ed Feigenbaum came to Stanford just at this time. We promptly got together again and organized the collaboration that be- came the DENDRAL project. Ed now deserves equal time in presenting his personal prehistory. Some of his oral history has appeared in McCorduck’s book {42}. In addition, I have a few of his own words, excerpted from an electronic message: How DENDRAL Was Conceived and Born 25 Date: Thu 8 Mar 84 00:22:01 PST From: Edward Feigenbaum To: lederberg@SUMEX-AIM. ARPA Subject: our history {"reterring to some private notes"} Josh, all of what you have written accords with my memory of things we discussed in 1965 as we quickly got to know each other better. Your mention of mass spectra! analysis as a problem domain in which we should work came as an answer to a question |! posed you. | had de- cided that | wanted to work on constructing models of EMPIRICAL IN- DUCTION IN SCIENCE, within the methodology that | had learned trom Newell and Simon, i.e., work on a concrete task domain, not in the ab- stract. So | needed a concrete task domain. You said you knew of one that contained the essence of the empirical induction problem, that you had been working on it for a while, you even had a computational algo- rithm underlying it (which immediately made me think: aha, legal move generator as in chess-playing programs). ALL of this conversation (em- bryonic research planning) took place AFTER | arrived at Stanford Jan 1, 1965, but | remember that | would not have sought you out for ad- vice on the aforementioned puzzlement had | not met you earlier (April 1963] and learned of your interest in machine models of thinking. Re- call: there were very very few people to talk to about machine models of thinking at Stanford in early 1965. We didn’t just “bump into each other” as in “lucky accident.” You weren't at the [April 63] meeting by “lucky accident.” | didn’t decide to work with you on the mass spec analysis problem because it was of general intellectual stimulation. You had a definite interest in Al and | had a definite interest in hypothesis-tormation/theory-formation. (Inci- dentally, do you remember how we went round and round on whether to deign to call what DENDRAL did “theory formation?” We decided on “hypothesis formation” to distinguish the case of one spectrum being explained by one (or a few) structures. We reserved the use of the term “theory formation” for a later date, for a more general approach, and decided to use it in describing Meta-DENDRAL (many spectra --> rule set). P.S. Some things do appear to be “lucky accidents.” It would appear to be a genuinely lucky accident that | chose to go to college at Carnegie Tech (an accident of Westinghouse scholarships and my family’s finan- cial condition), and a lucky accident that | met Herb Simon through Jim March, and that Herb paid attention to me, and that the Logic Theory program was invented while | was still a Carnegie Tech undergrad and that | was taking a seminar from Herb at the time of its invention. One level deeper: | was an ACTIVE RECEPTOR SITE re the idea of a com- puter. | had never even heard of an electronic digital computer before Herb handed me an IBM 701 manual, but... | had been entranced by 26 The Keynote Addresses mechanical calculator machines in high school and before. My father was an office manager/accountant and owned a giant, heavy Monroe or Marchant calculator. | became an expert on its use. | even remember dragging it with me miles on the bus to Weehawken High School, heavy as it was, just to show off my skill with this marvelous technol- ogy that no other kid in the high school knew anything about. So when Herb gave me that manual, he was projecting me five or six orders of magnitude into a territory | was already fascinated with. It was also very fortunate that my introduction to the electronic computer was via the computer as general symbol manipulator (Herb never mentioned that it was anything BUT that) and that my introduction to programming was via IPL 1 and 2. (| might add that such a sophisticated early view, given to me by Herb and Ai Newell, has taken away most of the awe from later developments; everything else has seemed to be “merely” ex- tensions of the great inventions and discoveries of the 1956-1959 period.) END OF MESSAGE It is now spring 1965, and our project is concretely launched. We began to think of and label it “Heuristic DENDRAL” to mark it as a refinement of the “DENDRAL Algorithm” for generating all the feasi- ble structures. Ed and Richard Watson circulated a bulletin {16}, “An initial problem statement for a machine induction research project,” to graduate students in computer science; but it was to take a few years of slow accretion to organize a cadre of collaborators. One of our first, research associate Georgia Sutherland, did a fabulous job on the formi- dable task of converting the concepts of DENDRAL-64 into a LISP pro- gram, interleaving its production with that of a baby: an early prototype of telecommuting. Her first report was issued February 1967 {56}: We finally had a working program with which we could all exper- iment with heuristics and other measures to bring its performance to practically useful levels. The choice of LISP was originally mandated by the good match of its data structures to trees, to the sparse connection tables of chemical structures. But the memory and bit crunching re- quirements were of course monumental—it’s a wonder we got as far as we did with the hardware of the time. I used to remark, in arguments with ideologues, that in the last analysis it was the programming envi- ronment of INTERLISP that was its key advantage. We were fortunate to have continued support from NASA and from DARPA to continue these explorations. We had quickly found that the campus IBM 7090 had too little memory to support our LISP pro- grams; and we were eager to move to more interactive systems for program development. In 1966, our DARPA sponsorship gave us access to the Q-32 time-sharing system at System Development Corporation (Santa Monica) with a 100-baud teletype interface. My first experience with remote, time-shared hacking was a happy vision of future im- How DENDRAL Was Conceived and Born 27 provements. Then, John McCarthy acquired a DEC PDP-6, and we ap- proached something closer to the modern era. Bruce Buchanan joined our group, and we had great benefit from his philosophical perspec- tive, patience, insight, and administrative acumen, not to mention a lot of hard work in implementation of the software. We gained more and more collaborators, including the explicit involvement of Carl Djerassi and his associates as founts of authentic chemical expertise {64}. As our reports began to appear in refereed chemistry journals, we eventually bolstered our confidence that we were contributing to the scientific do- main, as well as to system building—a point about which some of my colleagues had been skeptical. Broader access to these computer appli- cations became possible with the help of the NIH-supported computer resources: ACME, a general time-sharing system for the Stanford Medi- cal School, and SUMEX-AIM, a national resource to support research in artificial intelligence in medicine {11). The history of SUMEX would take us into many lessons about the social organization of cooperative intelligence. However, as this account is now moving into a time of documented history and numerous publications {41}, I omit many de- tails. Largely owing to the contributions of Carl Djerassi and his col- leagues in natural products chemistry, the program was crammed with chemical information. It was becoming an effective assistant in the analysis of spectra and other analytical information. Buchanan recoded DENDRAL’s knowledge of mass spectrometry, originally embodied in a collection of LISP procedures, into a table of explicit rules separated from the internal operations of the system. This redesign to facilitate augmenting, validating, and editing the informational (ie., rule) base was a paradigm shift later to become the standard for expert systems. Balky resistance of the program to input of new ideas remained the limiting factor in its elaboration. At every weekly group meeting, a dozen new ideas would come up: But we knew that each one would take weeks to implement in tested software code, just to test it. Natural intelligence still enjoys a flexibility of hierarchical planning yet to be achieved in machine emulations {17}. Throughout this time, we would ask ourselves the nagging ques- tion: Was the growing pragmatic success of DENDRAL in solving chemical problems teaching us anything about artificial intelligence? Had we simply crafted a special case, accumulating a hefty store of chemical knowledge from several experts? We did see the need for— and Bruce Buchanan made a stab at—a self-learning system, whereby Meta-DENDRAL could induce its own rules (as the chemist does) by introspecting about concrete data inputs of mass-spectral fragmentation of molecules of known structure. This showed real promise {10}, but was impeded by the insufficiency of computer horsepower needed when DENDRAL itself had to be invoked repetitively to test every new rule candidate induced. 28 The Keynote Addresses We never got a grip on one idea that I hope to return to someday. DENDRAL is remarkably neatly structured (as implied by its name) as a generator of trees of candidate structures {39}. These can easily num- ber in the billions or more, in practical cases. The efficiency of the pro- gram depends on the pruning of impossible or implausible cases, as early as possible; preferably large branches at a fell swoop. The order of application of the shears can have a large effect. To give a stupidly trivial example, if N (nitrogen) is absent, we don’t generate molecules that may contain N, then retrospectively eliminate each of those twigs. We gave some forethought towards optimizing the sequence of shears; but we know this will be case-specific, sometimes in ways we have difficulty predicting. We should build in recurrent introspection about the shearing sequence, make that a specific planning objective, and ex- periment with it from time to time. These considerations (I called it Theta-DENDRAL for reasons not recalled) would have broad generaliz- ability to rule-based systems: The sequence of invocation of rules is | often totally inaccessible to the user, and rarely if ever (as far as I know) is it dynamically regulated. We did do some work on the interesting trade-offs between storing memory of all partially completed branches versus regenerating them as needed. Finally, we had many discussions of the desirability of learn- ing to read expertise from the world’s published books, to bypass the oral tradition. The ultimate fantasy was to attach a high-order DEN- DRAL directly to a mass spectrometer, learning directly from Nature. I wish I had the documentation, but I have an image of a conversa- tion when I was pressing Ed about the limitations of DENDRAL as general intelligence: He responded with the illumination that I may paraphrase: “That’s exactly the point! Knowledge, not tricks or meta- physical insight, is what makes the program effective—and that itself is an insight of general import.” That is why I remark, we were trying to invent AI, and in the process discovered an expert system. This shift of paradigm, “that Knowledge IS Power” was explicated in our 1971 paper {17} and has been the banner of the knowledge-based system movement within AI research from that moment. (Compare Alan Newell's comments {65}.) Shortly thereafter, Bruce Buchanan and Ted Shortliffe initiated the MYCIN project {9}. As Alan Newell remarked in his preface to (91, MYCIN had no pretensions to deep theoretical structure of chemistry, none to outdoing the experts, but only to conveying that expertise as advisory to the general practitioner in optimizing the prescription of antibiotics. Their coding of MYCIN gave a fresh start to the design of rule-based systems that could be readily transported to other applica- tions. How DENDRAL Was Conceived and Born 29 The published documentation after this time is quite rich, and | will refer to that for further historical development. Time now for the numerous morals of the story. Most problematical is the public utility of private autobiography. But biography remains very popular, albeit the main lesson may be the very idiosyncrasy of personal history and character. Worse than no his- tory would be a false conception of it, that it has rigorous rules. As my tale shows, chance does play an enormous role in bringing together people, ideas, situation in a productive way. Were we lucky? Who knows what the alternatives might have been? One lesson of personality should be brought out, especially when the media enjoy characterizing the scientific enterprise as rapacious competition and selfishness. The fraternity that came out of the DEN- DRAL effort was a high in my life experience, matching the gratifica- tions of scientific excitement and (perhaps belated) recognition. One is not always so lucky in one’s colleagues; but we should not glamorize and confuse the pathology as the standard. The project also dramatized the values of electronic communica- tion in project management. Although we certainly met informally from time to time, most of our serious communication (be it a few yards down the hall) was by electronic mail. In this way, innumerable proposals and drafts could be posted on common bulletin boards and subjected to consensual review, often through scores of cycles of reitera- tion. Distance was no consideration, courtesy of the ARPANET, and communication could be sustained during momentary travel; collabo- ration continued when participants moved. (Of course, the manuscript for this chapter has been shared between Rockefeller and Stanford.) Such draft texts, program modules and outputs needed critical scrutiny of a kind that is only possible when one has a copy of the file to work on from one’s own terminal. I went so far as to characterize this mode of communication “The New Literacy,” and I meant it {37}. Databases should not be thought of as static, final repositories but as bulletin boards, subjected to dynamic critical attention by the entire knowledge- able community. Stanford University, in the 1960s, was a fortunate place to be for the pursuit of scientific innovation, and equally for a highly interdisci- plinary program. Computer science, medical science, and chemistry were all in a surge of rapid expansion and new opportunity. If there were no specific facilitations for these kinds of interactions, nor were there rigid impediments. There were potential problems of disciplinary homes for the degrees sought by graduate students; but in the event we never found any students who looked for a degree in what might have been a difficult hybrid of say genetics, chemistry, and informatics. The 30 The Keynote Addresses graduates in the project were able to justify themselves by the stan- dards of the major department. The laissez-faire philosophy of the insti- tution worked admirably, so long as we were able to secure funding. While we had the usual share of crises, we should look back in awe at the forbearance of the three agencies, NASA, DARPA, and NIH, who did make significant risk investments in a novel venture. Needless to say, all of the senior professors were also staking their credibility in the process. There is no guarantee that untenured faculty would have been able to feel so secure. The greatest hurdle in efforts to replicate the experience would be to find experts willing and positioned to be able to forego continued immediate productivity in their own fields, for the sake of longer term ends in system building. Students and fellows may be intimidated by the demands of working across disciplines, and some were concerned that there would be a limited market in say artificial intelligence in molecular biology. Their prudence may be pragmatically justified. The process of knowledge extraction is unbelievably arduous: As always, 90% of the effort must go into debugging and validation. The process can give the expert an opportunity for critical self-reflection about the foundations of the scientific domain. Some of the return on investment of DENDRAL was in its motivating a fresh study of the conceptual structure of organic chemistry, apart from its actual application in com- puter programs. This is to be commended in problem choice in other areas of application, scientific or otherwise. The choice of organic chemistry and mass spectrometry as an ob- ject domain was a matter of careful reflection. It was rich in experimen- tal data, and in a conceptual framework of mechanism that lent itself to model construction on the computer; I had no taste for purely statistical correlations. I might have preferred molecular genetics as more ger- mane, and closer to my own experience. But in 1965 I did not feel it had ripened sufficiently to allow a secure theoretical framework for the nec- essary deductive tests of candidate hypotheses. (By 1975 it had, and this perception was the root of the follow-on MOLGEN project {18].) In his 1961 review, Minsky had been rather critical of generate-and-test paradigms: “for any problem worthy of the name, the search through all possibilities will be too inefficient for practical use.” He had chess playing in mind with 10” possible move paths. It is true that equally intractable problems, like protein folding, are known in chemistry and other natural sciences. These are also difficult for human intelligence. The heuristics we have evolved biologically tend instead to relate to real world faculties like speech and image recognition. Nevertheless, solution spaces of 10° to 10” candidates are both interesting and feasi- ble challenges to computation, and many are of scientific or technologi- cal consequence. Our particular problem in chemical analysis is one of exhaustive elimination, to find ALL solutions that match the spectral How DENDRAL Was Conceived and Born 31 data set. Further measures may then be needed for a final disambigua- tion. Theorem-proving is a reasonably good analogy. Our chemical heu- ristics are second order: to find efficient ways of rigorously pruning the search tree, though it can be helpful to find a single approximate solu- tion from the most plausible genera of chemical structures (e.g., rings limited to 5 or 6 nodes) and examine ways in which it can be altered and give the successfully matching spectrum. Whatever heuristics are used, no search branches can be discarded without the rationale being transparent to the chemist. Unlike chess or image understanding, chemistry does have an intrinsic mathematical structure that permits its move generator to heed the constraints of the data, so that efficiency is more readily achievable. And we have criteria, both for a formally cor- rect candidate (a graph in canonical form), and to know when it is a solution, ie., the test generates a spectrum that matches the data. We played against Nature. In chess (and in war), you have to play against another “expert.” Other areas of natural science deserve a fresh reconnaissance to inspire a reexamination of their conceptual structure. Biology, in partic- ular, will soon suffocate in the sheer bulk of knowledge about DNA and protein structures, and the complex interactions of the causal chains they initiate, unless new epistemological machinery can be in- vented. Our education of physicians and scientists must also place more stress on the skills needed to acquire new knowledge as needed than on rote memorization that will promptly be obsolete {68}. Finally, I would remark that I have never viewed research on artifi- cial intelligence as having much bearing on how the human brain func- tions: There are too many differences in architecture and in levels of complexity, connectivity, and programmability. Nor do I see how neu- robiology has contributed very much to Al. At the highest level of problem-solving routines, expert systems do of course exploit human experience. Lindley Darden’s discussion of “the history of science as compiled hindsight” {66} eloquently captures my own perspectives. My interest in AI has little to do with my background as a biologist, a great deal with curiosity about complex systems that follow rules of their own, and which have great potentialities in preserving the fruits of human labor, of sharing hard-won traditions with the entire commu- nity. In that sense, the knowledge-based system on the computer is above all a remarkable social device, the ultimate form of publication. Acknowledgment It is impossible to give fair and sufficient credit to the many graduate students, collaborators, and programmers who made this effort possi- 32 The Keynote Addresses ble. Where possible, they have been named in {41} and its bibliography. We are also indebted to the agency sponsors, primarily in DARPA, NIH, and NASA whose financial support made the work possible. They often made us work hard on our proposals to justify that support; but it clearly was a gamble that demanded a gift of confidence of un- usual measure. The author would also like to acknowledge the Na- tional Academy of Sciences for permission to reprint material from Topological mapping of organic moelcules, by J. Lederberg, Proceedings of the National Academy of Sciences, vol 53, pp. 134-139, 1965. Bibliography In addition to the references tabulated, the following archival sources would be instrumental for further historical research: The Stanford University Computer Science Department: A.I. Lab Re- ports (microfiches available from Scientific DataLink, Comtex Sci. Corp.) Most of these are indexed in (41). Annual reports to NIH on the DENDRAL and SUMEX projects. Ar- chived at Stanford and Rockefeller Universities. Annual reports to NASA on the S.U., Department of Genetics Instru- mentation Research Laboratory. Archived as above, and invento- ried by the U.S. NTIS. Peer critiques, NIH study sections and councils. Available, in principle, from NIH under the Fol Act. This should be a rich resource in tracing contemporary reactions. Buchanan, Feigenbaum, Lederberg, notes, correspondence, and elec- tronic mail files. To be archived, as above. References 1. Adler, M.J. Propaedia (outline of knowledge). Encyclopedia Britannica, 15 ed. Chicago, 1985. 2. Balaban, A.T. (ed.). Chemical Applications of Graph Theory. New York: Academic Press, 1976. 3. Balzer, W. and Dawe, C.M. Structure and comparison of genetic theories: (I) Classical genetics. Brit. J. Phil. Sci. 37, 1 (Mar.), 55-69, 1986. 4. Beynon, J.H. and Williams, A.E. Mass and Abundance Tables for Use in Mass Spectrometry. New York: Elsevier, 1963. 5. Blair, C.M. and Henze, H.R. The number of structurally isomeric alcohols of the methanol series. J. Amer. Chem. 53, 3042-3046, 1931. How DENDRAL Was Conceived and Born 33 6. Bodmer, WE. and Lederberg, J. Census data for studies of genetic demography. Proc. III Int. Congress of Human Genetics, Baltimore: Johns Hopkins Press, 1967, pp. 459-471. 7. Brown, H. and Masinter, L.M. Algorithm for the construction of the graphs of organic molecules. Discrete Mathematics 8:227-244, 1974. 8. Brown, H., Hjelmeland, L., and Masinter, L.M. Constructive graph labelling using double cosets. Discrete Mathematics 7:1-30, 1974. 9. Buchanan, B.G. and Shortliffe, EH. Rule-based Expert Systems. Reading, Mass.: Addison-Wesley, 1984. 10. Buchanan, B.G., Smith, D.H., White, W.C., Gritter, R., Feigenbaum, E.A., Lederberg, J., and Djerassi, C. Applications of artifi- cial intelligence for chemical inference, XXII. Automatic rule formation in mass spectrometry by means of the meta-DENDRAL program. ]. Am. Chem. Soc. 98:6168-6178, 1976. 11. Carhart, R.E., Johnson, S.M., Smith, D.H., Buchanan, B.G., Dromey, R.G., and Lederberg, J. Networking and a collaborative re- search community: A case study using the DENDRAL programs. In Peter Lykos, (ed.) Computer Networking and Chemistry, ACS Symposium Series, No. 19. Washington: American Chemical Society, 1975. 12. Cayley, A. On the mathematical theory of isomers. Phil. Mag. 47, 444-446, 1874. 13. Cohen, M.R. and Nagel, E. An Introduction to Logic and Scientific Method. New York: Harcourt, Brace and Company, 1934. 14. Ephrussi, B., Leopold, U., Watson, J.D., and Weigle, J.J. Terminol- ogy in bacterial genetics. Nature 171: 701, 1953. 15. Feigenbaum, E.A. and Feldman, J., (eds.). Computers and Thought. New York: McGraw-Hill, 1963. 16. Feigenbaum, E.A. and Watson, R. An initial problem statement for a machine induction research project. Stanford AI Memo # 40, 1965. 17. Feigenbaum, E.A., Buchanan, B.G., and Lederberg, J. On general- ity and problem solving: A case using the DENDRAL program. Ma- chine Intelligence 6, Meltzer, B. and Michie, D., (eds.). Edinburgh: Edinburgh University Press, 1971, pp. 165-190. 18. Friedland, P. and Kedes, L. Discovering the secrets of DNA. Comm. ACM 28:1164-1186, 1985. 19. Garfield, E. Citation Indexing—Its Theory and Application in Science, Technology, and Humanities. Philadelphia: ISI Press, 1979, p. 274. 20. Gould, R.E, (ed.). Kekule Centennial. Advances in Chemistry Series 61. Washington: American Chemical Society, 1966. 21. Jordan, C. Sur les assemblages de lignes. J. fuer die reine und an- gewandte. Math. 70,185 (1869). The Keynote Addresses 22, Judson, H.F. The Eighth Day of Creation: Makers of the Revolution in Biology. New York: Simon & Schuster, 1979. 23. Kinney, H. The year of the gifted children. Think (IBM) 45:12-17, 1979. 24. Lederberg, J. Genetics and microbiology. Symposium on Perspec- tives and Horizons in Microbiology. Rutgers: Rutgers Univ. Press, 1955, pp. 24-39. 25. Lederberg, J. A view of genetics. Les Prix Nobel, 1958, pp. 170- 189. 26. Lederberg, J. Computation of Molecular Formulas for Mass Spectrome- try. San Francisco: Holden-Day, Inc, 1964. 27. Lederberg, J. Hamilton circuits of convex trivalent polyhedra (up to 18 vertices). Am. Math Monthly 74, 522-527, 1965. 28. Lederberg, J. Systematics of organic molecules, graph topology and Hamilton circuits. A general outline of the DENDRAL system. NASA CR-68899. STAR No. N66-14075, 1965. 29. Lederberg, J. Online computation of molecular formulas from mass number. NASA CR-95977; Accession # x68-18613, 1966. 30. Lederberg, J. DENDRAL-64. A system for computer construction, enumeration & notation of organic molecules as tree structures and cy- clic graphs. Part I. Notational algorithm for tree structures. NASA CR- 57029. STAR No. N65-13158, 1964. 31. Lederberg, J. Topological mapping of organic molecules. Proc. Nat. Acad. Sci. U.S. 53, 134-139, 1965. 32. Lederberg, J. DENDRAL-64. Part II. Topology of cyclic graphs. NASA CR-68898. STAR No. N66-14074, 1965. 33. Lederberg, J. Signs of Life: Criterion system of exobiology. Nature 207, 9-13, 1965. 34. Lederberg, J. Topology of molecules. The Mathematical Sciences (COSRIMS). Cambridge: MIT Press, 1969, pp. 37-51. 35. Lederberg, J. Rapid calculation of molecular formulas from mass values. J. Chem. Ed. 49, 613, 1972. 36. Lederberg, J. Foreword to Essays of an Information Scientist by E. Garfield. Philadelphia: ISI Press, 1977, pp. xi-xi. 37. Lederberg, J. Digital communications and the conduct of science: The new literacy. Proc. of the IEEE 1978, 66(11):1314-1319. 38. Lederberg, J. Forty years of genetic recombination in bacteria. A fortieth anniversary reminiscence. Nature 327:627-628, 1986. 39. Lederberg, J., Sutherland, G.L., Buchanan, B.G., Feigenbaum, E.A., Robertson, A.V., Duffield, A.M., and Djerassi, C. Applications of How DENDRAL Was Conceived and Born 35 artificial intelligence for chemical inference, I. The number of possible organic compounds. Acyclic structures containing C, H, O, and N. ]. Am. Chem. Soc. 91:2973-2976, 1969. 40. Lewin, R. National networks for molecular biologists. Science 223, 1379-1380, 1984. 41. Lindsay, R.K., Buchanan, B.G., Feigenbaum, E.A., and Lederberg, J. Applications of Artificial Intelligence for Organic Chemistry: The DEND- RAL Project. New York: McGraw-Hill Book Co, 1980. 42. McCorduck, P. Machines Who Think. San Francisco: W.H. Freeman and Company, 1979. 43. Medawar, PB. Is the scientific paper fraudulent? Saturday Rev. (1 Aug.), pp. 42-43, 1964. 44. Merton, RK. Social Theory and Social Structure. New York: Free Press, 1968, pp. 4-7. 45. Merton, R.K. The sociology of science: An episodic memoir. Mer- ton, R. K and Gaston, J, (eds.). The Sociology of Science in Europe. Car- bondale, Illinois: Southern Illinois University Press, 1977, see esp. pp. 119-120. 46. Minsky, M. Steps toward artificial intelligence. Proc. Inst. Radio Eng. 49:8-30, 1961. 47. Models for Biomedical Research. Washington, D.C.: National Acad- emy Press, 1985, p. 180. 48. Partington, J.R. A History of Chemistry, Vol. IV. London: Macmil- lan & Co. Ltd., 1964. 49. Polya, G. Kombinatorische Anzahlbestimmungen fur Gruppen, Graphen und Chemische Verbindungen. Acta Math. 68, 145-254, 1938. 50. Pres. Sci. Adv. Comm. Science, Government, and Information. Panel on Scientific Information Report. (A. Weinberg, chmn.) Washing- ton: USGPO, 1963. 51. Quastler, H., (ed.). Essays on the Use of Information Theory in Biol- ogy. Urbana: University of Illinois Press, 1953, p. 273. 52. Reynolds, W.E., Bacon, V.A., Bridges, J.C., Coburn, T.C., Lederberg, J., Levinthal, E.C., Steed, E., and Tucker, RB. A computer operated mass spectrometer system. Analytical Chem. 42, 1122-1129. 53. Rouvray, D.H. Graph theory in chemistry. RIC Reviews 4:173-195, 1971. 54. Savageau, M.A. Biochemical Systems Analysis. A Study of Function and Design in Molecular Biology. Reading, Mass.: Addison-Wesley, 1976, p- 379. 55. Simmons, H.E. and Maggio, J-E. Synthesis of the first topologi- cally nonplanar molecule. Tetrahedron Letters 22, 287-290, 1981. 36 The Keynote Addresses 56. Sutherland, G. DENDRAL—a computer program for generating and filtering chemical structures. Stanford Artificial Intelligence Memo #49, 1967. 57. Thomas, R., (ed.). Lecture Notes in Biomathematics. Kinetic Logic: A Boolean Approach to the Analysis of Complex Regulatory Systems. Berlin and New York: Springer-Verlag, 1979. 58. Whitehead, A.N. and Russell, B. Principia Mathematica, 3 vols, 2nd ed. London: Cambridge at the University Press, 1950. 59. Woodger, J.H. The Axiomatic Method in Biology. London: Cam- bridge at the University Press, 1937. 60. Zuckerman, H.A. and Lederberg, J. Forty years of genetic recom- bination in bacteria. Postmature scientific discovery? Nature 327:629- 631, 1986. 61. Zuckerman, H.A. and Merton, R.K. Patterns of evaluation in sci- ence. Institutionalization, structure and functions of the referee system. Minerva 9:66-100, 1971. ADDITIONAL REFERENCES 62. Rindfleisch, T.C., Smith, D.H., Yeager, W.J., Achenbach, M.W., and Wegmann; A. Mass spectrometer data acquisition and processing systems. Biomedical Applications of Mass Spectrometry, First Supplemen- tary Volume, Waller, G.R. and Dermer, O.C., (eds.). New York: John Wiley and Sons, Inc., 1980, Chapter 3, pp. 55-77. 63. Huang, Z. and Wu, W. MDAL: A novel algorithm for the compu- tation of empirical formulas in high-resolution mass spectrometry. Bio- medical Env. Mass. Spec. 13:187-191, 1986. 64. Djerassi, C. Organic chemistry: A view through steroid glasses. Modern Organic Chemistry: Scientific and Historic Perspectives, Seeman, J.L, (ed.). Washington: American Chemical Society, 1988. 65. Newell, A. Intellectual issues in the history of artificial intelli- gence. The Study of Information: Interdisciplinary Messages. Machlup, F. and Mansfield, U. New York: Wiley Interscience, 1983, pp. 187-227. 66. Darden, L. Viewing the history of science as compiled hindsight. Al Magazine 8(Summer): 33-41, 1987. 67. Lederberg,J. Sputnik 1957-1987. The Scientist Oct. 5, 1987. 68. Lederberg, J. Computers in Science, Communication and Eduaation, AAMC Symp. on Medical Informatics, held March 7, 1985, pp. 62-70. Med- ical Education in Information Age. Washington: AAMC, 1986. How DENDRAL Was Conceived and Born 37 Appendix This memo is an excellent snapshot of the status of AI research as seen in 1965, and therefore is appended in its entirety. STANFORD ARTIFICIAL INTELLIGENCE PROJECT April 5, 1965 Memo No. 30 AN INITIAL PROBLEM STATEMENT FOR A | MACHINE INDUCTION RESEARCH PROJECT by E.A. Feigenbaum and R.W. Watson Abstract: A brief description is given of a research project presently getting under way. This project will study induction by machine using organic chemistry as a task area. Topics for graduate student research related to the problem listed. The research reported here was supported in part by the Advanced Research Projects Agency of the Office of the Secretary of Defense (SD- 183). We are engaged, in conjunction with Professor Lederberg of the medical school, in a research project which offers possibilities for grad- uate research, both well defined problems suitable for C.S, 239 projects and not so well defined problems suitable for PhD thesis topics. In this memorandum we will define the problem briefly and then outline some suggested projects. If you are interested in any of the projects or topics suggested, or have a topic to suggest related to this project see either of us for further details. The long-range goal of this research is to attempt to come to grips with the problem of induction by machine. That is, how does one build a machine (write a program) which can interact through a suitable in- terface with its environment and build and improve models of the en- vironment. The specific task area chosen in which to attack this problem is organic chemistry and in particular, the determination of the structure of organic molecules from mass spectrograph data. The problem pres- ently facing a chemist is roughly the following: 1. A quantity of an organic molecule is supplied to a mass spectrome- ter. 2. The molecules are bombarded with electrons which break up the molecules into ionized subparts. 3. The mass spectrometer outputs a spectrum (i.e., a distribution of the masses of the subparts). 38 The Keynote Addresses 4. The largest mass in the distribution which occurs in any quantity above a given noise level is that of the parent molecule. 5. By trying various combinations of atoms the chemist finds molecu- lar compositions which have a mass equal to that determined in 4. If the resolution of the mass spectrometer is fine enough the deter- mination of a unique composition is possible. 6. Once the chemical composition, or possible compositions, of the molecule is determined, the chemist uses various heuristics in con- junction with the mass spectrum to determine the structure of the molecule. The computer science problem is to automate the above process. At the present time we see the project as progressing in the following stages. Stage O—Display of Chemical Structures Professor Lederberg has developed a linear notation for organic molec- ular structures. Further, he has devised an algorithm which, given a chemical composition as an input, will produce as an output all topo- logically unique organic structures corresponding to this composition. The system is called “DENDRAL” and exists as an Algol program for the B-5000 written by Larry Tessler. At the present time many of the structures are not chemically meaningful. Therefore, our first task will be to develop a system which will interact with a chemist and the DENDRAL system and determine rules for chemically meaningful structures. These rules will be automat- ically incorporated into a “filter” for the DENDRAL system. Presently a program for the PDP-1 exists which accepts a linear DENDRAL string and displays a chemical graph on the Philco scope. The problem then of Stage O is to improve this program and to develop the software for tying it in with Larry Tessler’s program through the disc and which will allow us to use LISP on the 7090 from the Philco scopes. Stage 1—Chemist at the Philco Keyboard During Stage 1 we will develop the programming techniques which will allow a dialogue to take place between the chemist and the system for growing the filter on the DENDRAL output. This system will in- volve the display of a graph and the chemist’s determination of whether or not it is chemically meaningful. The system must then ques- How DENDRAL Was Conceived and Born 39 tion the chemist to find out what rules the chemist is using for his determinations and accept his answers in a suitable language. In gen- eral, the chemist will not be explicitly conscious of the rules he is using, and the machine will serve the important function of helping to bring these rules to a precise awareness. The end result of Stage 1 is that we will have an improved DEN- DRAL system and have learned some important and useful computing techniques. An improved DENDRAL system and associated display should also be of value to those interested in the problems of informa- tion retrieval associated with the chemical sciences. Stage 2—Mass Spectrograph Analysis In Stage 2 a chemist and a machine interact in real time through the medium of a scope, scope keyboard, typewriter, and possibly light pen or tablet. If the machine were used strictly for performing clerical and algorithmic processes, the following dialogue would result. 1. The machine would be supplied with the mass spectrum and would display on the scope face a histogram and the chemical composition(s) of the molecule. 2. The chemist using his experience and peripheral information would then input a linear description of a trial structure which would then be displayed on the scope as a chemical graph, or the DENDRAL system would be invoked to systematically display chemical graphs which correspond to the given composition. 3. The chemist, using his knowledge of likely places for breaks to occur in the above structure when under electron bombardment, would indicate such a break on the graph. The machine would then compute the mass of the subparts and indicate whether or not such a mass exists in the histogram. Or, the chemist would in- dicate a mass number in the histogram and the machine would in- dicate whether or not a subgraph exists which has this mass and if it does exist indicate which subgraph it is. 4. The chemist may also want to move various subgraphs from one place to another and then proceed as above. The machine will then compute the linear canonical form of these new graphs and possibly change the display to a canonical form. Further, the DENDRAL system may be invoked to systematically change a given subgraph. 5. The chemist eventually finds a structure which he hypothesizes as capable of yielding the mass spectrum. 40 The Keynote Addresses What we want is for the machine to be used not only for clerical work, but more importantly to learn from the chemist’s behavior and there- fore take over much of the analysis on its own. To this end we visualize the following variation of the above dialogue. Initially the machine would input the correct structures corre- sponding to different chemical compositions. The chemist would then proceed to present an example analysis of this structure in conjunction with its mass spectrum; finally concluding with the known result that the structure could have yielded the given mass spectrum. During this process the machine will probe the chemist for the rules leading to his behavior. The machine will incorporate these rules in a data structure, which will allow the machine to perform a similar analysis. The machine will then be given a chemical structure corresponding to a given mass spectrum and will be asked to proceed on a step by step analysis of its own. The machine will report its “reasoning” to the chemist as it proceeds. When the machine makes an incorrect step the chemist will interrupt, and a dialogue will take place until the machine can make the correct step. Finally, when the machine can correctly analyze structures known to correspond to given mass spectrums, the system will be given a composition and the DENDRAL generator will be invoked to systemat- ically present for analysis possible structures. Then a dialogue of the following type will take place: The machine will proceed with an anal- ysis as far as it is able and then the chemist will take over. As the chemist manipulates the graph with machine aid, the machine queries the chemist for the rules governing his behavior and a dialogue takes place. Eventually the chemist reaches a hypothesis that the given struc- ture could or could not yield the given mass spectrum. The machine then proceeds to analyze the structure on its own to see if it would reach the same hypothesis. If not, a further dialogue takes place until the machine can reach the hypothesis of the chemist. When the machine seems adequate at this task we proceed to Stage 3. Stage 3—Good Initial Guesses as to Chemical Structure In Stage 2 the man and machine proceeded systematically through the structures produced by DENDRAL. Clearly for any large structures the number of isomers of a given chemical composition could run into the millions. Therefore, the chemist must make a good initial guess as to a possible structure and only rely on the DENDRAL generator to modify How DENDRAL Was Conceived and Born 41 subgraphs. Again the chemist and system interact, with the machine querying the chemist to determine the rules for proposing initial struc- tures. The procedures to be followed will be similar to those of the previous stage. Stage 4—Refinement of the System When Stage 3 is completed the system will be a good mass spectrum analyzer. However, the data structures produced during this stage will be complicated, duplicated, and in general unlikely to be optimal. Therefore, the program, and associated data structures which result from Stage 3, will be carefully analyzed to determine how to write an efficient, compact system and to determine which sections contain gen- eral chemical knowledge and which contain knowledge of a specialized character, useful mainly for mass-spectrograph analysis. The final effi- cient program which results will form the software for some experi- ments to be undertaken by a suggested Mars probe and the efficient program minus the specialized structures will form the basis for a sys- tem to be applied to some other chemical tasks such as the synthesis of organic molecules. The following problems suggest themselves as possible research projects. 1. Display problems: In order that the display of the chemical graphs be as useful as possible to the chemist, it should display the graphs in a form as close as possible to that to which the chem- ist is trained. This task is difficult to do automatically with our present experience. Therefore, one possible approach at this time is to develop a system which automatically displays a graph close to that desired by the chemist and then allows the chemist to ma- nipulate substructures by simple rotations and bond-length adjust- ments. Another possibility is to allow the chemist to “draw” the graph from the keyboard or with a light pen when it is available. Because of the size limitations of the scope face, it will not be pos- sible to display large molecular structures in their entirety. There- fore, it would be useful to have a “window” mechanism which will allow the chemist to study subsections. Other features are needed which will allow one to save displays, display more than one graph at a time, and perform text editing on the linear input. It would also be useful to allow the chemist to build an initial structure and to later make insertions and dele- tions as well as move a given substructure to another point on the graph. 42 The Keynote Addresses As the work on the display proceeds, feedback from chemists will indicate other useful refinements to the display system. . Various programs need to be written which will allow us to use the facilities of the 7090 from the Philco keyboard. . Problems relating to DENDRAL: DENDRAL is a system for ca- nonical representation of chemical structures. However, the chem- ist is usually not trained in this system and would probably find it easier to input a noncanonical linear string. Therefore, it would be of value to have a routine which would convert this string to a ca- nonical one. Other more abstract problems relating to the DENDRAL generator are supplied by Professor Lederberg in Appendix A. . Mass spectrograph analysis problems: The chemist will want to have a histogram displayed or some display containing equivalent information. The chemist will further want to indicate a given mass number and have the system determine whether or not there is a subgraph with the indicated mass. The work on this problem will lead to abstract on the searching and comparison of list struc- tures. It will also be of use to the chemist to be able to indicate a given bond as a likely place for a break to have occurred when under electron bombardment and have the system determine if the masses of the subparts are in the distribution. The chemist will also want to be able to invoke the DENDRAL generator to system- atically mark and change subgraphs. . The DENDRAL filter growing problem: As mentioned before, the DENDRAL generator will generate all topologically unique structures regardless of whether or not they are chemically mean- ingful. The problem here is to grow, on-line, a filter which will only allow chemically meaningful structures to be displayed. To solve this problem, techniques need to be developed so that the chemist can be questioned for his rules of chemical meaningful- ness and so that his responses can be dynamically incorporated in a changing data structure. Because the chemist will not always give correct rules, methods must be introduced to guard against the possibility of incorrect rules permanently entering the system. Persons interested in natural language and the computer or for- mal languages may be interested in this phase of the work. . Advanced mass spectrograph analysis problems: Related to the problem above will be the development of techniques which allow the rules supplied by the chemist for analyzing structures to be directly introduced into an internal machine structure. This How DENDRAL Was Conceived and Born 43 structure will allow the system to perform the same functions as the chemist and report to the chemist the important stages of its analysis. The detailed problems in this area will only become clear as we proceed. It would seem to us that the problems related to the display are the most suitable for M.S. projects as they are quite well defined. The more challenging problems related to the DENDRAL system and filter and Stage 2 would seem to be of the greatest interest to those contemplating doctoral research. References Lederberg, J. DENDRAL-64: A system for computer construction, enu- meration and notation of organic molecules as free structures and cyclic graphs. Interim Report to the National Aeronautics and Space Administration, December 15, 1964. (Available from either author of this memo). Lederberg, J. Topological mapping of organic molecules. Proc. Nat. Acad. Sci. 53:134-139, 1965. (Available from either author of this memo). Appendix A A number of problems in combinatorial graph theory, abstract groups, symmetry, and related subjects have arisen. Some of these would con- tribute to the elegance and efficiency of the DENDRAL system. Other questions are more abstract and have been suggested by the chemical graphs. a. Enumeration of cyclic trivalent graphs. This includes the polyhe- dra. Grace (a former Stanford mathematics graduate student) has done a possibly vulnerable enumeration up to the 18th order. b. Efficient test for isomorphism and reduction to canonical forms. c. Programming to anticipate symmetries and avoid retrospective elimination of isomorphs. d. What is the least polyhedron lacking a Hamilton circuit? Now known 20 < n < 46. e. Generalization of the Hamilton circuit (in the sense of mapping a graph on to segments of a circle) to mappings on higher order fig- The Keynote Addresses ures. In DENDRAL-64 the treatment of non-Hamiltonian cyclic graphs’ remains somewhat messy. f. Heuristic approaches to finding a Hamilton circuit of a graph. g. Enumeration of graphs with some 4-valent vertices. In DENDRAL- 64 this is also somewhat messy, being treated by the collapse of 4- node circuits into 4-valent nodes. ‘e.g.: e t a ™~ NL