Frederick Jelinek: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
OAbot (talk | contribs)
m Open access bot: doi updated in citation with #oabot.
 
(31 intermediate revisions by 19 users not shown)
Line 2: Line 2:
{{Use mdy dates|date=March 2014}}
{{Use mdy dates|date=March 2014}}
{{Infobox scientist
{{Infobox scientist
| birth_name = Bedřich Jelínek
| birth_name = Bedřich Jelínek
| birth_date = {{Birth date|1932|11|18}}
| birth_date = {{Birth date|1932|11|18}}
| birth_place = [[Kladno]], now [[Czech Republic]]
| birth_place = [[Kladno]], now [[Czech Republic]]
| death_date = {{Death date and age|2010|09|14|1932|11|18}}
| death_date = {{Death date and age|2010|09|14|1932|11|18}}
| death_place = [[Baltimore]], United States
| death_place = [[Baltimore]], United States
| citizenship = American
| citizenship = American
| fields = [[Information theory]], [[natural language processing]]
| fields = [[Information theory]], [[natural language processing]]
| residence =
| residence =
| workplaces = [[Cornell University]], [[IBM Research]], [[Johns Hopkins University]]
| workplaces = [[Cornell University]], [[IBM Research]], [[Johns Hopkins University]]
| alma_mater = [[Massachusetts Institute of Technology]]
| alma_mater = [[Massachusetts Institute of Technology]]
| doctoral_advisor = [[Robert Fano]]
| doctoral_advisor = [[Robert Fano]]
| notable_students =[[Neil Sloane]]
| notable_students = [[Neil Sloane]]
| known_for = Advancement of natural language processing techniques
| known_for = Advancement of natural language processing techniques
| awards = *[[IEEE James L. Flanagan Speech and Audio Processing Award|James L. Flanagan Award]] (2005)
| influences = [[Roman Jakobson]]
| influenced =
| awards =
*[[IEEE James L. Flanagan Speech and Audio Processing Award|James L. Flanagan Award]] (2005)
*[[Association for Computational Linguistics|ACL]] Lifetime Achievement Award (2009)
*[[Association for Computational Linguistics|ACL]] Lifetime Achievement Award (2009)
| signature = <!--(filename only)-->
| signature = <!--(filename only)-->
| footnotes =
| footnotes =
|spouse=[[Milena Jelinek]]
| spouse = [[Milena Jelinek]]
}}
}}


'''Frederick Jelinek''' (18 November 1932 – 14 September 2010) was a [[Czech American|Czech-American]] researcher in [[information theory]], [[automatic speech recognition]], and [[natural language processing]]. He is well known for his oft-quoted statement, "Every time I fire a linguist, the performance of the speech recognizer goes up".{{#tag:ref|Although its fame and iconic status are undisputed (it was for example used as the title of a 1998 speech by Julia Hirschberg),<ref name="hirschberg"/> its context is unknown and its specific wording and dating are unclear. According to [[Daniel Jurafsky]] and James H. Martin, Jelinek himself recalled the quote as "Anytime a linguist leaves the group the recognition rate goes up" and dated it to December 1988 (Wayne, Pennsylvania), further noting that the quote did not appear in the published proceeding,<ref name="jurafsky2009"/><ref name="palmer1990"/> whereas Roger K. Moore gave the wording as "Every time we fire a phonetician/linguist, the performance of our system goes up" and dated it to an IEEE Automatic Speech Recognition and Understanding workshop held in 1985.<ref name="moore2005"/> According to Steve Young, "the story goes that one day one of his linguists resigned, and Fred decided to replace him not by another linguist but by an engineer. A little while later, Fred noticed that the performance of his system improved significantly. So he encouraged another linguist to find alternative employment, and sure enough performance improved again."<ref name="young"/>|group=note}}
'''Frederick Jelinek''' (18 November 1932 – 14 September 2010) was a [[Czech American|Czech-American]] researcher in [[information theory]], [[automatic speech recognition]], and [[natural language processing]]. He is well known for his oft-quoted statement, "Every time I fire a linguist, the performance of the speech recognizer goes up".{{#tag:ref|Although its fame and iconic status are undisputed (it was for example used as the title of a 1998 speech by [[Julia Hirschberg]]),<ref name="hirschberg"/> its context is unknown and its specific wording and dating are unclear. According to [[Daniel Jurafsky]] and James H. Martin, Jelinek himself recalled the quote as "Anytime a linguist leaves the group the recognition rate goes up" and dated it to December 1988 (Wayne, Pennsylvania), further noting that the quote did not appear in the published proceeding,<ref name="jurafsky2009"/><ref name="palmer1990"/> whereas Roger K. Moore gave the wording as "Every time we fire a phonetician/linguist, the performance of our system goes up" and dated it to an IEEE Automatic Speech Recognition and Understanding workshop held in 1985.<ref name="moore2005"/> According to Steve Young, "the story goes that one day one of his linguists resigned, and Fred decided to replace him not by another linguist but by an engineer. A little while later, Fred noticed that the performance of his system improved significantly. So he encouraged another linguist to find alternative employment, and sure enough performance improved again."<ref name="young"/>|group=note}}


Jelinek was born in [[Czechoslovakia]] just before the outbreak of [[World War II]] and emigrated with his family to the United States in the [[History of Czechoslovakia (1948–1989)|early years]] of the communist regime. He studied engineering at the [[Massachusetts Institute of Technology]] and taught for 10 years at [[Cornell University]] before being offered a job at [[IBM Research]]. In 1961, he married Czech screenwriter [[Milena Jelinek]]. At IBM, his team advanced approaches to computer speech recognition and machine translation. After IBM, he went to head the [[Center for Language and Speech Processing]] at [[Johns Hopkins University]] for 17 years, where he was still working on the day he died.
Jelinek was born in [[Czechoslovakia]] before [[World War II]] and emigrated with his family to the United States in the [[History of Czechoslovakia (1948–1989)|early years]] of the communist regime. He studied engineering at the [[Massachusetts Institute of Technology]] and taught for 10 years at [[Cornell University]] before accepting a job at [[IBM Research]]. In 1961, he married Czech screenwriter [[Milena Jelinek]]. At IBM, his team advanced approaches to computer speech recognition and machine translation. After IBM, he went to head the [[Center for Language and Speech Processing]] at [[Johns Hopkins University]] for 17 years, where he was still working on the day he died.


== Personal life ==
== Personal life ==
Jelinek was born on November 18, 1932, as '''Bedřich Jelínek'''<ref name="rejzek"/> in [[Kladno]] to Vilém and Trude Jelinek.<ref name=jelinek1997 /> His father was Jewish; his mother was born in Switzerland to Czech Catholic parents and had converted to Judaism.<ref name="hajic2010"/><ref name=lohr2010 /> Jelinek senior, a dentist, had planned early for an escape to England; he arranged for a passport, visa, and the shipping of his dentistry materials. The couple planned to send their son to an English [[Independent school (United Kingdom)|private school]]. However, Vilém decided to stay at the last minute and was eventually sent to the [[Theresienstadt concentration camp]],<ref name=jelinek2001/> where he died in 1945.<ref name=jelinek1997 /><ref name=lohr2010 /> The family was forced to move to [[Prague]] in 1941, but Frederick, his sister and mother{{mdash}}thanks to the latter's background{{mdash}}escaped the concentration camps.<ref name=lohr2010 />
Jelinek was born on November 18, 1932, as '''Bedřich Jelínek'''<ref name="rejzek"/> in [[Kladno]] to Vilém and Trude Jelínek.<ref name=jelinek1997 /> His father was Jewish; his mother was born in Switzerland to Czech Catholic parents and had converted to Judaism.<ref name="hajic2010"/><ref name=lohr2010 /> Jelínek senior, a dentist, had planned early to escape Nazi occupation and flee to England; he arranged for a passport, visa, and the shipping of his dentistry materials. The couple planned to send their son to an English [[Independent school (United Kingdom)|private school]]. However, Vilém decided to stay at the last minute and was eventually sent to the [[Theresienstadt concentration camp]],<ref name=jelinek2001/> where he died in 1945.<ref name=jelinek1997 /><ref name=lohr2010 /> The family was forced to move to [[Prague]] in 1941, but Frederick, his sister and mother{{mdash}}thanks to the latter's background{{mdash}}escaped the concentration camps.<ref name=lohr2010 />


{{Quote box|align=right|width=26em|qalign=right|salign=left|quote = "It is generally believed that scientific talent reveals itself in early youth. [...] This was certainly not my case. I somehow slid into my scientific profession. My mother wished for me to become a physician, just like my father. [...] I myself wanted to be a lawyer, defender of the unjustly accused. But my career is the result of political circumstances, academic possibilities, and lucky accidents."|source =Talking about his life in a 2001 speech.<ref name=jelinek2001/>
{{quote box|align=right|width=26em|qalign=right|salign=left|quote = It is generally believed that scientific talent reveals itself in early youth. ... This was certainly not my case. I somehow slid into my scientific profession. My mother wished for me to become a physician, just like my father. ... I myself wanted to be a lawyer, defender of the unjustly accused. But my career is the result of political circumstances, academic possibilities, and lucky accidents.|source = —Talking about his life in a 2001 speech.<ref name=jelinek2001/>}}
}}


After the war, Jelinek entered in the [[gymnasium (school)|gymnasium]], despite having missed several years of schooling because education of Jewish children had been forbidden since 1942. His mother, anxious that her son should get a good education, made great efforts for their emigration,<ref group=note>As he put it, "she didn't want to emulate my father's big mistake."</ref> especially when it became clear he would not be allowed to even attempt the graduation examination. His mother hoped her son would become a physician, but Jelinek dreamed of being a lawyer. He studied engineering in evening classes at the [[City College of New York]] and received stipends from the [[National Committee for a Free Europe]] that allowed him to study at the [[Massachusetts Institute of Technology]]. About his choice of specialty, he said: "Fortunately, to electrical engineering there belonged a discipline whose aim was not the construction of physical systems: the theory of information".<ref name=jelinek2001/> He obtained his Ph.D. in 1962, with [[Robert Fano]] as his adviser.<ref name="vitae"/><ref name=jelinek2009/>
After the war, Jelinek entered in the [[gymnasium (school)|gymnasium]], despite having missed several years of schooling because education of Jewish children had been forbidden since 1942. His mother, anxious that her son should get a good education, made great efforts for their emigration,<ref group=note>As he put it, "she didn't want to emulate my father's big mistake."</ref> especially when it became clear he would not be allowed to even attempt the graduation examination. His mother hoped her son would become a physician, but Jelinek dreamed of being a lawyer. He studied engineering in evening classes at the [[City College of New York]] and received stipends from the [[National Committee for a Free Europe]] that allowed him to study at the [[Massachusetts Institute of Technology]]. About his choice of specialty, he said: "Fortunately, to electrical engineering there belonged a discipline whose aim was not the construction of physical systems: the theory of information".<ref name=jelinek2001/> He obtained his Ph.D. in 1962, with [[Robert Fano]] as his adviser.<ref name="vitae"/><ref name=jelinek2009/>


In 1957, Jelinek paid an unexpected visit to Prague. He had been in [[Vienna]] and applied for a visa, hoping to see his former acquaintances again. He met with his old friend [[Miloš Forman]], who introduced him to film student [[Milena Jelinek|Milena Tabolova]]{{mdash}}whose screenplay had been the basis for the movie ''Easy Life'' (''Snadný život'').<ref name=hershenson/><ref name=willoughby/> His flight back to the U.S. had a stopover in Munich, during which he called her to propose.<ref name=lohr2010/> Tabolova was considered a dissident and the authorities were not happy with her film.<ref name=willoughby/> Jelinek asked for help from [[Jerome Wiesner]] and [[Cyrus Eaton]], the latter who lobbied [[Nikita Khrushchev]].<ref name=hershenson/> Following the inauguration of [[John F. Kennedy]], a group of Czech dissidents were allowed to emigrate in January 1961. Thanks to the lobbying, the future Milena Jelinek was one of them.<ref name=lohr2010/><ref name=hershenson/>
In 1957, Jelinek paid an unexpected visit to Prague. He had been in [[Vienna]] and applied for a visa, hoping to see his former acquaintances again. He met with his old friend [[Miloš Forman]], who introduced him to film student [[Milena Jelinek|Milena Tobolová]]{{mdash}}whose screenplay had been the basis for the movie ''Easy Life'' (''Snadný život'').<ref name=hershenson/><ref name=willoughby/> His flight back to the U.S. had a stopover in Munich, during which he called her to propose.<ref name=lohr2010/> Tobolová was considered a dissident and the authorities were not happy with her film.<ref name=willoughby/> Jelinek asked for help from [[Jerome Wiesner]] and [[Cyrus Eaton]], the latter who lobbied [[Nikita Khrushchev]].<ref name=hershenson/> Following the inauguration of [[John F. Kennedy]], a group of Czech dissidents were allowed to emigrate in January 1961. Thanks to the lobbying, the future Milena Jelinek was one of them.<ref name=lohr2010/><ref name=hershenson/>


After completing his graduate studies, Jelinek, who had developed an interest in [[linguistics]], had plans to work with [[Charles F. Hockett]] at [[Cornell University]]. However these fell through and during the next ten years he continued to study information theory.<ref name=jelinek2001/> Having previously worked at [[IBM]] during a sabbatical, he began full-time work there in 1972{{mdash}}at first on leave for Cornell, but permanently from 1974. He remained there for over twenty years. Although at first he had been offered a regular research job, upon his arrival he learned that [[Josef Raviv]] had recently been promoted to head of the newly opened [[IBM Haifa Research Laboratory]], and became head of the Continuous Speech Recognition group at the [[Thomas J. Watson Research Center]].<ref name=jelinek2001/><ref name=jelinek2009/> Despite his team's successes in this area, Jelinek's work remained little known in his home country because Czech scientists were not allowed to participate in key conferences.<ref name=hershenson/>
After completing his graduate studies, Jelinek, who had developed an interest in [[linguistics]], had plans to work with [[Charles F. Hockett]] at [[Cornell University]]. However these fell through and during the next ten years he continued to study information theory.<ref name=jelinek2001/> Having previously worked at [[IBM]] during a sabbatical, he began full-time work there in 1972{{mdash}}at first on leave for Cornell, but permanently from 1974. He remained there for over twenty years. Although at first he had been offered a regular research job, upon his arrival he learned that [[Josef Raviv]] had recently been promoted to head of the newly opened [[IBM Haifa Research Laboratory]], and became head of the Continuous Speech Recognition group at the [[Thomas J. Watson Research Center]].<ref name=jelinek2001/><ref name=jelinek2009/> Despite his team's successes in this area, Jelinek's work remained little known in his home country because Czech scientists were not allowed to participate in key conferences.<ref name=hershenson/>


After the 1989 fall of communism, Jelinek helped establish scientific relationships, regularly visiting to lecture and helping to persuade IBM to establish a computing centre at [[Charles University]].<ref name="hajic2010"/><ref name=jelinek2001/><ref name=dresser2010/> In 1993, he retired from IBM and went to [[Johns Hopkins University]]'s Center for Language and Speech Processing, where he was director and Julian Sinclair Smith Professor of Electrical and Computer Engineering.<ref name=vitae/><ref name=jhu/> He was still working there at the time of his death; Jelinek died of a heart attack at the close of an otherwise normal workday in mid-September 2010.<ref name=lohr2010 /><ref name=jhu/> He was survived by his wife, daughter and son, sister, stepsister, and three grandchildren: including Sophie Gold Jelinek.
After the 1989 fall of communism, Jelinek helped establish scientific relationships, regularly visiting to lecture and helping to persuade IBM to establish a computing centre at [[Charles University]].<ref name="hajic2010"/><ref name=jelinek2001/><ref name=dresser2010/> In 1993, he retired from IBM and went to [[Johns Hopkins University]]'s Center for Language and Speech Processing, where he was director and Julian Sinclair Smith Professor of Electrical and Computer Engineering.<ref name=vitae/><ref name=jhu/> He was still working there at the time of his death; Jelinek died of a heart attack at the close of an otherwise normal workday in mid-September 2010.<ref name=lohr2010 /><ref name=jhu/> He was survived by his wife, daughter and son, sister, stepsister, and three grandchildren, including Sophie Gold Jelinek.


== Research and legacy ==
== Research and legacy ==
[[Information theory]] was a fashionable scientific approach in the mid '50s.<ref name=jelinek2009/> However, pioneer [[Claude Shannon]] wrote in 1956 that this trendiness was dangerous. He said, "Our fellow scientists in many different fields, attracted by the fanfare and by the new avenues opened to scientific analysis, are using these ideas in their own problems&nbsp;...&nbsp;It will be all too easy for our somewhat artificial prosperity to collapse overnight when it is realized that the use of a few exciting words like information, entropy, redundancy, do not solve all our problems."<ref name="liberman-cite">Quoted in Liberman (2010).</ref> During the next decade, a combination of factors shut down the application of information theory to [[natural language processing]] (NLP) problems{{mdash}}in particular [[machine translation]]. One factor was the 1957 publication of [[Noam Chomsky]]'s ''[[Syntactic Structures]],'' which stated, "probabilistic models give no insight into the basic problems of syntactic structure".<ref name="young-cite">Quoted in Young (2010).</ref> This accorded well with the philosophy of the [[artificial intelligence]] research of the time, which promoted rule-based approaches. The other factor was the 1966 [[ALPAC]] report, which recommended that the government should stop funding research into machine translation. ALPAC chairman [[John R. Pierce|John Pierce]] later said that the field was filled with "mad inventors or untrustworthy engineers". He said that the underlying linguistic problems must be solved before attempts at NLP could be reasonably made. These elements essentially halted research in the field.<ref name=young/><ref name=liberman/>
[[Information theory]] was a fashionable scientific approach in the mid '50s.<ref name=jelinek2009/> However, pioneer [[Claude Shannon]] wrote in 1956 that this trendiness was dangerous. He said, "Our fellow scientists in many different fields, attracted by the fanfare and by the new avenues opened to scientific analysis, are using these ideas in their own problems&nbsp; ... &nbsp;It will be all too easy for our somewhat artificial prosperity to collapse overnight when it is realized that the use of a few exciting words like information, entropy, redundancy, do not solve all our problems."<ref name="liberman-cite">Quoted in Liberman (2010).</ref> During the next decade, a combination of factors shut down the application of information theory to [[natural language processing]] (NLP) problems{{mdash}}in particular [[machine translation]]. One factor was the 1957 publication of [[Noam Chomsky]]'s ''[[Syntactic Structures]],'' which stated, "probabilistic models give no insight into the basic problems of syntactic structure".<ref name="young-cite">Quoted in Young (2010).</ref> This accorded well with the philosophy of the [[artificial intelligence]] research of the time, which promoted rule-based approaches. The other factor was the 1966 [[ALPAC]] report, which recommended that the government should stop funding research into machine translation. ALPAC chairman [[John R. Pierce|John Pierce]] later said that the field was filled with "mad inventors or untrustworthy engineers". He said that the underlying linguistic problems must be solved before attempts at NLP could be reasonably made. These elements essentially halted research in the field.<ref name=young/><ref name=liberman/>


Jelinek had begun to develop an interest in linguistics after the immigration of his wife, who initially enrolled in the MIT linguistics program with the help of [[Roman Jakobson]]. Jelinek often accompanied her to Chomsky's lectures, and even discussed the possibility of changing orientation with his adviser. Fano was "really upset", and after the failure of his project with Hockett at Cornell, he did not return to this field of research until starting work at IBM.<ref name=jelinek2009/> The scope of research at IBM was considerably different from that of most other teams. According to Liberman, "While [Jelinek] was leading IBM’s effort to solve the general dictation problem during the decade or so following 1972, most other U.S. companies and academic researchers were working on very limited problems&nbsp;...&nbsp;or were staying out of the field entirely".<ref name=liberman/>
Jelinek had begun to develop an interest in linguistics after the immigration of his wife, who initially enrolled in the MIT linguistics program with the help of [[Roman Jakobson]]. Jelinek often accompanied her to Chomsky's lectures, and even discussed the possibility of changing orientation with his adviser. Fano was "really upset", and after the failure of his project with Hockett at Cornell, he did not return to this field of research until starting work at IBM.<ref name=jelinek2009/> The scope of research at IBM was considerably different from that of most other teams. According to [[Mark Liberman]], "While [Jelinek] was leading IBM's effort to solve the general dictation problem during the decade or so following 1972, most other U.S. companies and academic researchers were working on very limited problems&nbsp; ... &nbsp;or were staying out of the field entirely".<ref name=liberman/>


{{Quote box|align=right|width=26em|qalign=right|salign=left|quote = "He was not a pioneer of speech recognition, he was the pioneer of speech recognition."|source =Steve Young (2010)<ref name=young/>}}
{{quote box|align=right|width=26em|qalign=right|salign=left|quote = He was not a pioneer of speech recognition, he was the pioneer of speech recognition.|source = —Steve Young (2010)<ref name=young/>}}


Jelinek regarded [[speech recognition]] as an information theory problem{{mdash}}a [[noisy channel]], in this case the acoustic signal{{mdash}}which some observers considered a daring approach.<ref name=young/><ref name=jhu/><ref name=liberman/> The concept of [[perplexity]] was introduced in their first model,<ref name=jelinek2009/> New Raleigh Grammar, which was published in 1976 as the paper "Continuous Speech Recognition by Statistical Methods" in the journal ''[[Proceedings of the IEEE]]''.<ref name=young/> According to Young, the basic noisy channel approach "reduced the speech recognition problem to one of producing two statistical models".<ref name=young/> Whereas New Raleigh Grammar was a [[hidden Markov model]], their next model, called Tangora, was broader and involved [[n-gram]]s, specifically trigrams. Even though "it was obvious to everyone that this model was hopelessly impoverished", it was not improved upon until Jelinek presented another paper in 1999.<ref name="young"/> The same trigram approach was applied to [[phone (phonetics)|phones]] in single words. Although the identification of [[parts of speech]] turned out not to be very useful for speech recognition, tagging methods developed during these projects are now used in various NLP applications.<ref name=jelinek2009/>
Jelinek regarded [[speech recognition]] as an information theory problem{{mdash}}a [[noisy channel]], in this case the acoustic signal{{mdash}}which some observers considered a daring approach.<ref name=young/><ref name=jhu/><ref name=liberman/> The concept of [[perplexity]] was introduced in their first model,<ref name=jelinek2009/> New Raleigh Grammar, which was published in 1976 as the paper "Continuous Speech Recognition by Statistical Methods" in the journal ''[[Proceedings of the IEEE]]''.<ref name=young/> According to Young, the basic noisy channel approach "reduced the speech recognition problem to one of producing two statistical models".<ref name=young/> Whereas New Raleigh Grammar was a [[hidden Markov model]], their next model, called Tangora, was broader and involved [[n-gram]]s, specifically trigrams. Even though "it was obvious to everyone that this model was hopelessly impoverished", it was not improved upon until Jelinek presented another paper in 1999.<ref name="young"/> The same trigram approach was applied to [[phone (phonetics)|phones]] in single words. Although the identification of [[parts of speech]] turned out not to be very useful for speech recognition, tagging methods developed during these projects are now used in various NLP applications.<ref name=jelinek2009/>
Line 54: Line 50:
The incremental research techniques developed at IBM eventually became dominant in the field after [[DARPA]], in the mid-80s, returned to NLP research and imposed that [[methodology]] to participating teams, shared common goals, data, and precise evaluation metrics.<ref name=liberman/> The Continuous Speech Recognition Group's research, which required large amounts of data to train the algorithms, eventually led to the creation of the [[Linguistic Data Consortium]]. In the 1980s, although the broader problem of speech recognition remained unsolved, they sought to apply the methods developed to other problems; machine translation and stock value prediction were both seen as options. A group of IBM researchers went on to work for [[Renaissance Technologies]]. Jelinek wrote, "The performance of the Renaissance fund is legendary, but I have no idea whether any methods we pioneered at IBM have ever been used. My former colleagues will not tell me: theirs is a very hush-hush operation!"<ref name=jelinek2009/> Methods very similar to those developed for achieving speech recognition are at the base of most machine translation systems in use today. Observers have said that Pierce's paradigm, according to which engineering achievements in this area would be built on scientific progress, has been inverted, with the achievements in engineering being at the base of a number of scientific findings.<ref name=young/><ref name=liberman/>
The incremental research techniques developed at IBM eventually became dominant in the field after [[DARPA]], in the mid-80s, returned to NLP research and imposed that [[methodology]] to participating teams, shared common goals, data, and precise evaluation metrics.<ref name=liberman/> The Continuous Speech Recognition Group's research, which required large amounts of data to train the algorithms, eventually led to the creation of the [[Linguistic Data Consortium]]. In the 1980s, although the broader problem of speech recognition remained unsolved, they sought to apply the methods developed to other problems; machine translation and stock value prediction were both seen as options. A group of IBM researchers went on to work for [[Renaissance Technologies]]. Jelinek wrote, "The performance of the Renaissance fund is legendary, but I have no idea whether any methods we pioneered at IBM have ever been used. My former colleagues will not tell me: theirs is a very hush-hush operation!"<ref name=jelinek2009/> Methods very similar to those developed for achieving speech recognition are at the base of most machine translation systems in use today. Observers have said that Pierce's paradigm, according to which engineering achievements in this area would be built on scientific progress, has been inverted, with the achievements in engineering being at the base of a number of scientific findings.<ref name=young/><ref name=liberman/>


Jelinek's works won "best paper" awards on several occasions, and he received a number of company awards while he worked at IBM.<ref name=young/><ref name=vitae/> He received the Society Award for "outstanding technical contributions and leadership" from the [[IEEE Signal Processing Society]] for 1997,<ref name=SPSaward/> and the [[International Speech Communication Association|ESCA]] Medal for Scientific Achievement in 1999.<ref name=ESCA/> He was a recipient of an IEEE Third Millennium Medal in 2000, the [[European Language Resources Association]]'s first Antonio Zampolli Prize in 2004,<ref name=ELRA/> the 2005 [[IEEE James L. Flanagan Speech and Audio Processing Award|James L. Flanagan Speech and Audio Processing Award]],<ref name=IEEEaward/> and the 2009 Lifetime Achievement Award from the [[Association for Computational Linguistics]].<ref name=vitae/><ref name=jelinek2009/> He received an ''honoris causa'' Ph.D. from [[Charles University]] in 2001,<ref name=HonCaus/> was elected to the [[National Academy of Engineering]] in 2006 and was made one of twelve inaugural fellows of the [[International Speech Communication Association]] in 2008.<ref name=young/>
Jelinek's works won "best paper" awards on several occasions, and he received a number of company awards while he worked at IBM.<ref name=young/><ref name=vitae/> He received the Society Award for "outstanding technical contributions and leadership" from the [[IEEE Signal Processing Society]] for 1997,<ref name=SPSaward/> and the [[International Speech Communication Association|ESCA]] Medal for Scientific Achievement in 1999.<ref name=ESCA/> He was a recipient of an IEEE Third Millennium Medal in 2000, the European Language Resources Association's first Antonio Zampolli Prize in 2004,<ref name=ELRA/> the 2005 [[IEEE James L. Flanagan Speech and Audio Processing Award|James L. Flanagan Speech and Audio Processing Award]],<ref name=IEEEaward/> and the 2009 Lifetime Achievement Award from the [[Association for Computational Linguistics]].<ref name=vitae/><ref name=jelinek2009/> He received an ''honoris causa'' Ph.D. from [[Charles University]] in 2001,<ref name=HonCaus/> was elected to the [[National Academy of Engineering]] in 2006 and was made one of twelve inaugural fellows of the [[International Speech Communication Association]] in 2008.<ref name=young/>


== Selected publications ==
== Selected publications ==
{{refbegin}}
{{refbegin}}
*Jelinek, Frederick (1968). ''Probabilistic Information Theory: Discrete and memoryless models''. McGraw-Hill series in systems science. New York: McGraw-Hill. 689p. {{LCCN|68011611}} {{doi-inline|10.1109/TIT.1970.1054413|(review)}}
*Jelinek, Frederick (1968). ''Probabilistic Information Theory: Discrete and memoryless models''. McGraw-Hill series in systems science. New York: McGraw-Hill. 689p. {{LCCN|68011611}} {{doi-inline|10.1109/TIT.1970.1054413}} (review)
*———————- (1969). "Fast sequential decoding algorithm using a stack". ''IBM Journal of Research and Development'' '''13'''(6):675–685. {{doi|10.1147/rd.136.0675}}.
*———————- (1969). "Fast sequential decoding algorithm using a stack". ''IBM Journal of Research and Development'' '''13'''(6):675–685. {{doi|10.1147/rd.136.0675}}.
*———————- (1969). "Tree encoding of memoryless time-discrete sources with a fidelity criterion". ''IEEE Transactions on Information Theory'' '''15'''(5):584–590. {{doi|10.1109/TIT.1969.1054355}}. (received 1971 "Best Paper" award)
*———————- (1969). "Tree encoding of memoryless time-discrete sources with a fidelity criterion". ''IEEE Transactions on Information Theory'' '''15'''(5):584–590. {{doi|10.1109/TIT.1969.1054355}}. (received 1971 "Best Paper" award)
*Bahl, Lalit R.; [[John Cocke]], Frederick Jelinek, [[Josef Raviv]] (1974). "Optimal decoding of linear codes for minimizing symbol error rate". ''IEEE Transactions on Information Theory'' '''20'''(2):284–287. {{doi|10.1109/TIT.1974.1055186}}. (received Information Theory Society Golden Jubilee paper award)
*Bahl, Lalit R.; [[John Cocke (computer scientist)|John Cocke]], Frederick Jelinek, [[Josef Raviv]] (1974). "Optimal decoding of linear codes for minimizing symbol error rate". ''IEEE Transactions on Information Theory'' '''20'''(2):284–287. {{doi|10.1109/TIT.1974.1055186}}. (received Information Theory Society Golden Jubilee paper award)
*———————- (1976). "Continuous speech recognition by statistical methods". ''Proceedings of the IEEE'' '''64'''(4):532–556. {{doi|10.1109/PROC.1976.10159}}.
*———————- (1976). "Continuous speech recognition by statistical methods". ''Proceedings of the IEEE'' '''64'''(4):532–556. {{doi|10.1109/PROC.1976.10159}}.
*Brown, P.; J. Cocke, S. Della Pietra, V. Della Pietra, F. Jelinek, R, Mercer and P. Roossin (1988). [http://www.aclweb.org/anthology-new/C/C88/C88-1016.pdf "A statistical approach to language translation"]. In Dénes Vargha, ed. ''Coling 88: Proceedings of the 12th conference on Computational linguistics, volume 1''. Budapest: John Von Neumann society for computing sciences. pp.&nbsp;71–76. {{doi|10.3115/991635.991651}}. {{ISBN|963-8431-56-3}}.
*Brown, P.; J. Cocke, S. Della Pietra, V. Della Pietra, F. Jelinek, R, Mercer and P. Roossin (1988). [http://www.aclweb.org/anthology-new/C/C88/C88-1016.pdf "A statistical approach to language translation"] {{Webarchive|url=https://web.archive.org/web/20110807052934/http://www.aclweb.org/anthology-new/C/C88/C88-1016.pdf |date=August 7, 2011 }}. In Dénes Vargha, ed. ''Coling 88: Proceedings of the 12th conference on Computational linguistics, volume 1''. Budapest: John Von Neumann society for computing sciences. pp.&nbsp;71–76. {{doi|10.3115/991635.991651}}. {{ISBN|963-8431-56-3}}.
*———————- (1990). "Self-Organized Language Modeling for Speech Recognition". In Alex Waibel & Kai-Fu Lee, eds. ''Readings in speech recognition''. San Mateo: Morgan Kaufmann. 629p. {{ISBN|1-55860-124-4}}.
*———————- (1990). "Self-Organized Language Modeling for Speech Recognition". In Alex Waibel & Kai-Fu Lee, eds. ''Readings in speech recognition''. San Mateo: Morgan Kaufmann. 629p. {{ISBN|1-55860-124-4}}.
*———————-; John D. Lafferty and Robert L. Mercer. (1990) "Basic methods of probabilistic context free grammars". Technical Report RC 16374 (72684), IBM.
*———————-; John D. Lafferty and Robert L. Mercer. (1990) "Basic methods of probabilistic context free grammars". Technical Report RC 16374 (72684), IBM.
**Reprinted in Laface, Pietro; Renato De Mori (1992). ''Speech Recognition and Understanding: Recent advances, trends, and applications''. NATO ASI series. Series F, Computer and systems sciences, '''75'''. New York: Springer-Verlag. pp.&nbsp;345–360. {{ISBN|0-387-54032-6}}.
**Reprinted in Laface, Pietro; Renato De Mori (1992). ''Speech Recognition and Understanding: Recent advances, trends, and applications''. NATO ASI series. Series F, Computer and systems sciences, '''75'''. New York: Springer-Verlag. pp.&nbsp;345–360. {{ISBN|0-387-54032-6}}.
*———————- (1997). [https://books.google.com/books?id=1C9dzcJTWowC ''Statistical Methods for Speech Recognition'']. Cambridge, Mass.: MIT Press. 283p. {{ISBN|0-262-10066-5}}. [http://acl.ldc.upenn.edu/J/J99/J99-2009.pdf (review)] [http://www.questia.com/googleScholar.qst?docId=5002322101 (review 2)]
*———————- (1997). [https://books.google.com/books?id=1C9dzcJTWowC ''Statistical Methods for Speech Recognition'']. Cambridge, Mass.: MIT Press. 283p. {{ISBN|0-262-10066-5}}. [https://web.archive.org/web/20110629121956/http://acl.ldc.upenn.edu/J/J99/J99-2009.pdf (review)] [https://www.questia.com/googleScholar.qst?docId=5002322101 (review 2)]
* Chelba, Ciprian; Frederick Jelinek (2000). "Structured Language Modeling". ''Computer Speech & Language'' '''14'''(4):283–332. {{doi|10.1006/csla.2000.0147}} (received 2002 "Best Paper" award).
* Chelba, Ciprian; Frederick Jelinek (2000). "Structured Language Modeling". ''Computer Speech & Language'' '''14'''(4):283–332. {{doi|10.1006/csla.2000.0147}} (received 2002 "Best Paper" award).
**Expanded version of a presentation at NLDB'99. Klagenfurt, Austria, June 17–19, 1999 ({{arxiv|cs/0001023}}).
**Expanded version of a presentation at NLDB'99. Klagenfurt, Austria, June 17–19, 1999 ({{arxiv|cs/0001023}}).
* Xu, Peng; Ahmad Emami and Frederick Jelinek (2003). "[http://www.clsp.jhu.edu/people/xp/publications/emnlp03.pdf Training Connectionist Models for the Structured Language Model]". In Michael Collins and Mark Steedman, eds. ''EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing''. East Stroudsburg, Penn.: Association for Computational Linguistics. pp.&nbsp;160–167. {{ISBN|1-932432-13-2}}. {{doi|10.3115/1119355.1119376}}. (won "best paper" award)
* Xu, Peng; Ahmad Emami and Frederick Jelinek (2003). "[https://web.archive.org/web/20110719222555/http://www.clsp.jhu.edu/people/xp/publications/emnlp03.pdf Training Connectionist Models for the Structured Language Model]". In Michael Collins and Mark Steedman, eds. ''EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing''. East Stroudsburg, Penn.: Association for Computational Linguistics. pp.&nbsp;160–167. {{ISBN|1-932432-13-2}}. {{doi|10.3115/1119355.1119376}}. (won "best paper" award)
{{refend}}
{{refend}}


Line 82: Line 78:
<ref name=ESCA>{{cite web |title=1999 ESCA Medal for Scientific Achievement |url=http://www.isca-speech.org/awards/jelinek.html |year=1999 |publisher=International Speech Communication Association |accessdate=December 21, 2010|archiveurl=https://web.archive.org/web/20090802234755/http://www.isca-speech.org/awards/jelinek.html|archivedate=August 2, 2009}}</ref>
<ref name=ESCA>{{cite web |title=1999 ESCA Medal for Scientific Achievement |url=http://www.isca-speech.org/awards/jelinek.html |year=1999 |publisher=International Speech Communication Association |accessdate=December 21, 2010|archiveurl=https://web.archive.org/web/20090802234755/http://www.isca-speech.org/awards/jelinek.html|archivedate=August 2, 2009}}</ref>


<ref name=ELRA>{{cite web |title= In Honour of Prof. Antonio Zampolli |url=http://www.elra.info/Antonio-Zampolli-Prize.html |publisher=European Language Resources Association |accessdate=December 21, 2010}}</ref>
<ref name=ELRA>{{cite web |title=In Honour of Prof. Antonio Zampolli |url=http://www.elra.info/Antonio-Zampolli-Prize.html |publisher=European Language Resources Association |accessdate=December 21, 2010 |url-status=dead |archiveurl=https://web.archive.org/web/20110721200927/http://www.elra.info/Antonio-Zampolli-Prize.html |archivedate=July 21, 2011 |df=mdy-all }}</ref>


<ref name="hajic2010">{{cite news|last=Hajic |first=Jan |title=Prof. Frederick Jelinek, 1932–2010 |url=http://eacl.coli.uni-saarland.de/contents/newsletter12.html |work=EACL Newsletter |volume=12 |date=November 2010 |accessdate=December 19, 2010}}</ref>
<ref name="hajic2010">{{cite news|last=Hajic |first=Jan |title=Prof. Frederick Jelinek, 1932–2010 |url=http://eacl.coli.uni-saarland.de/contents/newsletter12.html |work=EACL Newsletter |volume=12 |date=November 2010 |accessdate=December 19, 2010}}</ref>
Line 90: Line 86:
<ref name=hirschberg>{{cite speech |last=Hirschberg |first=Julia |date=July 29, 1998 |title='Every time I fire a linguist, my performance goes up', and other myths of the statistical natural language processing revolution |location=15th National Conference on Artificial Intelligence, Madison, Wisconsin}} Invited speech.</ref>
<ref name=hirschberg>{{cite speech |last=Hirschberg |first=Julia |date=July 29, 1998 |title='Every time I fire a linguist, my performance goes up', and other myths of the statistical natural language processing revolution |location=15th National Conference on Artificial Intelligence, Madison, Wisconsin}} Invited speech.</ref>


<ref name=HonCaus>{{cite press release|title=Dr. h. c. prof. F. Jelinek |url=http://www.cuni.cz/UK-1227.html |publisher=Charles University in Pragues |date=November 22, 2001 |accessdate=December 17, 2010}}<!-- ALTERNATIVE ARCHIVE URL: http://web.archive.org/web/20020111014700/http%3A//certik.ruk.cuni.cz/press/aktualita/011109-03.html --></ref>
<ref name=HonCaus>{{cite press release|title=Dr. h. c. prof. F. Jelinek |url=http://www.cuni.cz/UK-1227.html |publisher=Charles University in Pragues |date=November 22, 2001 |accessdate=December 17, 2010}}<!-- ALTERNATIVE ARCHIVE URL: https://web.archive.org/web/20020111014700/http://certik.ruk.cuni.cz/press/aktualita/011109-03.html --></ref>


<ref name=IEEEaward>{{cite web |title=IEEE James L. Flanagan Speech and Audio Processing Award recipients |url=http://www.ieee.org/about/awards/bios/flanagan_recipients.html |publisher=IEEE |accessdate=December 21, 2010}}</ref>
<ref name=IEEEaward>{{cite web |title=IEEE James L. Flanagan Speech and Audio Processing Award recipients |url=http://www.ieee.org/about/awards/bios/flanagan_recipients.html |publisher=IEEE |accessdate=December 21, 2010}}</ref>
Line 96: Line 92:
<ref name=jelinek1997>{{cite book |last=Jelinek |first=Frederick |year=1997 |title=Statistical Methods for Speech Recognition |location=Cambridge, Mass. |publisher=MIT Press |url=https://books.google.com/books?id=1C9dzcJTWowC&pg=PR5 |isbn=0-262-10066-5 |page=v}}</ref>
<ref name=jelinek1997>{{cite book |last=Jelinek |first=Frederick |year=1997 |title=Statistical Methods for Speech Recognition |location=Cambridge, Mass. |publisher=MIT Press |url=https://books.google.com/books?id=1C9dzcJTWowC&pg=PR5 |isbn=0-262-10066-5 |page=v}}</ref>


<ref name=jelinek2001>{{cite speech |last=Jelinek |first=Frederick |title=How I Got Here |url=http://www.clsp.jhu.edu/people/jelinek/promoce.html |archiveurl=https://web.archive.org/web/20080316183233/http://www.clsp.jhu.edu/people/jelinek/promoce.html |archivedate=2008-03-16 |date=November 22, 2001 |location=Charles University, Prague, Czechoslovakia |accessdate=December 17, 2010}} Honoris causa degree acceptance speech.<!--Archive: http://web.archive.org/web/20080316183233/http://www.clsp.jhu.edu/people/jelinek/promoce.html--></ref>
<ref name=jelinek2001>{{cite speech |last=Jelinek |first=Frederick |title=How I Got Here |url=http://www.clsp.jhu.edu/people/jelinek/promoce.html |archiveurl=https://web.archive.org/web/20080316183233/http://www.clsp.jhu.edu/people/jelinek/promoce.html |archivedate=2008-03-16 |date=November 22, 2001 |location=Charles University, Prague, Czechoslovakia |accessdate=December 17, 2010}} Honoris causa degree acceptance speech.<!--Archive: https://web.archive.org/web/20080316183233/http://www.clsp.jhu.edu/people/jelinek/promoce.html--></ref>


<ref name=jelinek2009>{{cite journal| last=Jelinek |first=Fred |date=December 2009 |title=The Dawn of Statistical ASR and MT |journal=Computational Linguistics |url=http://www.mitpressjournals.org/doi/pdf/10.1162/coli.2009.35.4.35401 |accessdate=December 16, 2010 |volume=35 |issue=4|pages=483–494 |doi=10.1162/coli.2009.35.4.35401}}</ref>
<ref name=jelinek2009>{{cite journal| last=Jelinek |first=Fred |s2cid=1486422 |date=December 2009 |title=The Dawn of Statistical ASR and MT |journal=Computational Linguistics |volume=35 |issue=4|pages=483–494 |doi=10.1162/coli.2009.35.4.35401|doi-access=free }}</ref>


<ref name=jhu>{{cite news |last=Sneiderman |first=Phil |title=Frederick Jelinek, 77, pioneer in speech and text understanding technology |url=http://gazette.jhu.edu/2010/09/20/frederick-jelinek-77-pioneer-in-speech-and-text-understanding-technology/ |date=September 20, 2010 |work=The JHU Gazette |publisher=Johns Hopkins University |accessdate=December 16, 2010}}</ref>
<ref name=jhu>{{cite news |last=Sneiderman |first=Phil |title=Frederick Jelinek, 77, pioneer in speech and text understanding technology |url=http://gazette.jhu.edu/2010/09/20/frederick-jelinek-77-pioneer-in-speech-and-text-understanding-technology/ |date=September 20, 2010 |work=The JHU Gazette |publisher=Johns Hopkins University |accessdate=December 16, 2010}}</ref>


<ref name="jurafsky2009">{{cite book |last=Jurafsky |first=Daniel |author2=James H. Martin |year=2009 |title= Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition |edition=2nd |series=Prentice Hall series in artificial intelligence |location=Upper Saddle, New Jersey |publisher=Prentice Hall |isbn=0-13-187321-0 |page=83}}</ref>
<ref name="jurafsky2009">{{cite book |last=Jurafsky |first=Daniel |author2=James H. Martin |year=2009 |title= Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition |url=https://archive.org/details/speechlanguagepr00jura_180 |url-access=limited |edition=2nd |series=Prentice Hall series in artificial intelligence |location=Upper Saddle, New Jersey |publisher=Prentice Hall |isbn=978-0-13-187321-6 |page=[https://archive.org/details/speechlanguagepr00jura_180/page/n107 83]}}</ref>


<ref name=liberman>{{cite journal| last=Liberman |first=Mark |date=December 2010 |authorlink=Mark Liberman |title=Obituary: Fred Jelinek |journal=Computational Linguistics|volume=36 |issue=4|pages=595–599 |doi=10.1162/coli_a_00032 |url=http://www.mitpressjournals.org/doi/pdf/10.1162/coli_a_00032 |accessdate=December 16, 2010}}</ref>
<ref name=liberman>{{cite journal| last=Liberman |first=Mark |date=December 2010 |authorlink=Mark Liberman |title=Obituary: Fred Jelinek |journal=Computational Linguistics|volume=36 |issue=4|pages=595–599 |doi=10.1162/coli_a_00032 |doi-access=free }}</ref>


<ref name=lohr2010>{{cite news |last=Lohr |first=Steve |title=Frederick Jelinek, Who Gave Machines the Key to Human Speech, Dies at 77 |url=https://www.nytimes.com/2010/09/24/business/24jelinek.html |date=September 24, 2010 |newspaper=New York Times |page=B10 |accessdate=December 16, 2010}}</ref>
<ref name=lohr2010>{{cite news |last=Lohr |first=Steve |title=Frederick Jelinek, Who Gave Machines the Key to Human Speech, Dies at 77 |url=https://www.nytimes.com/2010/09/24/business/24jelinek.html |date=September 24, 2010 |newspaper=New York Times |page=B10 |accessdate=December 16, 2010}}</ref>
Line 110: Line 106:
<ref name="moore2005">{{cite conference |last=Moore |first=Roger K. |year=2005 |title=Results from a Survey of Attendees at ASRU 1997 and 2003 |conference=INTERSPEECH-2005 |location= Lisbon, September 4–8, 2005 |url=http://staffwww.dcs.shef.ac.uk/people/R.Moore/publications/Results%20from%20a%20Survey%20of%20Attendees%20at%20ASRU%201997%20and%202003.pdf |archiveurl=https://web.archive.org/web/20110720235500/http://staffwww.dcs.shef.ac.uk/people/R.Moore/publications/Results%20from%20a%20Survey%20of%20Attendees%20at%20ASRU%201997%20and%202003.pdf |archivedate=July 20, 2011 }}</ref>
<ref name="moore2005">{{cite conference |last=Moore |first=Roger K. |year=2005 |title=Results from a Survey of Attendees at ASRU 1997 and 2003 |conference=INTERSPEECH-2005 |location= Lisbon, September 4–8, 2005 |url=http://staffwww.dcs.shef.ac.uk/people/R.Moore/publications/Results%20from%20a%20Survey%20of%20Attendees%20at%20ASRU%201997%20and%202003.pdf |archiveurl=https://web.archive.org/web/20110720235500/http://staffwww.dcs.shef.ac.uk/people/R.Moore/publications/Results%20from%20a%20Survey%20of%20Attendees%20at%20ASRU%201997%20and%202003.pdf |archivedate=July 20, 2011 }}</ref>


<ref name="palmer1990">{{cite journal |last=Palmer |first=Martha |author2=[[Tim Finin]] |year=1990 |title=Report on the Workshop on the Evaluation of Natural Language Processing Systems |journal=[[Computational Linguistics (journal)|Computational Linguistics]] |volume=16 |issue=1 |pages=171–185 |url=http://www.cs.umbc.edu/~finin//papers/acl90.pdf}}</ref>
<ref name="palmer1990">{{cite journal |last=Palmer |first=Martha |author2=Tim Finin |year=1990 |title=Report on the Workshop on the Evaluation of Natural Language Processing Systems |journal=[[Computational Linguistics (journal)|Computational Linguistics]] |volume=16 |issue=1 |pages=171–185 |url=http://www.cs.umbc.edu/~finin//papers/acl90.pdf|author2-link=Tim Finin }}</ref>


<ref name="rejzek">{{cite news |last=Rejžek |first=Jan |title=Nekrolog |url=http://www.lidovky.cz/ln_noviny.asp?r=ln_noviny&c=A100917_000081_ln_noviny_sko |date=September 17, 2010 |work=[[Lidové noviny]] |accessdate=December 17, 2010}}</ref>
<ref name="rejzek">{{cite news |last=Rejžek |first=Jan |title=Nekrolog |url=http://www.lidovky.cz/ln_noviny.asp?r=ln_noviny&c=A100917_000081_ln_noviny_sko |date=September 17, 2010 |work=[[Lidové noviny]] |accessdate=December 17, 2010}}</ref>
Line 118: Line 114:
<ref name=willoughby>{{cite news |last=Willoughby |first=Ian |title=Milena Jelinek – member of golden generation of Czech filmmakers now teaching screenwriting at Columbia |url=http://www.radio.cz/en/section/one-on-one/milena-jelinek-member-of-golden-generation-of-czech-filmmakers-now-teaching-screenwriting-at-columbia |date=June 9, 2008 |work=One on One |publisher=[[Radio Prague]] |accessdate=February 1, 2014}}</ref>
<ref name=willoughby>{{cite news |last=Willoughby |first=Ian |title=Milena Jelinek – member of golden generation of Czech filmmakers now teaching screenwriting at Columbia |url=http://www.radio.cz/en/section/one-on-one/milena-jelinek-member-of-golden-generation-of-czech-filmmakers-now-teaching-screenwriting-at-columbia |date=June 9, 2008 |work=One on One |publisher=[[Radio Prague]] |accessdate=February 1, 2014}}</ref>


<ref name="young">{{Cite journal |last=Young |first=Steve |date=November 2010 |title=Frederick Jelinek 1932–2010: The Pioneer of Speech Recognition Technology |url=http://www.signalprocessingsociety.org/technical-committees/list/sl-tc/spl-nl/2010-11/jelinek/ |journal=Speech and Language Processing Technical Committee Newsletter |publisher=[[IEEE Signal Processing Society]] |accessdate=December 16, 2010}} Adapted from a speech given in 2006.</ref>
<ref name="young">{{Cite journal |last=Young |first=Steve |date=November 2010 |title=Frederick Jelinek 1932–2010: The Pioneer of Speech Recognition Technology |url=http://www.signalprocessingsociety.org/technical-committees/list/sl-tc/spl-nl/2010-11/jelinek/ |journal=Speech and Language Processing Technical Committee Newsletter |publisher=[[IEEE Signal Processing Society]] |accessdate=December 16, 2010 |url-status=dead |archiveurl=https://web.archive.org/web/20110728025129/http://www.signalprocessingsociety.org/technical-committees/list/sl-tc/spl-nl/2010-11/jelinek/ |archivedate=July 28, 2011 |df=mdy-all }} Adapted from a speech given in 2006.</ref>


<ref name=vitae>{{cite web |first=Jelinek |last=Jan |title=Curriculum Vitae |url=http://www.clsp.jhu.edu/~jelinek/documents/Jelinekvitae2006fj_000.doc |archiveurl=https://web.archive.org/web/20060903200951/http://www.clsp.jhu.edu/~jelinek/documents/Jelinekvitae2006fj_000.doc |archivedate=2006-09-03 |date=June 13, 2006 |accessdate=December 17, 2010}}</ref>}}
<ref name=vitae>{{cite web |first=Jelinek |last=Jan |title=Curriculum Vitae |url=http://www.clsp.jhu.edu/~jelinek/documents/Jelinekvitae2006fj_000.doc |archiveurl=https://web.archive.org/web/20060903200951/http://www.clsp.jhu.edu/~jelinek/documents/Jelinekvitae2006fj_000.doc |archivedate=2006-09-03 |date=June 13, 2006 |accessdate=December 17, 2010}}</ref>}}
Line 124: Line 120:
== External links ==
== External links ==
{{wikiquote|Fred Jelinek}}
{{wikiquote|Fred Jelinek}}
* [http://www.clsp.jhu.edu/people/jelinek/ Institutional page] at Johns Hopkins university
* [https://web.archive.org/web/20060405210212/http://www.clsp.jhu.edu/people/jelinek/ Institutional page] at Johns Hopkins university
{{s-start}}
{{s-start}}
{{S-bef| before=[[Fumitada Itakura]]}}
{{S-bef| before=[[Fumitada Itakura]]}}
Line 145: Line 141:
[[Category:1932 births]]
[[Category:1932 births]]
[[Category:2010 deaths]]
[[Category:2010 deaths]]
[[Category:American people of Bohemian descent]]
[[Category:American people of Czech-Jewish descent]]
[[Category:American people of Czech-Jewish descent]]
[[Category:Statistical natural language processing]]
[[Category:Statistical natural language processing]]
[[Category:Czechoslovak emigrants to the United States]]
[[Category:Czechoslovak emigrants to the United States]]
[[Category:Czech Jews]]
[[Category:Cornell University faculty]]
[[Category:Cornell University faculty]]
[[Category:Harvard University faculty]]
[[Category:Harvard University faculty]]
[[Category:Johns Hopkins University faculty]]
[[Category:Johns Hopkins University faculty]]
[[Category:Massachusetts Institute of Technology alumni]]
[[Category:MIT School of Engineering alumni]]

[[Category:IBM Research computer scientists]]
[[Category:IBM Research computer scientists]]
[[Category:IBM employees]]
[[Category:IBM employees]]
[[Category:Members of the United States National Academy of Engineering]]
[[Category:Members of the United States National Academy of Engineering]]
[[Category:People from Kladno]]
[[Category:Speech processing researchers]]
[[Category:Natural language processing researchers]]

Latest revision as of 20:48, 8 November 2023

Frederick Jelinek
Born
Bedřich Jelínek

(1932-11-18)November 18, 1932
DiedSeptember 14, 2010(2010-09-14) (aged 77)
Baltimore, United States
CitizenshipAmerican
Alma materMassachusetts Institute of Technology
Known forAdvancement of natural language processing techniques
SpouseMilena Jelinek
Awards
Scientific career
FieldsInformation theory, natural language processing
InstitutionsCornell University, IBM Research, Johns Hopkins University
Doctoral advisorRobert Fano
Notable studentsNeil Sloane

Frederick Jelinek (18 November 1932 – 14 September 2010) was a Czech-American researcher in information theory, automatic speech recognition, and natural language processing. He is well known for his oft-quoted statement, "Every time I fire a linguist, the performance of the speech recognizer goes up".[note 1]

Jelinek was born in Czechoslovakia before World War II and emigrated with his family to the United States in the early years of the communist regime. He studied engineering at the Massachusetts Institute of Technology and taught for 10 years at Cornell University before accepting a job at IBM Research. In 1961, he married Czech screenwriter Milena Jelinek. At IBM, his team advanced approaches to computer speech recognition and machine translation. After IBM, he went to head the Center for Language and Speech Processing at Johns Hopkins University for 17 years, where he was still working on the day he died.

Personal life[edit]

Jelinek was born on November 18, 1932, as Bedřich Jelínek[6] in Kladno to Vilém and Trude Jelínek.[7] His father was Jewish; his mother was born in Switzerland to Czech Catholic parents and had converted to Judaism.[8][9] Jelínek senior, a dentist, had planned early to escape Nazi occupation and flee to England; he arranged for a passport, visa, and the shipping of his dentistry materials. The couple planned to send their son to an English private school. However, Vilém decided to stay at the last minute and was eventually sent to the Theresienstadt concentration camp,[10] where he died in 1945.[7][9] The family was forced to move to Prague in 1941, but Frederick, his sister and mother—thanks to the latter's background—escaped the concentration camps.[9]

It is generally believed that scientific talent reveals itself in early youth. ... This was certainly not my case. I somehow slid into my scientific profession. My mother wished for me to become a physician, just like my father. ... I myself wanted to be a lawyer, defender of the unjustly accused. But my career is the result of political circumstances, academic possibilities, and lucky accidents.

—Talking about his life in a 2001 speech.[10]

After the war, Jelinek entered in the gymnasium, despite having missed several years of schooling because education of Jewish children had been forbidden since 1942. His mother, anxious that her son should get a good education, made great efforts for their emigration,[note 2] especially when it became clear he would not be allowed to even attempt the graduation examination. His mother hoped her son would become a physician, but Jelinek dreamed of being a lawyer. He studied engineering in evening classes at the City College of New York and received stipends from the National Committee for a Free Europe that allowed him to study at the Massachusetts Institute of Technology. About his choice of specialty, he said: "Fortunately, to electrical engineering there belonged a discipline whose aim was not the construction of physical systems: the theory of information".[10] He obtained his Ph.D. in 1962, with Robert Fano as his adviser.[11][12]

In 1957, Jelinek paid an unexpected visit to Prague. He had been in Vienna and applied for a visa, hoping to see his former acquaintances again. He met with his old friend Miloš Forman, who introduced him to film student Milena Tobolová—whose screenplay had been the basis for the movie Easy Life (Snadný život).[13][14] His flight back to the U.S. had a stopover in Munich, during which he called her to propose.[9] Tobolová was considered a dissident and the authorities were not happy with her film.[14] Jelinek asked for help from Jerome Wiesner and Cyrus Eaton, the latter who lobbied Nikita Khrushchev.[13] Following the inauguration of John F. Kennedy, a group of Czech dissidents were allowed to emigrate in January 1961. Thanks to the lobbying, the future Milena Jelinek was one of them.[9][13]

After completing his graduate studies, Jelinek, who had developed an interest in linguistics, had plans to work with Charles F. Hockett at Cornell University. However these fell through and during the next ten years he continued to study information theory.[10] Having previously worked at IBM during a sabbatical, he began full-time work there in 1972—at first on leave for Cornell, but permanently from 1974. He remained there for over twenty years. Although at first he had been offered a regular research job, upon his arrival he learned that Josef Raviv had recently been promoted to head of the newly opened IBM Haifa Research Laboratory, and became head of the Continuous Speech Recognition group at the Thomas J. Watson Research Center.[10][12] Despite his team's successes in this area, Jelinek's work remained little known in his home country because Czech scientists were not allowed to participate in key conferences.[13]

After the 1989 fall of communism, Jelinek helped establish scientific relationships, regularly visiting to lecture and helping to persuade IBM to establish a computing centre at Charles University.[8][10][15] In 1993, he retired from IBM and went to Johns Hopkins University's Center for Language and Speech Processing, where he was director and Julian Sinclair Smith Professor of Electrical and Computer Engineering.[11][16] He was still working there at the time of his death; Jelinek died of a heart attack at the close of an otherwise normal workday in mid-September 2010.[9][16] He was survived by his wife, daughter and son, sister, stepsister, and three grandchildren, including Sophie Gold Jelinek.

Research and legacy[edit]

Information theory was a fashionable scientific approach in the mid '50s.[12] However, pioneer Claude Shannon wrote in 1956 that this trendiness was dangerous. He said, "Our fellow scientists in many different fields, attracted by the fanfare and by the new avenues opened to scientific analysis, are using these ideas in their own problems  ...  It will be all too easy for our somewhat artificial prosperity to collapse overnight when it is realized that the use of a few exciting words like information, entropy, redundancy, do not solve all our problems."[17] During the next decade, a combination of factors shut down the application of information theory to natural language processing (NLP) problems—in particular machine translation. One factor was the 1957 publication of Noam Chomsky's Syntactic Structures, which stated, "probabilistic models give no insight into the basic problems of syntactic structure".[18] This accorded well with the philosophy of the artificial intelligence research of the time, which promoted rule-based approaches. The other factor was the 1966 ALPAC report, which recommended that the government should stop funding research into machine translation. ALPAC chairman John Pierce later said that the field was filled with "mad inventors or untrustworthy engineers". He said that the underlying linguistic problems must be solved before attempts at NLP could be reasonably made. These elements essentially halted research in the field.[5][19]

Jelinek had begun to develop an interest in linguistics after the immigration of his wife, who initially enrolled in the MIT linguistics program with the help of Roman Jakobson. Jelinek often accompanied her to Chomsky's lectures, and even discussed the possibility of changing orientation with his adviser. Fano was "really upset", and after the failure of his project with Hockett at Cornell, he did not return to this field of research until starting work at IBM.[12] The scope of research at IBM was considerably different from that of most other teams. According to Mark Liberman, "While [Jelinek] was leading IBM's effort to solve the general dictation problem during the decade or so following 1972, most other U.S. companies and academic researchers were working on very limited problems  ...  or were staying out of the field entirely".[19]

He was not a pioneer of speech recognition, he was the pioneer of speech recognition.

—Steve Young (2010)[5]

Jelinek regarded speech recognition as an information theory problem—a noisy channel, in this case the acoustic signal—which some observers considered a daring approach.[5][16][19] The concept of perplexity was introduced in their first model,[12] New Raleigh Grammar, which was published in 1976 as the paper "Continuous Speech Recognition by Statistical Methods" in the journal Proceedings of the IEEE.[5] According to Young, the basic noisy channel approach "reduced the speech recognition problem to one of producing two statistical models".[5] Whereas New Raleigh Grammar was a hidden Markov model, their next model, called Tangora, was broader and involved n-grams, specifically trigrams. Even though "it was obvious to everyone that this model was hopelessly impoverished", it was not improved upon until Jelinek presented another paper in 1999.[5] The same trigram approach was applied to phones in single words. Although the identification of parts of speech turned out not to be very useful for speech recognition, tagging methods developed during these projects are now used in various NLP applications.[12]

The incremental research techniques developed at IBM eventually became dominant in the field after DARPA, in the mid-80s, returned to NLP research and imposed that methodology to participating teams, shared common goals, data, and precise evaluation metrics.[19] The Continuous Speech Recognition Group's research, which required large amounts of data to train the algorithms, eventually led to the creation of the Linguistic Data Consortium. In the 1980s, although the broader problem of speech recognition remained unsolved, they sought to apply the methods developed to other problems; machine translation and stock value prediction were both seen as options. A group of IBM researchers went on to work for Renaissance Technologies. Jelinek wrote, "The performance of the Renaissance fund is legendary, but I have no idea whether any methods we pioneered at IBM have ever been used. My former colleagues will not tell me: theirs is a very hush-hush operation!"[12] Methods very similar to those developed for achieving speech recognition are at the base of most machine translation systems in use today. Observers have said that Pierce's paradigm, according to which engineering achievements in this area would be built on scientific progress, has been inverted, with the achievements in engineering being at the base of a number of scientific findings.[5][19]

Jelinek's works won "best paper" awards on several occasions, and he received a number of company awards while he worked at IBM.[5][11] He received the Society Award for "outstanding technical contributions and leadership" from the IEEE Signal Processing Society for 1997,[20] and the ESCA Medal for Scientific Achievement in 1999.[21] He was a recipient of an IEEE Third Millennium Medal in 2000, the European Language Resources Association's first Antonio Zampolli Prize in 2004,[22] the 2005 James L. Flanagan Speech and Audio Processing Award,[23] and the 2009 Lifetime Achievement Award from the Association for Computational Linguistics.[11][12] He received an honoris causa Ph.D. from Charles University in 2001,[24] was elected to the National Academy of Engineering in 2006 and was made one of twelve inaugural fellows of the International Speech Communication Association in 2008.[5]

Selected publications[edit]

  • Jelinek, Frederick (1968). Probabilistic Information Theory: Discrete and memoryless models. McGraw-Hill series in systems science. New York: McGraw-Hill. 689p. LCCN 68-11611 [1] (review)
  • ———————- (1969). "Fast sequential decoding algorithm using a stack". IBM Journal of Research and Development 13(6):675–685. doi:10.1147/rd.136.0675.
  • ———————- (1969). "Tree encoding of memoryless time-discrete sources with a fidelity criterion". IEEE Transactions on Information Theory 15(5):584–590. doi:10.1109/TIT.1969.1054355. (received 1971 "Best Paper" award)
  • Bahl, Lalit R.; John Cocke, Frederick Jelinek, Josef Raviv (1974). "Optimal decoding of linear codes for minimizing symbol error rate". IEEE Transactions on Information Theory 20(2):284–287. doi:10.1109/TIT.1974.1055186. (received Information Theory Society Golden Jubilee paper award)
  • ———————- (1976). "Continuous speech recognition by statistical methods". Proceedings of the IEEE 64(4):532–556. doi:10.1109/PROC.1976.10159.
  • Brown, P.; J. Cocke, S. Della Pietra, V. Della Pietra, F. Jelinek, R, Mercer and P. Roossin (1988). "A statistical approach to language translation" Archived August 7, 2011, at the Wayback Machine. In Dénes Vargha, ed. Coling 88: Proceedings of the 12th conference on Computational linguistics, volume 1. Budapest: John Von Neumann society for computing sciences. pp. 71–76. doi:10.3115/991635.991651. ISBN 963-8431-56-3.
  • ———————- (1990). "Self-Organized Language Modeling for Speech Recognition". In Alex Waibel & Kai-Fu Lee, eds. Readings in speech recognition. San Mateo: Morgan Kaufmann. 629p. ISBN 1-55860-124-4.
  • ———————-; John D. Lafferty and Robert L. Mercer. (1990) "Basic methods of probabilistic context free grammars". Technical Report RC 16374 (72684), IBM.
    • Reprinted in Laface, Pietro; Renato De Mori (1992). Speech Recognition and Understanding: Recent advances, trends, and applications. NATO ASI series. Series F, Computer and systems sciences, 75. New York: Springer-Verlag. pp. 345–360. ISBN 0-387-54032-6.
  • ———————- (1997). Statistical Methods for Speech Recognition. Cambridge, Mass.: MIT Press. 283p. ISBN 0-262-10066-5. (review) (review 2)
  • Chelba, Ciprian; Frederick Jelinek (2000). "Structured Language Modeling". Computer Speech & Language 14(4):283–332. doi:10.1006/csla.2000.0147 (received 2002 "Best Paper" award).
    • Expanded version of a presentation at NLDB'99. Klagenfurt, Austria, June 17–19, 1999 (arXiv:cs/0001023).
  • Xu, Peng; Ahmad Emami and Frederick Jelinek (2003). "Training Connectionist Models for the Structured Language Model". In Michael Collins and Mark Steedman, eds. EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing. East Stroudsburg, Penn.: Association for Computational Linguistics. pp. 160–167. ISBN 1-932432-13-2. doi:10.3115/1119355.1119376. (won "best paper" award)

References[edit]

Notes
  1. ^ Although its fame and iconic status are undisputed (it was for example used as the title of a 1998 speech by Julia Hirschberg),[1] its context is unknown and its specific wording and dating are unclear. According to Daniel Jurafsky and James H. Martin, Jelinek himself recalled the quote as "Anytime a linguist leaves the group the recognition rate goes up" and dated it to December 1988 (Wayne, Pennsylvania), further noting that the quote did not appear in the published proceeding,[2][3] whereas Roger K. Moore gave the wording as "Every time we fire a phonetician/linguist, the performance of our system goes up" and dated it to an IEEE Automatic Speech Recognition and Understanding workshop held in 1985.[4] According to Steve Young, "the story goes that one day one of his linguists resigned, and Fred decided to replace him not by another linguist but by an engineer. A little while later, Fred noticed that the performance of his system improved significantly. So he encouraged another linguist to find alternative employment, and sure enough performance improved again."[5]
  2. ^ As he put it, "she didn't want to emulate my father's big mistake."
References
  1. ^ Hirschberg, Julia (July 29, 1998). 'Every time I fire a linguist, my performance goes up', and other myths of the statistical natural language processing revolution (Speech). 15th National Conference on Artificial Intelligence, Madison, Wisconsin.{{cite speech}}: CS1 maint: location (link) Invited speech.
  2. ^ Jurafsky, Daniel; James H. Martin (2009). Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition. Prentice Hall series in artificial intelligence (2nd ed.). Upper Saddle, New Jersey: Prentice Hall. p. 83. ISBN 978-0-13-187321-6.
  3. ^ Palmer, Martha; Tim Finin (1990). "Report on the Workshop on the Evaluation of Natural Language Processing Systems" (PDF). Computational Linguistics. 16 (1): 171–185.
  4. ^ Moore, Roger K. (2005). Results from a Survey of Attendees at ASRU 1997 and 2003 (PDF). INTERSPEECH-2005. Lisbon, September 4–8, 2005. Archived from the original (PDF) on July 20, 2011.{{cite conference}}: CS1 maint: location (link)
  5. ^ a b c d e f g h i j Young, Steve (November 2010). "Frederick Jelinek 1932–2010: The Pioneer of Speech Recognition Technology". Speech and Language Processing Technical Committee Newsletter. IEEE Signal Processing Society. Archived from the original on July 28, 2011. Retrieved December 16, 2010. Adapted from a speech given in 2006.
  6. ^ Rejžek, Jan (September 17, 2010). "Nekrolog". Lidové noviny. Retrieved December 17, 2010.
  7. ^ a b Jelinek, Frederick (1997). Statistical Methods for Speech Recognition. Cambridge, Mass.: MIT Press. p. v. ISBN 0-262-10066-5.
  8. ^ a b Hajic, Jan (November 2010). "Prof. Frederick Jelinek, 1932–2010". EACL Newsletter. Vol. 12. Retrieved December 19, 2010.
  9. ^ a b c d e f Lohr, Steve (September 24, 2010). "Frederick Jelinek, Who Gave Machines the Key to Human Speech, Dies at 77". New York Times. p. B10. Retrieved December 16, 2010.
  10. ^ a b c d e f Jelinek, Frederick (November 22, 2001). How I Got Here (Speech). Charles University, Prague, Czechoslovakia. Archived from the original on March 16, 2008. Retrieved December 17, 2010. Honoris causa degree acceptance speech.
  11. ^ a b c d Jan, Jelinek (June 13, 2006). "Curriculum Vitae". Archived from the original on September 3, 2006. Retrieved December 17, 2010.
  12. ^ a b c d e f g h Jelinek, Fred (December 2009). "The Dawn of Statistical ASR and MT". Computational Linguistics. 35 (4): 483–494. doi:10.1162/coli.2009.35.4.35401. S2CID 1486422.
  13. ^ a b c d Hershenson, Roberta (December 31, 1989). "Czech Couple Keep Eye on Homeland". New York Times. Retrieved December 17, 2010.
  14. ^ a b Willoughby, Ian (June 9, 2008). "Milena Jelinek – member of golden generation of Czech filmmakers now teaching screenwriting at Columbia". One on One. Radio Prague. Retrieved February 1, 2014.
  15. ^ Dresser, Michael (September 19, 2010). "Frederick Jelinek, speech recognition pioneer, dies". The Baltimore Sun. Retrieved December 16, 2010.
  16. ^ a b c Sneiderman, Phil (September 20, 2010). "Frederick Jelinek, 77, pioneer in speech and text understanding technology". The JHU Gazette. Johns Hopkins University. Retrieved December 16, 2010.
  17. ^ Quoted in Liberman (2010).
  18. ^ Quoted in Young (2010).
  19. ^ a b c d e Liberman, Mark (December 2010). "Obituary: Fred Jelinek". Computational Linguistics. 36 (4): 595–599. doi:10.1162/coli_a_00032.
  20. ^ "Society Award". IEEE Signal Processing Society. Retrieved December 21, 2010.
  21. ^ "1999 ESCA Medal for Scientific Achievement". International Speech Communication Association. 1999. Archived from the original on August 2, 2009. Retrieved December 21, 2010.
  22. ^ "In Honour of Prof. Antonio Zampolli". European Language Resources Association. Archived from the original on July 21, 2011. Retrieved December 21, 2010.
  23. ^ "IEEE James L. Flanagan Speech and Audio Processing Award recipients". IEEE. Retrieved December 21, 2010.
  24. ^ "Dr. h. c. prof. F. Jelinek" (Press release). Charles University in Pragues. November 22, 2001. Retrieved December 17, 2010.

External links[edit]

Preceded by IEEE Signal Processing Society Award
1997
Succeeded by
Preceded by
Mario Rossi
ISCA Medal for Scientific Achievement
1999
Succeeded by
Louis Pols
Preceded by IEEE James L. Flanagan
Speech and Audio Processing Award

2005
Succeeded by
Preceded by ACL Lifetime Achievement Award
2009
Succeeded by