Metrika

  • citati u SCIndeksu: 0
  • citati u CrossRef-u:0
  • citati u Google Scholaru:[]
  • posete u poslednjih 30 dana:0
  • preuzimanja u poslednjih 30 dana:0

Sadržaj

članak: 8 od 42  
Back povratak na rezultate
2012, vol. 11, br. 44, str. 33-40
Nov metod ekstrakcije informacija baziran na transduktorima
aUniverzitet u Beogradu, Poljoprivredni fakultet
bUniverzitet u Beogradu, Matematički fakultet

e-adresasvesna@agrif.bg.ac.rs, paja@agrif.bg.ac.rs, stasa@matf.bg.ac.rs
Ključne reči: ekstrakcija informacija; obrada prirodnih jezika; strukturiranje podataka
Sažetak
U radu je dat osvrt na oblast ekstrakcije informacije, čije su metode i tehnike nezaobilazne u pretrazi i upravljanju informacijama. Ova oblast u sebi sadrži tehnike drugih oblasti matematike i računarstva, kao što su obrada prirodnih jezika, teorija formalnih jezika, verovatnoća i statistika. Uzimajući u obzir sve specifičnosti zahteva za informacijom i tekstualnih resursa iz kojih se izdvajanje vrši, razvijen je i u radu prikazan nov metod za ekstrakciju informacija nazvan Dvofazni metod baziran na transduktorima. Predstavljena je arhitektura sistema koji implementira ovaj metod kao i primer konkretne primene. Poseban značaj ovaj metod ima u situacijama kada ne postoje već pripremljeni tekstualni korpusi, neophodni za primenu postojećih metoda, posebno onih baziranih na verovatnoći i statistici.
Reference
Allen, J. (1995) Natural language understanding. Redwood City: Benjamin-Cummings
Appelt, D.E., Hobbs, J., Bear, J., Israel, D., Tyson.M. (1993) FASTUS: A finite-state processor for Information Extraction from real world text. u: Proceedings of the 13th International Joint Conference on Artificial Intelligence, pages 1172-1178
Baumgartner, R., Flesca, S., Gottlob, G. (2001) Visual web information extraction with Lixto. u: International conference on very large databases (XXVI), proceedings, str. 119-128
Bilofsky, H.S., Christian, B. (1988) The GenBank® genetic sequence data bank. Nucleic Acids Research, 16(5): 1861-1863
Califf, M.E., Mooney, R.J. (1999) Relational learning of pattern-match rules for information extraction. u: National conference on artificial intelligence and eleventh conference on innovative applications of artificial intelligence (XVI), , Orlando, FL, July, proceedings, str. 328-334
Casacuberta, F., Vidal, E., Picó, D. (2005) Inference of finite-state transducers from regular languages. Pattern Recognition, 38(9): 1431-1443
Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V. (2002) GATE: A framework and graphical development environment for robust NLP tools and applications. u: 40th Anniversary Meeting of the Association for Computational Linguistics (ACL'02), Proceedings, Philadelphia, July 2002
Friburger, N., Maurel, D. (2004) Finite-state transducer cascades to extract named entities in texts. Theoretical Computer Science, 313(1): 93-104
Garrity, G. (2005) Bergey's manual of systematic bacteriology: The Proteobacteria. Volume 2; ISBN 978-0-387-95040-2
Garrity, G., Don, J., Krieg, N.R., Staley, J.T. (2005) Bergey's Manual of Systematic Bacteriology, Volume Two: The Proteobacteria (Part C). ISBN 978-0-387-24145-6
Grisham, R., Beth, S. (1996) Message understanding Conference 6: A brief history. u: International Conference on Computational Linguistics (COLING) (XVI), Copenhagen, Proceedings, str. 466-471
Gross, M., Perrin, D., ur. (1987) Electronic dictionaries and automata in computational linguistics. Lecture Notes in Computer Science
Hobbs, J.R., Appelt, D., Bear, J., Israel, D., Kameyama, M., Stickel, M., Tyson, M. (1997) FASTUS: A cascaded finite- state transducer for extracting information from natural- language text. u: [ur.] Finite-State Language Processing, Cambridge, MA: The MIT Press, pages 383-406
Jayram, T.S., Krishnamurthy, R., Raghavan, S., Vaithyanathan, S., Zhu, H. (2006) Avatar information extraction system. IEEE Data Engineering Bulletin, 29, 2006, 40-48
Jurafsky, D., Martin, J. H. (2008) Speech and language processing. Prentice-Hall, 2nd edition
Kornai, A., ur. (1999) Extended finite state models of language. London: Cambridge University Press
Krieg, N.R., Ludwig, W., Whitman, W.B., Hedlund, B.P., Paster, B.J., Staley, J.T., Ward, N., Brown, D., Parte, A. (2010) Bergey's manual of systematic bacteriology: The Bacteroidetes, Spirochaetes, Tenericutes (Mollicutes), Acidobacteria, Fibrobacteres, Fusobacteria, Dictyoglomi, Gemmatimonadetes, Lentisphaerae, Verrucomicrobia, Chlamydiae, and Planctomycetes. Volume 4; ISBN 978-0-387-95042-6
Liu, L., Pu, C., Han, W. (2000) XWRAP: An XMLenabledWrapper Construction System for Web Information Sources. u: Intern. Conference on Data Engineering (ICDE), pages 611-621
Mirhaji, P., Byrne, S., Kunapareddy, N., Casscells, S.W. (2006) Semantic approach for text understanding of chief complaints data. u: AMIA Annual Symposium Proceedings, Washington, p. 1033
Moens, M. (2006) Information Extraction: Algorithms and prospects in a retrieval context. Dordrecht: Springer
Pajić, V. (2011) Putting Encyclopaedia Knowledge into Structural Form: Finite State Transducers Approach. Journal of Integrative Bioinformatics, Informationsmanagement in der Biotechnologie e. V. Germany, 8 (2): 164
Pajić, V., Pavlović-Lažetić, G., Pajić, M. (2011) Information extraction from semi-structured resources: A two-phase finite state transducers approach. u: Implementation and application of automata: Proceedings of 16th International Conference CIAA, Lecture Notes in Computer Science, Berlin - Heidelberg: Springer, a; 282-289; ISBN 3642222552, 9783642222559
Pajić, V., Pavlović-Lažetić, G., Beljanski, M., Brandt, B., Pajić, M. (2011) Towards a Database for Genotype-Phenotype Association Research: Mining Data from Encyclopedia. International Journal of Data Mining and Bioinformatics, Inderscience publishers
Paumier, S. (2011) Unitex 2.1 User Manual. Universit'e de Marne-la-Vall'ee, http://www-igm.univ-mlv.fr/˜unitex/UnitexManual2.1.pdf
Reiss, F., Raghavan, S., Krishnamurthy, R., Zhu, H., Vaithyanathan, S. (2008) An algebraic approach to rule-based information extraction. u: ICDE, 2008
Roche, E. (1999) Finite state transducers: Parsing free and frozen sentences. u: Extended finite state models of language, Cambridge University Press, pp. 108. 120
Roche, E., Schabes, Y., ur. (1997) Finite state language processing. Cambridge, MA, itd: Massachusetts Institute of Technology Press / MIT Press
Sastre, J.M. (2009) Efficient Parsing Using Filtered-Popping Recursive Transition Networks. Lecture Notes in Computer Science, 5642: 241-244
Sastre, J.M., Forcada, M. (2007) Efficient parsing using recursive transition networks with output. u: Vetulani Zygmunt [ur.] Proceedings of 3rd Language & Technology Conference (LTC07), str. 280-284
Shen, W., Doan, A., Naughton, J.F., Ramakrishnan, R. (2007) Declarative information extraction using datalog with embedded extraction predicates. u: VLDB, pp. 1033-1044
Silberztein, M. (1993) Le dictionnaire électronique et analyse automatique de textes: Le système INTEX. Paris, itd: Masson
Soderl, S. (1999) Learning information extraction rules for semi-structured and free text. Machine Learning, 34(1-3):233-272
Vitas, D. (2006) Prevodioci i interpretatori - uvod u teoriju i metode kompilacije programskih jezika. Beograd: Matematički fakultet
Vos, P., Garrity, G., Jones, D., Krieg, N.R., Ludwig, W., Rainey, F.A., Schleifer, K.H., Whitman, W.B. (2009) Bergey’s manual of systematic bacteriology. u: The Firmicutes, Vol. 3: ISBN 978-0-387-95041-9
 

O članku

jezik rada: srpski
vrsta rada: članak
objavljen u SCIndeksu: 22.03.2013.

Povezani članci