EDITORIAL BOARD
Editor-In-Chief
Ngo Thanh Long,Le Quy Don Technical University
Deputy Editor-In-Chief
Tran Xuan Nam, Le Quy Don Technical University
Editors
Pham The Long, Le Quy Don Technical University
Nguyen Thanh Thuy, Vietnam National University, Ha Noi
Vu Duc Thi, Vietnam National University, Ha Noi
Le Hoai Bac, Vietnam National University, Ho Chi Minh
Tu Minh Phuong, Posts and Telecommunications Institute of Technology
Huynh Quyet Thang, Hanoi University of Science and Technology
Nguyen Huu Thanh, Hanoi University of Science and Technology
Luong Chi Mai, Vietnam Academy of Science and Technology
Tran Xuan Tu, Vietnam National University, Ha Noi
Truong Trung Kien, Posts and Telecommunications Institute of Technology
Tran Nguyen Ngoc, Le Quy Don Technical University
Hoang Van Phuc, Le Quy Don Technical University
Scientific Secretary
Tran Cao Truong, Le Quy Don Technical University
On the Fourier coefficients of Siegel Eisenstein series of higher degree.

Tran Trung, Nguyen Tuan Dang.

Keywords: Discourse Representation Structure, Indication pronoun, Assign type word label, Assign paragraph label, Parsing

Abstract: Text meaning representation is the first important step in a text summarization system (K. S. Jones [25, 26]) following the abstractive direction. The summarization process following this direction has three main performing steps: (i) Transform the input text into a semantic representation form; (ii) Transfom this semantic form into an output form; (iii) Generate the summary from the output form. In this article, we present a new method for performing the first step in the above process. This method is applied in the abstractive summarization model which was proposed in previous researches. We apply this method for short paragraphs composing from 2 to 5 Vietnamese sentences which have the anaphoric pronoun relationships between some sentences. The output of the method is Discourse Representation Structure (DRS) presenting the semantic of the input paragraph. The transforming method includes three stages: (i) Parse the shallow structures of the original sentences by OpenNLP tools (Tokenizer, POS Tagger and Chunker); (ii) Map the shallow structure to reduced structure composing main phrases, and re-create the sentence from these phrases. (iii) Build the DRS of the result paragraph from stage two.

Issues 9 (12/2016)

References

[1] A. Berger, S.A.D. Pietra and V.J.D. Pietra, "A Maximum Entropy Approach to Natural Language Processing", Computational Linguistics, vol. 22, no. 1, pp. 39-71, 1996.

[2] A. Khan and N. Salim, "A Review on Abstractive Summarization Methods", Journal of Theoretical and Applied Information Technology, vol. 59, no.1, pp. 64-72, 2014.

[3] A. Ratnaparkhi, "A maximum entropy model for part-of-speech tagging", Proceedings of the Conference on Empirical Methods in Natural Language Processing, 1996.

[4] A. Ratnaparkhi, A simple introduction to maximum entropy models for natural language processing, Technical Report 97-08, Institute for Research in Cognitive Science, University of Pennsylvania, 1997.

[5] B. Santorini, Part-of-speech Tagging Guidelines for the Penn Treebank Project, Technical Report MS-CIS-90-47, Department of Computer and Information Science, University of Pennsylvania, 1990.

[6] C.F. Greenbacker, "Towards a framework for abstractive summarization of multimodal documents", ACL HLT 2011, 75 (2011).

[7] C.S. Saranyamol and L. Sindhu, "A Survey on Automatic Text Summarization", International Journal of Computer Science and Information Technologies, vol. 5, no. 6, pp. 7889-7893, 2014.

[8] C.S. Lee, Z.W. Jian and L.K. Huang, "A Fuzzy Ontology and Its Application to News Summarization", IEEE Transaction on Systems, Man and Cybernetics, Part B: Cybernetics, vol. 35, no. 5, pp. 859-880, 2005.

[9] C.Y. Lin, "Looking for a Few Goods Metrics: ROUGE and its Evaluation", Proceedings of NTCIR Workshop 2004, Tokyo, Japan, 2004.

[10] C.Y. Lin, "ROUGE: A Package for Automatic Evaluation of Summaries", Proceedings of the Workshop on Text Summarization Branches Out, Post-Conference Workshop of ACL 2004, Barcelona, Spain, 2004.

[11] D. Das and A.F.T. Martins, A Survey on Automatic Text Summarization, Language Technologies Institute, Carnegie Mellon University, 2007.

[12] E. Krahmer, E. Marsi and P.V. Pelt, "Query-based sentence fusion is better defined and leads to more preferred results than generic sentence fusion", Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics and Human Language Technologies, Columbus, Ohio, pp. 193-196, 2008.

[13] E. Lloret, "Text Summarization: An Overview", paper supported by the Spanish Government under the project TEXT-MESS (TIN2006-15265-C06-01), 2008.

[14] F. Boudin and E. Morin, "Keyphrase extraction for n-best reranking in multi-sentence compression", Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL- HLT 2013), Atlanta, Georgia, pp. 298-305, 2013.

[15] H. Kamp, A Theory of Truth and Semantic Representation, University of Amsterdam: Formal methods in the study of language, pp. 277-322, 1981.

[16] H.P. Le, T.M.H. Nguyen, M. Rossignol and A. Roussanaly, "An empirical study of maximum entropy approach for part-of-speech tagging of Vietnamese texts", Proceedings of the Actes du Traitement Automatique des Langues Naturelles (TALN-2010), 2010.

[17] H.X. Cao, Tiếng Việt: Sơ thảo ngữ pháp chức năng [Vietnamese: Brief of Functional Grammar], Nhà xuất bản giáo dục [Education Publisher], 2006.

[18] I.F. Moawad and M. Aref, "Semantic graph reduction approach for abstractive Text Summarization", Proceedings of the 7th International Conference on Computer Engineering & Systems (ICCES), pp. 132-138, 2012.

[19] I. Mani and M.T. Maybury, Advances In Automatic Text Summarization, MIT Press, 1999.

[20] K.A. Ganesan, C.X. Zhai and J. Han, "Opinosis: A Graph-Based Approach to Abstractive Summarization of Highly Redundant Opinions", Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), Beijing, China, 2010, pp. 340-348.

[21] K. Filippova, "Multi-Sentence Compression: Finding Shortest Paths in Word Graphs", Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010), Beijing, China, pp. 322-330, 2010.

[22] K. Filippova and M. Strube, "Dependency Tree Based Sentence Compression", Proceedings of the 5th International Natural Language Generation Conference, Salt Fork, Ohio, 2008.

[23] K. Filippova and M. Strube, "Sentence Fusion via Dependency Graph Compression", Proceedings of the Conference on Empirical Methods in Natural Language Processing, Honolulu, Hawaii, 2008.

[24] K. Jezek and J. Steinberger, Automatic Text summarization, Vaclav Snasel (Ed.): Znalosti 2008, ISBN 978-80-227-2827-0, FIIT STU Brarislava, Ustav Informatiky a softveroveho inzinierstva, pp. 1-12, 2008.

[25] K.S. Jones, "Automatic Summarizing: Factors and Directions", in: I. Mani and M. Marbury (Eds), Advances in Automatic Text Summarization, MIT Press, 1999.

[26] K.S. Jones, Automatic Summarising: A Review and Discussion of The State of The Art, Technical Report 679, Computer Laboratory, University of Cambridge, 2007.

[27] K. Toutanova and C.D. Manning, "Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger", Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora (EMNLP/VLC-2000), pp. 63-70, 2000.

[28] K. Toutanova, D. Klein, C. Manning and Y. Singer, "Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network", Proceedings of HLT-NAACL 2003, pp. 252-259, 2003.

[29] M.A. Covington and N. Schmitz, An Implementation of Discourse Representation Theory, USA: Advanced Computational Methods Center, The University of Georgia, 1989.

[30] M.A.K. Halliday and C.M.I.M. Matthiessen, An Introduction to Functional Grammar, Third Edition, Hodder Arnold, 2004.

[31] M. Collins, "Head-Driven Statistical Models for Natural Language Parsing", Computational Linguistics, vol. 29, no. 4, pp. 589-637, 2003.

[32] M. Collins and M.P. Marcus, Head-driven statistical models for natural language parsing, University of Pennsylvania, Philadelphia, PA, 1999.

[33] N.R. Kasture, N. Yargal, N.N. Singh, N. Kulkarni and V. Mathur, "A Survey on Methods of Abstractive Text Summarization", International Journal for Research in Merging Science and Technology, vol. 1, iss. 6, pp. 53-57, 2014.

[34] O. Chaowalit and O. Sornil, "An Automatic Approach to Generating Abstractive Summary for Thai Opinions", International Journal of Advancements in Computing Technology, vol. 6, no. 3, pp. 142-150, 2014.

[35] P. Blackburn and J. Bos, Representation and Inference for Natural Language - Volume II: Working with Discourse Representation Structures, Germany: Department of Computational Linguistics, University of Saarland, 1999.

[36] P.E. Genest and G. Lapalme, "Framework for Abstractive Summarization using Text-to-Text Generation", Proceedings of the Workshop on Monolingual Text-to-Text Generation, Oregon, Portland, pp. 64-73, 2011.

[37] P.E. Genest and G. Lapalme, "Text Generation for Abstractive Summarization", Proceedings of the 3rd Text Analysis Conference, Gaithersburg, Maryland, USA, 2010.

[38] P.E. Genest and G. Lapalme, "Fully Abstractive Approach to Guided Summarization", Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volum 2, Jeju Island, Korea, pp. 354-358, 2012.

[39] R. Barzilay, K.R. Mckeown and M. Elhadad, "Information fusion in the context of multi-document summarization", Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics, pp. 550-557, 1999.

[40] R. Barzilay and K.R. McKeown, "Sentence fusion for multidocument news summarization", Computational Linguistics, vol. 31, pp. 297-328, 2005.

[41] S. Gerani, Y. Mehdad, G. Carenini, T.NG. Raymond and B. Nejat, "Abstractive Summarization of Product Reviews Using Discourse Structure", Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP 2014), Doha, Qatar, pp. 1602-1613, 2014.

[42] S.K. Jagadish, K.G. Srinivasa and R.B. Eswara, "A Comprehensive Analysis of Guided Abstractive Text Summarization", International Journal of Computer Science Issues, vol. 11, iss. 6, no. 1, pp. 115-121, 2014.