進階搜尋


   電子論文尚未授權公開,紙本請查館藏目錄
(※如查詢不到或館藏狀況顯示「閉架不公開」,表示該本論文不在書庫,無法取用。)
系統識別號 U0026-0809201019584000
論文名稱(中文) 部落格群中針對主題性事件之意見擷取與分析
論文名稱(英文) Extracting Opinions from Topic-based Events in the Blogosphere
校院名稱 成功大學
系所名稱(中) 工程科學系碩博士班
系所名稱(英) Department of Engineering Science
學年度 98
學期 2
出版年 99
研究生(中文) 吳振銘
研究生(英文) Chen-Ming Wu
學號 N9697118
學位類別 碩士
語文別 英文
論文頁數 48頁
口試委員 口試委員-胡誌麟
口試委員-侯廷偉
口試委員-高宏宇
指導教授-鄧維光
中文關鍵字 意見擷取  情緒分析  部落格 
英文關鍵字 opinion extraction  sentiment analysis  blogosphere 
學科別分類
中文摘要 近年來隨著資訊科技的快速發展,使用者可以輕易地在網路上發表意見。另一方面,即便互不熟識,使用者亦可從網路上獲取他人多樣性的意見。典型搜尋意見的方法是利用相關的字詞透過搜尋引擎進行搜尋,但此方式不但耗費時間成本,而且得到的意見往往不是最新亦為較單方面的意見;另一方面,在部落格群中,使用者可擁有自己的部落格空間去發表個人意見、想法或是任何評論。因此,我們將部落格當作意見資料的主要來源。在本研究中我們提出兩個主要的議題,第一個是意見的多元性,我們快速地呈現多方面的意見給使用者參考。第二個是意見的變動性,我們分析群眾對於主題式事件在不同時期所呈現的情緒與意見。最後從實驗結果證實,我們提出的方法可以有效率地幫助使用者取得主題式事件的相關意見。
英文摘要 With the rapid development of information technologies in recent years, people can now easily publish their opinions on the Internet. On the other hand, people can also obtain various opinions from others in a few seconds even though they do not know each other. A typical approach is to use search engines with some relevant keywords. Nevertheless, it is a time-consuming process. The retrieved opinions are not the latest and single-sided. In the blogosphere, people have their own blog sites as a platform to publish personal thoughts, ideas or comments. We take the blogosphere as our major data source. Therefore, two crucial issues are carefully addressed in this work. First, we propose opinion diversity issue which provides multiple opinions efficiently. Second, opinion dynamics issue shows the sentiment analysis of topic-based events in different time spans. Empirical studies show that the proposed issues assist user to extract relevant opinions of topic-based events in an efficient way.
論文目次 Chapter 1 Introduction .................................................... 1
1.1 Motivation and Overview of the Thesis ................................. 1
1.2 Contributions of the Thesis ........................................... 2
Chapter 2 Preliminaries ................................................... 3
2.1 Discovery of User Communities ......................................... 3
2.1.1 Identifying Communities ............................................. 3
2.1.2 Social Relationship in Community .................................... 4
2.2 Opinion Leaders in Blogosphere ........................................ 5
2.2.1 Link Analysis for Opinion Leaders ................................... 5
2.2.2 Information Diffusion between Bloggers .............................. 6
2.3 Mining in Blog Entries ................................................ 7
2.3.1 Topics Detection in Blog Entries .................................... 7
2.3.2 Opinion Identification in Blog Entries .............................. 8
Chapter 3 Extacting Opinions in Topic-based Events ....................... 10
3.1 Discovering Opinions in Various Ways ................................. 10
3.2 Apporaches for Sentiment Analysis .................................... 14
3.3 Concept of Opinion Diversity ..........................................16
3.4 Concept of Opinion Dynamics .......................................... 19
3.5 A Scheme for Opinion Extraction System................................ 21
Chapter 4 Prototyping and Evaluation of Our Opinion Extraction Scheme .... 24
4.1 A Prototype of the Opinion Search System ............................. 24
4.2 Evalutaion for the Accuracy of Opinions .............................. 28
4.3 Evalutaion for the Response Time ..................................... 34
4.4 Evalutaion for the Opinion Dynamics................................... 35
4.5 Evalutaion for the Variety of Opinions ............................... 38
Chapter 5 Conclusions and Future Works ....................................42
Bibliography ..............................................................43
參考文獻 [1] N. Bansal, F. Chiang, N. Koudas, and F. W. Tompa, "Seeking Stable Clusters in the Blogosphere," Proceedings of the 33rd International Conference on Very Large Data Bases, pages 806-817, September 2007.
[2] P. Bo and L. Lillian, "A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts," Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, page 271-279, July 2004.
[3] K. Denecke, "Using sentiwordnet for multilingual sentiment analysis," Proceedings of the 24th International Conference on Data Engineering, April 2008.
[4] A. Esuli and F. Sebastiani, "SentiWordNet: A Publicly Available Lexical Resource for Opinion Mining," Proceedings of the 5th Conference on Language Resources and Evaluation, pages 417-422, May 2006.
[5] Facebook, http://www.facebook.com/
[6] W. Francis and H. Kucera, "Brown Corpus Manual," Brown University, 1979.
[7] K. E. Gill, "How can We Measure the Influence of the Blogosphere," Proceedings of the 2nd Workshop on the Weblogging Ecosystem, pages 17-22, May 2004.
[8] Gizmodo, http://gizmodo.com/Google
[9] Blog Search, http://blogsearch.google.com/?hl=en&tab=wb
44
[10] D. Gruhl, R. Guha, D. Liben-Nowell, and A. Tomkins, "Information Diffusion through Blogspace," Proceedings of the 13th International Conference on World Wide Web, pages 491-501, May 2004.
[11] B. He, C. Macdonald, J. He, and I. Ounis, "An Effective Statistical Approach to Blog Post Opinion Retrieval," Proceeding of the 17th ACM Conference on Information and Knowledge Management, pages 1063-1072, October 2008.
[12] M. Helander, R. Lawrence, Y. Liu, C. Perlich, C. Reddy, and S. Rosset, "Looking for Great Ideas: Analyzing the Innovation Jam," Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 Workshop on Web Mining and Social Network Analysis, pages 66-73, August 2007.
[13] A. Java, P. Kolari, T. Finin, A. Joshi, and T. Oates, "Feeds That Matter: A Study of Bloglines Subscriptions," Proceedings of the International Conference on Weblogs and Social Media, March 2007.
[14] A. Kavanaugh, T. T. Zin, J. M. Carroll, J. Schmitz, M. Pérez-Quiñones, and P. Isenhour, "When Opinion Leaders Blog: New Forms of Citizen Interaction," Proceedings of the 2006 International Conference on Digital Government Research, pages 79-88, May 2006.
[15] D. Kempe, J. Kleinberg, and É. Tardos, "Maximizing the Spread of Influence through a Social Network," Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 137-146, June 2003.
[16] J. W. Kim, K. S. Candan, u. Candan, and J. Tatemura, "Efficient Overlap and Content Reuse Detection in Blogs and Online News Articles," Proceedings of the 18th International Conference on World Wide Web, pages 81-90, April 2009.
[17] A. Krause and C. Guestrin, "Optimizing Sensing: From Water to the Web," Computer, vol. 42, pages 38-45, August 2009.
45
[18] A. Kritikopoulos, M. Sideri, and I. Varlamis, "BlogRank: Ranking Weblogs based on Connectivity and Similarity Features," Proceedings of the 2nd International Workshop on Advanced Architectures and Algorithms for Internet Delivery and Applications, October 2006.
[19] J. Leskovec, A. Krause, C. Guestrin, C. Faloutsos, J. VanBriesen, and N. Glance, "Cost-effective Outbreak Detection in Networks," Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 420-429, August 2007.
[20] Y.-M. Li, C.-Y. Lai, and C.-W. Chen, "Identifying Bloggers with Marketing Influence in the Blogosphere," Proceedings of the 11th International Conference on Electronic Commerce, pages 335-340, August 2009.
[21] Y.-R. Lin, H. Sundaram, Y. Chi, J. Tatemura, and B. Tseng, "Discovery of Blog Communities based on Mutual Awareness," Proceedings of the WWW Workshop on Web Intelligence, December 2006.
[22] Y.-R. Lin, H. Sundaram, Y. Chi, J. Tatemura, and B. L. Tseng, "Blog Community Discovery and Evolution Based on Mutual Awareness Expansion," Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, pages 48-56, November 2007.
[23] M. Mathioudakis and N. Koudas, "Efficient Identification of Starters and Followers in Social Media," Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, pages 708-719, March 2009.
[24] A. Mehra, A. L. Dixon, D. J. Brass, and B. Robertson, "The Social Network Ties of Group Leaders: Implications for Group Performance and Leader Reputation," Organization Science, vol. 17, pages 64-79, February 2006.
[25] S. Nakajima, J. Tatemura, Y. Hino, Y. Hara, and K. Tanaka, "Discovering Important
46
Bloggers based on Analyzing Blog Threads," Proceedings of the 2nd Annual Workshop on the Weblogging Ecosystem, May 2005.
[26] A. Qamra, B. Tseng, and E. Y. Chang, "Mining Blog Stories Using Community-Based and Temporal Clustering," Proceedings of the 15th ACM International Conference on Information and Knowledge Management, pages 58-67, November 2006.
[27] B. Rosario and M. A. Hearst, "Classifying Semantic Relations in Bioscience Texts," Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, pages 430, July 2004.
[28] G. Salton, "Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer," 1989.
[29] A. Singhal, C. Buckley, and M. Mitra, "Pivoted Document Length Normalization,」 Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 21-29, August 1996.
[30] X. Song, Y. Chi, K. Hino, and B. Tseng, "Identifying Opinion Leaders in the Blogosphere," Proceedings of the 16th ACM conference on Conference on Information and Knowledge Management, pages 971-974, November 2007.
[31] K. Sriphaew, H. Takamura, and M. Okumura, "Cool Blog Identification Using Topic-Based Models," Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01, pages 402-406, December 2008.
[32] H. Tang, S. Tan, and X. Cheng, "A survey on sentiment detection of reviews," Expert Systems with Applications, 36(7):10760-10773, November 2009.
[33] Twitter Search, http://search.twitter.com/
[34] I. Varlamis, V. Vassalos, and A. Palaios, "Monitoring the Evolution of Interests in the
47
Blogosphere," Proceedings of the 24th International Conference on Data Engineering Workshops, April 2008.
[35] M. Viermetz and M. Skubacz, "Using Topic Discovery to Segment Large Communication Graphs for Social Network Analysis," Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, pages 95-99, October 2007.
[36] C. Wang, M. Zhang, L. Ru, and S. Ma, "Automatic Online News Topic Ranking Using Media Focus and User Attention based on Aging Theory," Proceeding of the 17th ACM Conference on Information and Knowledge Management, pages 1033-1042, October 2008.
[37] S.-H. Yoon, J.-H. Shin, S.-W. Kim, and S. Park, "Extraction of a Latent Blog Community based on Subject," Proceeding of the 18th ACM Conference on Information and Knowledge Management, pages 1529-1532, November 2009.
[38] H. Yu and V. Hatzivassiloglou, "Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences," Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing - Volume 10, pages 129-136, July 2003.
論文全文使用權限
  • 同意授權校內瀏覽/列印電子全文服務,於2010-12-31起公開。
  • 同意授權校外瀏覽/列印電子全文服務,於2020-12-31起公開。


  • 如您有疑問,請聯絡圖書館
    聯絡電話:(06)2757575#65773
    聯絡E-mail:etds@email.ncku.edu.tw