|
Michael Chau, Ph.D.
Associate Professor
School of Business
Faculty of Business and Economics
The University of Hong Kong
Email: mchau |at| business |dot| hku |dot| hk
http://www.business.hku.hk/~mchau/
|
[
Main Page |
Vitae |
Teaching |
Research |
Publications |
Professional Services |
Students |
Additional Information |
Call for Papers]
Current Research Areas
Business Intelligence on the Web
As an excellent information source, the Internet provides significant opportunities for business intelligence analysis. Web content, outgoing links, and incoming links of a company’s Web site can all provide important insights about the company's business and "online communities". Although analysis of these contents and communities can provide useful signals for a company and information about its stakeholder groups, the manual analysis process can be very time-consuming for business analysts and consultants. My research addresses this problem by proposing a new design that integrates best-first search, backlink search, meta-search, and text mining techniques to facilitate users in performing such business intelligence analysis on the Web. Several research prototypes were developed and applied in various business contexts. Experiments were conducted to evaluate the effectiveness, efficiency, and user satisfaction of tools and the results showed that these tools were statistically more effective than the benchmark approaches.
Web Mining
The World Wide Web is the largest repository of information that can be easily accessed by many. Web mining research is the use of data mining and text mining as well as other similar techniques to discover resources, patterns, and knowledge from the Web and Web-related data (such as Web usage data or Web server logs). Web mining research can be classified into three categories: Web content mining, Web structure mining, and Web usage mining. In my research, Web content and structure mining techniques have been used in the development of Web search agents and Web portals. Blog mining also has been proposed to extract interesting patterns from blogs - most of them being frequently-updated personal online diaries. Such analyses have useful business and practical implications (e.g., marketing). Web usage mining has been performed on Web logs and search logs to reveal the information needs of Web users in order to improve the design of Web sites and search engines.
Data Mining
The goal of this research is to advance techniques in data mining for different types of data. For example, data uncertainty is often inherent in real world applications that require interaction with the physical world. In particular, data collected from external environments is often imprecise due to measurement inaccuracy, sampling discrepancy, outdated data sources, or other errors. However, this is often not taken into account in existing data mining algorithms. One research direction is to study the various issues of mining uncertain data, particularly with respect to data clustering. Another example is to study the application of data mining techniques to the Web. While most data mining techniques have been applied to Web data directly, I proposed that it is possible to incorporate various Web analysis metrics into data mining models. In particular, a Hopfield Net model was proposed to represent the Web's structure to tightly couple the mining techniques with the nature of the Web.
Security Informatics
Intelligence agencies such as the FBI are actively collecting and analyzing large amount of data to investigate terrorists' activities. Local law enforcement agencies have also been using data and information technology to fight against the criminal activities in their own jurisdictions. One challenge to these agencies is the difficulty in analyzing the large volumes of data involved in criminal and terrorist activities. My research addresses this problem by assisting these agencies in analyzing criminal data and online materials using various machine learning and visualization techniques.
IT Education
Advances in computer and Internet technologies have made it more and more important for information technology professionals to acquire experience in a variety of aspects, including new technologies, system integration, database administration, and project management. My research in this area has two main goals: (1) study the design and usage of various hands-on tools in classroom, and (2) study the use of machine learning algorithms to process digital teaching materials (e.g., lecture videos and slides) to assist e-learning.
Funding Sources
- Applying Blog Mining Techniques in Suicide Research and Prevention,
General Research Fund, Hong Kong Research Grants Council,
2012 - 2014 (742012):
- Principal Investigator of the project
- Investigate the characteristics of bloggers in suicide-related online communities
- Develop methods for suicide intention detection and analysis for blogs
- Evaluate the performance of the proposed techniques for blogs in different languages in suicide detection
- Identifying and Understanding Influential Individuals in Online Social Networks,
HKU Small Project Funding,
Jun 2012 - May 2014:
- Principal Investigator of the project
- Devise methods to find the most influential individuals in consumer networks in social media
- Evaluate the effectiveness of existing social network analysis techniques and to propose new techniques for marketing in online social networks
- E-engagement Capacity Enhancement for NGOs,
Excellence and Capacity building for Entrepreneurship and Leadership for the Third Sector (ExCEL3), HKU-HKJC,
Feb 2012 - Feb 2014:
- Principal Investigator of the project (Co-I: C. Cheng, P. S. F. Yip)
- Explore the needs and preferences of the users of NGO's websites and applications
- Create a pedagogical workshop and training materials to empower local organisations to meet their online strategy goals
- Research on Key Techniques for Predicting Uncertain Trajectories of Moving Objects with Dynamic Environment Awareness,
National Natural Science Foundation of China,
Jan 2012 - Dec 2014 (61100045):
- Co-Investigator (PI: S. Qiao)
- Study the prediction of the trajectories of moving objects with uncertain trajectories by applying dynamic environment awareness
- Study the applications of the proposed techniques in various domains such as intelligent transpotation systems and crime analysis.
- Dissecting Hacker Behaviors in Online Communities and the Threats that They Pose,
General Research Fund, Research Grants Council, HKSAR Government,
2011 - 2013 (152611):
- Co-Investigator of the project (PI: W. T. Yue)
- Study the general phenomenon of hacker forums
- Propose countermeasures firms should undertake to account for the risks posed by these online communities
- Predicting Online Auction Prices Based on Feedback Histories and Text Comments,
HKU Small Project Funding,
Mar 2011 - Feb 2013:
- Principal Investigator of the project
- Investigate the effect of feedback histories and text comments on final auction prices using data mining techniques and economic models.
- Evaluate the model using real-world data collected from large auction sites like eBay.
- Understanding and Analyzing Online Public Opinion in Hong Kong Cyberspace,
Central Policy Unit (Contract Research), HKSAR Government
2010-2011:
- Co-Principal Investigator of the project (PI: K.W. Fu)
- Design and implement a system for collecting and analyzing online public opinions in a systematic manner
- Study the impact of online public opinions on policy formulation and public governance
- Promoting Mental Well-Being in Youths through an Interactive Online Game on a Social Networking Website,
Health Care and Promotion Fund, Food and Health Bureau, HKSAR Government,
Apr 2010 - Sept 2011 (23090514):
- Principal Investigator of the project (Co-I: P. W. C. Wong and P. S. F. Yip)
- Design and develop an online game for promoting mental health and study its effectiveness.
- Investigate the issues related to the use of Web 2.0 media for mental health promotion.
- Exploring the Use of Three-Dimensional Virtual Worlds for Teaching and Learning Activities,
HKU Earmarked Teaching Development Grant,
Nov 2009 - Oct 2010 (10100316):
- Principal Investigator of the project (Co-I: M. Wang and A. Wong)
- Explore the feasibility and performance of using 3D virtual worlds for teaching and learning at HKU.
- Develop a prototype 3D environment in Second Life for teaching and learning activities.
- Social Mining for Web 2.0 Applications,
HKU Seed Funding for Basic Research,
May 2009 - December 2011 (10400397):
- Principal Investigator of the project (Co-I: J. Xu)
- Investigate the nature and characteristics of the social network in Web 2.0 sites and compare that with existing online social networks.
- Develop a framework for the automatic identification, collection, and analysis of social networks in Web 2.0 sites.
- Apply graph theory and social network analysis algorithms on these social networks to reveal interesting patterns.
- Applying Information Visualization Techniques to Web Search Results,
HKU Seed Funding for Basic Research,
March 2008 - February 2010 (10208140):
- Principal Investigator of the project.
- Evaluate the effectiveness of different visualization techniques for displaying results from
Web search engine
- Study the user interface factors that influence the effectiveness and efficiency in Web
information seeking
- Analysis and Visualization of Web Search Queries,
HKU Seed Funding for Basic Research,
January 2007 - December 2008 (10207565):
- Principal Investigator of the project.
- Study the information seeking behavior of Web users.
- Perform analysis and data mining on Web search query logs.
- Investigate the use of visualization techniques in helping users locate information on the Web.
- Searching and Analyzing Blogs for Competitive Advantages,
HKU Seed Funding for Basic Research,
January 2006 - December 2007 (10206775):
- Principal Investigator of the project.
- Study the weblogging (blogging) phenomenon and its business implications.
- Evaluate the use of data mining and text mining techniques in analyzing blogs for
market intelligence.
- Improving Document Clustering Techniques for Web Search Result Analysis,
HKU Incentive Award,
2005 - 2006:
- Principal Investigator of the project.
- Study the incorporation of hyperlink information in Web page clustering.
- Evaluate the performance of different clustering and visualization techniques on
post-retrieval analysis of Web pages.
- Searching the World Wide Web Backwards for Business Intelligence Analysis,
HKU Seed Funding for Basic Research,
January 2005 - December 2006 (10206086):
- Principal Investigator of the project.
- Study the use of back-link (in-link) search and text mining techniques in
performing business intelligence analysis on the World Wide Web.
- Evaluate the effectiveness of using back-link information in identifying implicit
cyber-communities on the Web.
- Examining Organizations’ Motivations, Strategies and Effectiveness of
Information Systems/Technology Outsourcing: A Contingency Approach,
HKU Seed Funding for Basic Research,
January 2005 - December 2006:
- Co-PI of the project (PI: P. Chau).
- Investigate the motivation and success/failure of IT outsourcing projects in
organizations in Hong Kong.
- Using Content and Link Analysis in Developing Domain-specific
Web Search Engines: A Machine Learning Approach,
HKU Seed Funding for Basic Research,
February 2004 - January 2006 (10205294):
- Principal Investigator of the project.
- Studied the use of content-based and hyperlink-based analysis in Web
page filtering using various data mining techniques such as neural
networks and support vector machines.
- Designed and compared several algorithms to create Web portals for
specific domains (e.g., medicine, nanotechnology, business intelligence).
- UMLS Enhanced Dynamic Agents to Manage Medical Knowledge,
National Institutes of Health/National Library of Medicine,
February 2001 - February 2004 (1 R01 LM06919-1A1):
- Designed and implemented a
medical search engine, and evaluated its usability and impact
to the medical community.
- Developed neural network algorithms for efficiently spidering
and filtering medical Web pages and evaluated such approaches.
- Applied natural language processing techniques to Web page
indexing.
- COPLINK Center for Excellence: Information and Knowledge Management
in Law Enforcement,
NSF Digital Government Initiative,
September 2000 - August 2003 (EIA-9983304):
- Project leader and system architect for the COPLINK Agent System and
the COPLINK Narrative Analysis and Visualization System.
- Researched on applying machine learning and visualization techniques
to improve information analysis and collaboration among users in the
law enforcement domain.
- Applied computational linguistics and machine learning
techniques for entity-extraction in textual analysis for police narrative
reports.
- NanoPort: Intelligent Web Searching for Nanoscale Science and
Engineering,
NSF/CTS/SGER,
February 2002 - November 2002 (CTS-0204375):
- Project leader and major investigator.
- Researched on the different issues involved in building a Web
portal for nanoscale science and engineering.
- Designed and implemented the
NanoPort
system.
- High Performance Digital Library Classification Systems: From
Information Retrieval to Knowledge Management,
NSF/ARPA/NASA Digital Library Initiative Phase 2,
May 1999 - April 2002 (IIS-9817473):
- Designed and developed different variations of
Internet spiders, including
Competitive Intelligence Spider,
Meta Spider,
Cancer Spider, and
Nano Spider.
Designed and conducted user studies to evaluate the efficiency,
effectiveness and usability of these tools.
- Investigated the use of noun phrasing and self-organizing map
techniques in helping users retrieve and analyze information on the
Internet more efficiently.
- Designed and implemented the
Spiders"R"Us
digital library toolkit which allows users to build personal digital
libraries and search engines.
- An Intelligent CSCW Workbench: Analysis, Visualization, and Agents,
NSF/CISE/CSS,
June 1998 - June 2001 (IIS-9800696):
- Project leader and chief system architect.
- Designed and studied multi-agent based architectures for different
applications, including collaborative filtering, competitive
intelligence, medical information retrieval, e-commerce, and
law enforcement.
- Researched on different agent communication languages such as
KQML.