NSF III-Core-Small: MoveMine: Mining
Sophisticated Patterns and Actionable Knowledge from Massive Moving Object Data
E-mail: hanj
at cs.uiuc.edu, URL: http://www.cs.uiuc.edu/~hanj
List of Supported Students and Staff
§ Zhenhui Li, Ph.D. student, Department of Computer
Science, University of Illinois at Urbana-Champaign (duration working on this
project: 2010-2012) (Graduated in 2012, now assistant professor, Penn State
Univ.)
§ Lu An Tang, Ph.D. student,
Department of Computer Science, University of Illinois at Urbana-Champaign (duration working on this project: 2010-2013)
(Graduated in 2013, now technical staff, NEC, New Jersey)
§ Manish
Gupta, Ph.D. student, Department of Computer Science, University of
Illinois at Urbana-Champaign (duration working on this project: 2010-2013) (Graduated
in 2013, now technical staff, Microsoft India)
§ Jingjing
Wang, Ph.D. student, Department of Computer Science, University of Illinois
at Urbana-Champaign (duration working on this project: 2012-2016) (Graduated in
2016, now technical staff, Google Inc.)
§ Chao Zhang, Ph.D.
student, Department of Computer Science, University of Illinois at
Urbana-Champaign (duration working on this project: 2013-present)
§ Doris Xin, PhD
student, Department of Computer Science, University of Illinois at
Urbana-Champaign (duration working on this project: 2014-present)
§ Wenqi
He, MS student, Department of Computer Science, University of Illinois at
Urbana-Champaign (duration working on this project: 2015-present)
§ Quan
Yuan, Postdoc Research Scientist, Department of Computer Science,
University of Illinois at Urbana-Champaign (duration working on this project:
2015-present)
Project Summary
This research
project is to investigate principles and methods for uncovering sophisticated
patterns and actionable knowledge from massive moving object data. Thanks to the rapid progress and broad
adoption of sensor, GPS, wireless network, and other advanced technologies,
moving object data have been accumulating in unprecedented scale. However,
moving object data could be dynamic,
sparse, scattered, and noisy, and patterns and knowledge to be mined could
be deeply hidden, sophisticated, and
subtle. The MoveMine
project investigates effective and scalable methods for mining various kinds of
complex patterns from dynamic and noisy moving object data, finding multiple
interleaved periodic patterns, and performing in-depth multidimensional
analysis of moving object data. It
integrates and extends multiple disciplinary approaches derived from
spatiotemporal data analysis, data mining, pattern recognition, statistics, and
machine learning. The study takes bird
and animal movement data and traffic data as the major sources of data for
investigation. However, developed
methods can be applied to the analysis of many other kinds of moving object
data for environmental study, traffic control, law enforcement, and protection
of homeland security. The study also
addresses the issue of ensuring privacy and security protection while
developing powerful pattern and knowledge discovery mechanisms. The research results are to be published in
various research and application forums and be integrated into the educational
programs at UIUC. The research results
are also disseminated via the project Web site:
(http://www.cs.uiuc.edu/homes/hanj/projs/movemine.htm).
Publications
and Products:
Note: Please search and download all the papers in
PDF, if available, at our groups publication website by following the link: Selected
research publications.
Books
(authored or edited)
1.
Ashok N. Srivastava and Jiawei Han (eds.), Machine Learning and
Knowledge Discovery for Engineering Systems Health Management: Detection,
Diagnostics, and Prognostics, Chapman & Hall, 2011.
2.
Yizhou Sun and Jiawei Han, Mining Heterogeneous
Information Networks: Principles and Methodologies, Morgan &Claypool
Publishers, July 2012
3.
Manish Gupta, Jing Gao, Charu Aggarwal, and
Jiawei Han, Outlier Detection for Temporal Data, Morgan & Claypool
Publishers, 2014.
4.
Chi Wang and Jiawei Han, Mining Latent Entity Structures, Synthesis
Lectures on Data Mining and Knowledge Discovery, Morgan & Claypool
Publishers, 2015.
Journal articles
1. Jun Li, Jingjing Wang, Junfei Zhang, Qiming Qin, Tanvi Jindal, Jiawei
Han, "A probabilistic approach to detect mixed periodic patterns from
moving object data", Geoinformatica, 2016.
2.
Lu-An Tang, Xiao Yu, Quanquan Gu, Jiawei Han, Guofei Jiang,
Alice Leung, and Thomas La Porta, "A Framework of Mining Trajectories from
Untrustworthy Data in Cyber-Physical System", ACM Transactions on
Knowledge Discovery from Data (TKDD), 9(3):16:1-16:35 (2015)
3.
Zhenhui Li, Jingjing Wang, and Jiawei Han, "e-Periodicity:
Mining Event Periodicity from Incomplete Observations", IEEE Transactions
on Knowledge and Data Engineering (TKDE), 27(5): 1219-1232 (2015)
4.
Jae-Gil Lee, Jiawei Han, and Xiaolei
Li, "A Unifying Framework of Mining Trajectory Patterns of Various
Temporal Tightness," IEEE Transactions on Knowledge and Data Engineering
(TKDE), 27(6): 1478-1490 (2015)
5.
Jun Li, Qiming Qin, Jiawei Han,
Lu An Tang, Kin Hou Lei, "Mining Trajectory Data
and Geotagged Data in Social Media for Road Map Inference", Transactions
in GIS (T. GIS) 19(1): 1-18 (2015)
6.
Chao Zhang,
Jiawei Han, Lidan Shou, Jiajun Lu, and Thomas F. La Porta, "Splitter: Mining
Fine Grained Sequential Patterns in Semantic Trajectories", PVLDB 7(9): 769-780,
2014 (Also, Proc.
2014 Int. Conf. on Very Large Data Bases (VLDB'14), Hangzhou, China,
Sept. 2014.)
7.
Manish Gupta, Jing Gao, Charu
C. Aggarwal, and Jiawei Han, "Outlier Detection for Temporal Data: A
Survey", IEEE Transactions on Knowledge and Data Engineering (TKDE), 26(9): 2250-2267, 2014.
8. Lu-An Tang, Yu
Zheng, Jing Yuan, Jiawei Han, Alice Leung, Wen-Chih
Peng, Thomas La Porta, A Framework of Traveling Companion Discovery on Trajectory Data Streams", ACM Transactions on
Intelligent Systems and Technology (ACM TIST), 5(1):3, 2013.
9. Lu-An Tang, Xiao Yu, Sangkyum
Kim, Quanquan Gu, Jiawei
Han, Alice Leung, Thomas La Porta, Trustworthiness
Analysis of Sensor Data in Cyber-Physical Systems, accepted
by Special Issue on Data Warehousing and Knowledge Discovery from Sensors and
Streams, Journal of Computer and System Sciences (JCSS), 79(3): 383-401, 2013.
10.
Lu-An Tang, Xiao Yu, Sangkyum Kim, Jiawei Han, Wen-Chih
Peng, Yizhou Sun, Alice Leung, Thomas La Porta, Multidimensional
Sensor Data Analysis in Cyber-Physical Systems: An Atypical Cube Approach, International Journal of Distributed Sensor Networks, Vol.
2012, 2012.
11. Zhenhui Li, Jiawei Han, Bolin Ding, and Roland Kays,
Mining Periodic Behaviors of Object
Movements for Animal and Biological
Sustainability Studies, Data Mining and Knowledge Discovery, 24(2):355-386, 2012.
12.
Zhenhui Li, Jiawei Han,
Ming Ji, Lu-An Tang, Yintao Yu, Bolin Ding, Jae-Gil
Lee, and Roland Kays, "MoveMine: Mining Moving Object Data for Discovery of Animal Movement
Patterns", ACM Transactions on Intelligent Systems and Technology
(ACM TIST), 2(4):37, 2011.
13. Zhenhui Li, Jiawei Han,
Ming Ji, Lu-An Tang, Yintao Yu, Bolin Ding, Jae-Gil
Lee, and Roland Kays, MoveMine: Mining Moving Object
Data for Discovery of Animal Movement Patterns", ACM Transactions on
Intelligent Systems and Technology (ACM TIST) (Special Issue on Computational Sustainability), 2(4):37, 2011.
14. Jae-Gil Lee,
Jiawei Han, Xiaolei Li, and Hong Cheng, Mining
Discriminative Patterns for Classifying Trajectories on Road Networks", IEEE
Transactions on Knowledge and Data Engineering, 23(5):713-725, 2011.
15. Zhenhui Li, Bolin Ding,
Jiawei Han, and Roland Kays, Swarm: Mining Relaxed Temporal Moving Object
Clusters", PVLDB 3(1): 723-734, 2010. (Also, Proc. 2010 Int. Conf. on Very
Large Data Bases (VLDB'10), Singapore, Sept. 2010.)
Book
Chapters
1.
Zhenhui Li and Jiawei
Han, "Mining Periodicity from Dynamic and Incomplete Spatiotemporal
Data", in Wesley W. Chu (ed.), Data Mining and Knowledge Discovery for Big
Data, pp. 41-82, Springer, 2014.
Refereed Conference Publications
1.
Quan
Yuan, Wei Zhang, Chao Zhang, Xinhe Geng, Gao Cong and Jiawei Han, "Periodic Region Detection for Mobility Modeling of Social
Media Users", Proc. of 2017 ACM Int. Conf. on Web Search and Data
Mining (WSDM'17), Cambridge UK, Feb. 2017
2. Chao Zhang, Keyang
Zhang Quan Yuan, Luming
Zhang, Tim Hanratty, and Jiawei Han, "GMove: Group-Level Mobility Modeling Using Geo-Tagged
Social Media", in Proc. of 2016 ACM SIGKDD Conf. on Knowledge
Discovery and Data Mining (KDD'16), San Francisco, CA, Aug. 2016
3. Meng Jiang, Christos Faloutsos,
Jiawei Han, "CatchTartan: Representing and
Summarizing Dynamic Multicontextual
Behaviors", in Proc. of 2016 ACM SIGKDD Conf. on Knowledge
Discovery and Data Mining (KDD'16), San Francisco, CA, Aug. 2016
4. Xiang Ren, Wenqi
He, Meng Qu, Clare R. Voss, Heng
Ji, Jiawei Han, "Label Noise Reduction in Entity Typing by Heterogeneous
Partial-Label Embedding", in Proc. of 2016 ACM SIGKDD Conf. on
Knowledge Discovery and Data Mining (KDD'16), San Francisco, CA, Aug. 2016
5. Mengting Wan, Xiangyu Chen, Lance
Kaplan, Jiawei Han, Jing Gao, Bo Zhao, "An Uncertainty-Aware Model to
Summarize Trustworthy Quantitative Information", in Proc. of
2016 ACM SIGKDD Conf. on Knowledge Discovery and Data Mining (KDD'16), San
Francisco, CA, Aug. 2016
6. Chao Zhang, Guangyu Zhou,
Quan Yuan, Honglei Zhuang,
Yu Zheng, Lance Kaplan, Shaowen Wang, Jiawei Han,
"GeoBurst: Real-time Local Event Detection in
Geo-tagged Tweet Stream", in Proc. of 2016 ACM SIGIR Conf. on
Research & Development in Information Retrieval (SIGIR'16), Pisa, Italy,
July 2016
7.
Jingjing Wang, Changsung Kang, Yi Chang and Jiawei
Han, "Learning Hostname Preference to Enhance Search
Relevance", in Proc. of 2016 Int. Joint Conf. on Artificial
Intelligence (IJCAI'16), New York City, NY, July 2016
8.
Wei Zhang,
Quan Yuan, Jiawei Han and Jianyong
Wang, "Collaborative Multi-Level Embedding Learning from Reviews for
Rating Prediction", in Proc. of 2016 Int. Joint Conf. on
Artificial Intelligence (IJCAI'16), New York City, NY, July 2016
9.
Huan Gui, Jiawei Han, Quanquan Gu, "Towards Faster
Rates and Oracle Property for Low-Rank Matrix Estimation", in Proc. of 2016
Int. Conf. on Machine Learning (ICML'16), New York City, NY, June
2016.
10. Jingbo Shang, Wenzhu Tong, Jian
Peng, and Jiawei Han, "DPClass: An
Effective but Concise Discriminative Patterns-Based Classification
Framework", in Proc of 2016 SIAM Int. Conf. on Data Mining (SDM'16),
Miami, FL, May 2016
11. Jingbo Shang, Jian Peng, and Jiawei Han, "MACFP:
Maximal Approximate Consecutive Frequent Pattern Mining under Edit Distance",
in Proc of 2016 SIAM Int. Conf. on Data Mining (SDM'16), Miami, FL, May 2016
12. Chenguang Wang, Yizhou Sun, Yanglei Song, Jiawei Han, Yangqiu
Song, Lidan Wang, and Ming
Zhang, "Querying Similar Relations in Schema-Rich Heterogeneous
Information Networks", in Proc of 2016 SIAM Int. Conf. on Data Mining
(SDM'16), Miami, FL, May 2016
13. Jialu Liu, Xiang Ren, Jingbo
Shang, Taylor Cassidy, Clare Voss and Jiawei Han, "Representing Documents
via Latent Keyphrase Inference", in Proc.
of 2016 Int. World-Wide Web Conf. (WWW'16), Montreal, Canada, April
2016
14. Yang Li, Shulong Tan, Huan Sun, Jiawei Han, Dan Roth and Xifeng
Yan, "Entity Disambiguation with Linkless
Knowledge Bases", in Proc. of 2016 Int. World-Wide Web
Conf. (WWW'16), Montreal, Canada, April 2016
15. Aston Zhang, Amit Goyal, Ricardo Baeza-Yates, Yi Chang, Jiawei Han, Carl A. Gunter and Hongbo Deng, "Towards Mobile Query
Auto-Completion: An Efficient Mobile Application-Aware
Approach", in Proc. of 2016 Int. World-Wide Web
Conf. (WWW'16), Montreal, Canada, April 2016
16. Xiuli Ma, Guangyu Zhou, Jingjing Wang, Jian Peng, and Jiawei Han, "Complexes
Detection in Biological Networks via Diversied
Dense Subgraphs Mining", in Proc. of 2016 Int. Conf. on Research
in Computational Molecular Biology (RECOMB'16), Los Angeles, CA, Apr.,
2016
17. Chenguang Wang, Yangqiu Song, Haoran Li, Ming Zhang and Jiawei Han, "Text
Classification with Heterogeneous Information Network Kernels", in Proc.
of 2016 AAAI Conf. on Artificial Intelligence (AAAI'16),
Phoenix, AZ, Feb. 2016
18. Jingjing Wang, Min Li, Jiawei Han and Xiaolong
Wang, "Modeling Check-in Preferences with Multidimensional Knowledge: A
Minimax Entropy Approach", in Proc. of 2016 Int. Conf. on Web Search and
Data Mining (WSDM'16), San Francisco, CA. Feb. 2016.
19. Jingjing Wang, Wenzhu Tong, Hongkun Yu, Min Li, Xiuli Ma, Haoyan Cai, Tim Hanratty, and Jiawei Han, "Mining Multi-Aspect
Reflection of News Events in Twitter: Discovery, Linking and
Presentation", in Proc. of 2015 IEEE Int. Conf. on Data Mining
(ICDM'15), Atlantic City, NJ, Nov. 2015
20.
Chao Zhang, Shan Jiang, Yucheng Chen, Yidan Sun, and Jiawei Han,"Fast
Inbound Top-K Query for Random Walk with Restart", in Proc. of 2015
European Conf. on Machine Learning and Principles and Practices of Knowledge
Discovery in Databases (ECMLPKDD'15), Porto, Portugal, Sept. 2015. (Received the Best Student Paper Runner-Up
award at ECML/PKDD 2015)
21.
Chao Zhang, Yu Zheng, Xiuli Ma, Jiawei Han, "Assembler: Efficient
Discovery of Spatial Coevolving Patterns in Massive Geosensory
Data", in Proc. of 2015 ACM SIGKDD Int. Conf. on Knowledge Discovery and
Data Mining (KDD'15), Sydney, Australia, Aug. 2015
22.
Xiang Ren, Ahmed El-Kishky, Chi Wang, Fangbo Tao, Clare R. Voss, Heng
Ji, Jiawei Han, "ClusType: Effective Entity
Recognition and Typing by Relation Phrase-Based Clustering", in Proc. of
2015 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'15),
Sydney, Australia, Aug. 2015
23.
Younghoon Kim, Jiawei Han, Cangzhou Yuan, "TOPTRAC: Topical Trajectory Pattern
Mining", in Proc. of 2015 ACM SIGKDD Int. Conf. on Knowledge Discovery and
Data Mining (KDD'15), Sydney, Australia, Aug. 2015
24.
Wei Feng, Chao Zhang, Wei Zhang, Jiawei Han, Jianyong
Wang, Charu Aggarwal, Jianbin
Huang, " STREAMCUBE: Hierarchical Spatio-temporal
Hashtag Clustering for Event Exploration over the Twitter Stream", in Proc
of 2015 IEEE Int. Conf on Data Engineering (ICDE'15),
Seoul, Korea, Apr. 2015
25.
Honglei Zhuang, Jing Zhang, George Brova, Jie Tang, Hasan Cam, Xifeng
Yan and Jiawei Han, "Mining Query-Based Subnetwork Outliers in
Heterogeneous Information Networks", in Proc. of 2014 IEEE Int. Conf. on
Data Mining (ICDM'14), Shenzhen, China, Dec. 2014.
26.
Manish Gupta, Arun Mallya,
Subhro Roy, Jason H. D. Cho, and Jiawei Han,"Local Learning for Mining Outlier Subgraphs from
Network Datasets", in Proc. of 2014 SIAM Int. Conf.on
Data Mining (SDM'14), Philadelphia, PA, April 2014.
27. Manish Gupta,
Jing Gao, Xifeng Yan, Hasan Cam, and Jiawei Han,
"Top-K Interesting Subgraph Discovery in Information Networks", Proc.
2014 IEEE Int. Conf. on Data Engineering (ICDE'14), Chicago, IL, Mar. 2014.
28. Manish Gupta, Jing
Gao, Xifeng Yan, Hasan Cam, and Jiawei Han, On Detecting
Association-Based Clique Outliers in Heterogeneous Information Networks", Proc. of 2013
IEEE/ACM Int. Conf. on Social
Networks Analysis and Mining (ASONAM'13), Niagara Falls, Canada, Aug. 2013
29. Lu-An Tang, Xiao
Yu, Quanquan Gu, Jiawei
Han, Alice Leung, and Thomas La Porta, Mining Lines in the Sand: On Trajectory Discovery From Untrustworthy
Data in Cyber-Physical System", Proc. of 2013 ACM
SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'13), Chicago, IL, Aug.
2013.
30.
Zhenhui Li, Jingjing Wang, and Jiawei Han, "Mining
Periodicity for Sparse and Incomplete Event Data", Proc. of 2012
ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD'12), Beijing,
China, Aug. 2012
31.
Jingjing Wang and Bhaskar Prabhala, "Periodicity Based Next Place Prediction", Proc. of
Workshop on Mobile Data Challenge by Nokia, Newcastle, UK, June 2012
32. Lu-An Tang, Yu Zheng, Jing Yuan, Jiawei Han, Alice Leung, Chih-Chieh Hung, and Wen-Chih
Peng, "On Discovery of Traveling Companions
from Streaming Trajectories", Proc. 2012 IEEE Int. Conf. on Data
Engineering (ICDE'12), Arlington, VA, Apr. 2012.
33. Lu-An
Tang, Xiao Yu, Sangkyum Kim, Jiawei Han, Yizhou Sun, Wen-Chih Peng, Hector
Gonzalez, Sebastian Seith, "Multidimensional
Analysis of Atypical Events in Cyber-Physical Data", Proc. 2012 IEEE Int.
Conf. on Data Engineering (ICDE'12), Arlington, VA, Apr. 2012.
34.
Zhenhui Li, Cindy Xide Lin, Bolin Ding, and
Jiawei Han, Mining Significant Time Intervals for
Relationship Detection, Proc. of 2011 Int. Symp. on Spatial and Temporal
Databases (SSTD'11), Minneapolis, MN, Aug. 2011.
35.
Lu-An Tang, Yu Zheng, Xing Xie, Jing Yuan, Xiao Yu, Jiawei Han, Retrieving
k-Nearest Neighboring Trajectories by a Set of Point Locations, Proc. of 2011 Int. Symp. on Spatial and Temporal Databases (SSTD'11), Minneapolis,
MN, Aug. 2011.
36. Xiao Yu, Ang
Pan, Lu-An Tang, Zhenhui Li and Jiawei Han, "Geo-Friends Recommendation in GPS-based
Cyber-Physical Social Network", Proc. of 2011
Int. Conf. on Advances in Social Network Analysis and Mining (ASONAM'11),
Kaohsiung, Taiwan, July 2011.
37. Zhijun Yin, Liangliang
Cao, Jiawei Han, Jiebo Luo, and Thomas Huang, Diversified Trajectory Pattern Ranking in
Geo-tagged Social Media, Proc. of 2011 SIAM Conf. on
Data Mining (SDM'11), Phoenix, AZ, Apr. 2011.
38. Zhijun Yin, Liangliang
Cao, Jiawei Han, Chengxiang Zhai,
and Thomas Huang, Geographical
Topic Discovery and Comparison, Proc. of 2011 Int.
World Wide Web Conf. (WWW'11), Hyderabad, India, Mar. 2011 (Full paper).
Ph.D.
Dissertations
1.
Jingjing Wang, Ph.D., Aug. 2016, thesis title: Leveraging Multi-Dimensional, Multi-Source
Knowledge for User Preference Modeling and Event Summarization in Social Media", link to Ph.D. dissertation
2.
Lu An Tang, Ph.D.,
August 2013, thesis title: Mining Sensor and
Mobility Data in Cyber Physical Systems, link to Ph.D. dissertation
3.
Manish Gupta,
Ph.D., March 2013, thesis title: Outlier Detection
for Information Networks", link to Ph.D. dissertation
4.
Zhenhui Li, Ph.D., Sept. 2012, thesis title: Mining periodicity and object relationship in
spatial and temporal data", link to Ph.D. dissertation
Project Impact
§
Education: Parts of the new research
results are used in Data Mining courses (CS412, CS512) for both undergraduate
and graduate students being taught in the Department of Computer
Science, the
§
Collaborations: For this project we have established collaborations with Boeing, ARL,
NASA, HP Labs,
Current and Future Activities
The
following are some of the highlights of our ongoing work. Please refer to the section: Publications and
Products section for related references.
1. Study
object movement mining in the context of cyber-physical networks
2. Study
efficient methods for mining more sophisticated movement patterns than the
state-of-the-art
3. Study
methods for anomaly detection for moving objects in sensor network environment
Area Background
This project is based on the previous research on data mining, spatiotemporal
data analysis, and data cube and multidimensional analysis. There
have been many research papers published on these themes. Several
textbooks on data mining, information retrieval and information network
analysis provide good overviews of the principles and algorithms, including
(Han, Kamber, and Pei, Data Mining, 3rd ed.
2011), and (Miller and Han, Geographical
Data Mining and Knowledge Discovery, 2nd ed. 2009).
Area References
Potential
Related Projects
Project Web
site URL: http://www.cs.uiuc.edu/~hanj/projs/movemine.htm
Online software: Online
software can be downloaded at http://illimine.cs.uiuc.edu, and online
system demo is at http://dm.cs.uiuc.edu/movemine
Online resources: Research publications
related to this project can be downloaded at Selected
Publications