Wed. October 10, 2007 - Fri. October 12, 2007

Baltimore, MD 21202

NGDM '07

National Science Foundation Symposium on Next Generation
of Data Mining and Cyber-Enabled Discovery for Innovation


SCHEDULE
 

NGDM’07 Preliminary Schedule

Wednesday, Oct 10, 2007

 

7:00—8:00am

Breakfast, Registration

8:00—8:10am

Welcome Note: Hillol Kargupta, University of Maryland, Baltimore County

 

Session Title: Data Mining in e-Science and Engineering
Session Chair: Hillol Kargupta, University of Maryland, Baltimore County

8:10—8:35am

Jiawei Han, University of Illinois at Urbana-Champaign, Research Challenges for Data Mining in Science and Engineering [pdf] [slides]

8:35—9:00am

Saso Dzeroski, Jozef Stefan Institute, Inductive Databases and Queries for Computational Scientific Discovery [pdf] [slides]

9:00—9:25am

Chris Fischer, Kevin Tibbetts, Dane Morgan, Gerbrand Ceder, MIT and Wisconsin Madison, Machine Learning for the Computational Materials Scientist [pdf] [slides]

9:25—9:50am

Vipin Kumar, University of Minnesota, Discovery of Patterns in the Global Climate System using Data Mining [pdf] [slides]

9:50—10:05am                      

Coffee Break

  Session Chair: Philip Yu, IBM TJ Watson Research Center

10:05—10:30am

Haym Hirsh, National Science Foundation and Rutgers, From Data to Knowledge for Cyber-Enabled Discovery and Innovation [slides]

10:30—10:55am

Steven Salzberg, University of Maryland, College Park, Computational Gene Finding in the Human Genome: How many genes do we have? [pdf] [slides]

10:55—11:20am

Wei Wang, University of North Carolina Chapel-Hill, Data Mining for Enabling Genomic-Wide Computing [pdf] [slides]

11:20—11:45am

D. Quick and Margaret Dunham, Southern Methodist University, TCGR: A Novel DNA/RNA Visualization Technique [pdf] [slides]

11:45—12:10pm

Kirk Borne, George Mason University, A Machine Learning Classification Broker for Petascale Mining of Large-scale Astronomy Sky Survey Databases. [pdf] [slides]

12:10—12:35pm

Larry Hall and Kevin Bowyer, University of South Florida, Finding Lookmarks for Extreme-scale Simulation and Scientific Data [pdf] [slides]

12:35—1:50pm

Lunch in Harbor Club Room (Pier-5 Hotel), provided by the NGDM'07

1:50—2:35pm

Panel + Open Discussion Session:

Topic: Future Research Challenges and Needed Resources for Data Mining Research in e-Science and Engineering 

Panelists:  (10 + 10 = 20 Mins)
1) Pat Langley, Inst. for the Study of Learning & Expertise [slides]
2) Alok Choudhary, Northwestern University [slides]

Open Discussion: 25 mins

 

Session Title: The Web, Semantics, and Data Mining
Session Chair:
Doina Caragea, Kansas State University

2:35—3:00pm

Raghu Ramakrishnan, Yahoo Research and University of Wisconsin, Web Data Management: Powering the New Web [pdf] [slides]

3:00—3:25pm

Daniel J. Weitzner, MIT/W3C Information Accountability [pdf] [slides]

3:25—3:40pm

Coffee Break

3:40—4:05pm

Nitin Agarwal, Huan Liu, John Salerno, Philip Yu, Arizona State University, Air Force Research Laboratory, IBM T. J. Watson Research Laboratory,  Searching for "Familiar Strangers" on Blogosphere: Problems and Challenges [pdf] [slides]

4:05—4:30pm

Vasant Honavar and Doina Caragea, Iowa State University and Kansas State University, Towards Semantics-Enabled Infrastructure for Knowledge Acquisition of Distributed Data. [pdf] [slides]

4:30—4:55pm

Anupam Joshi, Tim Finin, Akshay Java, Anubhav Kale, and Pranam Kolari, University of Maryland, Baltimore County, Web 2.0 Mining: Analyzing Social Media [pdf] [slides]

4:55—5:20pm

Philip Yu, Xifeng Yan, Jiawei Han, Hong Cheng, Feida Zhu, IBM TJ Watson Center and University of Illinois at Urbana-Champaign, Approximate Frequent Pattern Mining [pdf] [slides]

5:20pm—6:05pm

Panel + Open Discussion Session:

Topic: Future Research Challenges and Needed Resources for “The Web, Semantics, and Data Mining” 

Panelists:  (10 + 10 = 20 Mins)
1) Tim Finin, UMBC [slides]
2)  Gary Strong, Johns Hopkins University [slides]

Open Discussion: 25 mins

6:15—9:00pm

Poster Session & Welcome Reception

6:15—6:20pm, Welcome note: Hillol Kargupta
6:20—6:35pm, Welcome note: Freeman Hrabowski, President, University of Maryland, Baltimore County
6:35—6:55pm, Brief talks (2 slides) by poster presenters
6:55—9:00pm, poster exhibition

 

Thursday, Oct 11, 2007

 

7:00—8:00am

Breakfast, Registration

8:00—8:10am

Announcements/Remarks

 

Session Title: Interfacing Data Mining with Social Science, Finance, Medicine, and Other Disciplines
Session Chair: Hillol Kargupta, University of Maryland, Baltimore County

8:10—8:35am

V. Hristidis, F. Farfan, R. Burke, A. Rossi and J. White, Florida International University, Miami Children’s Hospital and Teges Corporation Information Discovery on Electronics Medical Records. [pdf]

8:35—9:00am

David Covell, National Cancer Institute, Omics-based Discovery Strategies: Collection, Mining and Analysis [pdf] [slides]

9:00—9:25am

Olfa Nasraoui, University of Louisville, Market Based Decentralized Profile Infrastructure: Giving Back to the User [pdf] [slides]

9:25—9:50am

David Goldberg, University of Illinois at Urbana-Champaign, Three Lessons of Ancient and Modern Philosophy for Creative Human-Centered Computation [pdf] [slides]

9:50—10:05am                      

Coffee Break

  Session Chair: Ee-Peng Lim, Nanyang Technological University

10:05—10:30am

Andrew Kusiak, University of Iowa, Data Mining and Innovation Science [pdf] [slides]

10:30—10:55am

James Gentle, George Mason University, Challenges in computational finance and financial data analysis [pdf] [slides]

10:55—11:20am

Matt Kirschenbaum, University of Maryland, College Park, The Remaking of Reading: Data Mining and the Digital Humanities [pdf] [slides]

11:20—12:05am

Panel + Open Discussion Session:

Topic: Future Research Challenges and Needed Resources for Integrating Data Mining with Social Science, Finance, and Medicine”

Panelists:  (10 + 10 = 20 Mins)

1) Diane Lambert, Google Research
2) Henry Goldberg, Financial Industry Regulatory Authority (FINRA)  [slides]

Open Discussion: 25 mins

12:30—1:20pm

Lunch in Harbor Club Room (Pier-5 Hotel)
provided by the NGDM'07

1:20—2:10pm

Panel Discussion

Topic: Vision, Resources, and Opportunities for Next Generation of Data Mining and Cyber-Enabled Discovery for Innovation 

Panelists: (Incomplete)
1) H. K. Ramapriyan, NASA [slides]
2) Grace Yang, NSF Division of Mathematical Sciences
3)  Maria Zemankova, NSF Computer & Information Science & Engineering Directorate [slides]

 

Session Title: Data mining in Security, Surveillance, and Privacy Protection
Session Chair: Xintao Wu, University of North Carolina Charlotte

2:10—2:35pm

Alessandro Acquisti and Ralph Gross, Carnegie Mellon University, Privacy Risks from Mining Online Social Networks [pdf]

2:35—3:00pm

Chris Clifton, Wei Jiang, Mummoorthy Muruguesan, M. Ercan Nergiz, Purdue University, Is Privacy Still an Issue for Data Mining? [pdf] [slides]

3:00—3:25pm

Nishith Pathak, Sandeep Mane, Jaideep Srivastava, University of Minnesota, Extraction and Analysis of Cognitive Networks from Electronic Communication [pdf] [slides]

3:25—3:40pm

Coffee Break

  Session Chair: Shusaku Tsumoto, Shimane University, Faculty of Medicine

3:40—4:05pm

Shashi Shekhar, Latifur Khan, and Bhavani Thuraisingham, University of Minnesota and University of Texas at Dallas, Spatial and Spatial Temporal Data Mining: Accomplishments and Challenges [pdf] [slides:Part 1] [slides:Part 2]

4:05—4:30pm

Michael Berry, University of Tennessee, Automating the Detection of Anomalies and Trends from Text [pdf] [slides]

4:30—4:55pm

Lisa Singh, Georgetown University, Exploring graph mining approaches for dynamic heterogeneous networks. [pdf] [slides]

4:55—5:40pm

Panel + Open Discussion Session:

Topic: Future Research Challenges and Needed Resources for “Data mining in Security, Surveillance, and Privacy Protection” 

Panelists:  (10 + 10 = 20 Mins)
1) Ted Senator, SAIC [slides]
2) Joe Kielman, DHS

Open Discussion: 25 mins

5:40pm—6:05pm

Ian Foster, Argonne National Laboratory and University of Chicago, Grid and Data Mining: More Related Than You Might Think [pdf] [slides]

6:30—9:45pm

NGDM’07 Banquet (Lady Pintail Yacht Cruise)
6:30 boarding, Dinner and cultural program onboard Pintail.

 

Friday, Oct 12, 2007

 

7:00—8:00am

Breakfast, Registration

 

Session Title: Ubiquitous, Distributed, and High Performance Data Mining
Session Chair: Gautam Das, University of Texas, Arlington

8:00—8:25am

Xindong Wu, Jeffrey E. Stone, and Marc Greenblatt, University of Vermont, User Centered Biological Information Location by Combining User Profiles and Domain Knowledge [pdf] [slides]

8:25—8:50am

Michael May, Fraunhofer Institute for Intelligent Analysis, Research Challenges in Ubiquitous Knowledge Discovery [pdf] [slides]

8:50—9:15am

Marc Snir, University of Illinois at Urbana-Champaign, High Performance Computing and Data Mining [pdf] [slides]

9:15—9:40am

Hillol Kargupta, University of Maryland, Baltimore County and Agnik, Thoughts on Human Emotions, Communication Breakthroughs, and the Next Generation of Data Mining [pdf] [slides]

9:40—9:55am                        

Coffee Break

  Session Chair: Joao Gama, University of Porto, Portugal

9:55—10:20am

Domenico Talia and Paolo Trunfio, University of Calabria. How Distributed Data Mining Tasks can Thrive as services on Grids [pdf] [slides]

10:20—10:45am

Robert Grossman, University of Illinois at Chicago, Distributed Discovery in E-Science: Lessons from the Angle Project [pdf] [slides]

10:45—11:10am

Srinivasan Parthasarathy, Ohio State University,  Architecture Conscious Data Mining [pdf] [slides]

11:10—11:35am

Christos Faloutsos, Carnegie Mellon University, Large Graph Mining [pdf] [slides]

11:35—12:25pm

Panel + Open Discussion Session

Topic: Future Research Challenges and Needed Resources for “Ubiquitous, Distributed, and High Performance Data Mining”  

Panelists:  (10 + 10 = 20 Mins)
1) Peggy Agouris, George Mason University [slides]
2) Hillol Kargupta, University of Maryland, Baltimore County 

Open Discussion: 20 mins

12:25pm

Closing Remarks. NGDM’07 Symposium Ends

 

POSTERS (Brief Talk and Exhibition on Wednesday, October 10, 6pm):

1.      Jeffrey Campbell, Karyn Molines and Christopher W. Swarth. Data Mining for Ecological Field Research: Lessons Learned from Amphibian and Reptile Activity Analysis. [pdf]

2.      N. Balac, Olschanowsky, H. Karimabadi, T. Sipes, S. Ferenci, R. Chandran, R. Fujimoto and A. Roberts. Distributed Data Mining System with Gateway for Virtual Observatories. [pdf]

3.      Madhu Ahluwalia, Zhiyuan Chen, Aryya Gangopadhyay and Zhiling Guo. Preserving Privacy in Supply Chain Management: A Challenge for Next Generation Data Mining. [pdf]

4.      Hector Jasso, William Hodgkiss, Chaitan Baru, Tony Fountain, Don Reich, Kurt Warner. Spatio-temporal Characteristics of 911 Emergency Call Hotspots. [pdf]

5.      Shusaku Tsumoto and Shoji Hirano. Data Mining for Risk Management in Hospital Information Systems. [pdf]

6.      Louiqa Raschid, Padmini Srinivasan and Woei-Jyh Lee. A Framework for Discovering Associations from the Annotated Biological Web. [pdf]

7.      Jiang Tang, Ruixin Yang, Daniel Barbara and Menas Kafatos. Mining Conditions in Rapid Intensifications of Tropical Cyclones. [pdf]

8.      Mario Boley. Intelligent Pattern Mining via Quick Parameter Evaluation. [pdf]

9.      Li Yang and Mustafa Sanver. Multiresolution Data Aggregation and Analytical Exploration of Large Data Sets. [pdf]

10.  Joao Gama. Issues and Challenges in Learning from Data Streams. [pdf]

11.  C. Lee Giles, Prasenjit Mitra, Karl Mueller, James Z. Wang, Bingjun Sun, Levent Bolelli, Xiaonan Lu, Ying Liu, Isaac Councill, William Brower, Qingzhao Tan, Anuj Jaiswal, James Kubicki, Barbara Garrison and Joel Bandstra. ChemXSeer: An eChemistry Web Search Engine and Repository. [pdf]

12.  Noriko Imafuji Yasui, Shunsuke Saruwatari, Xavier Llora and David E. Goldberg. Message Feature Map toward Effective Facilitation on On-line Discussions. [pdf]

13.  David Lo and Siau-Cheng Khoo. Software Specification Discovery: A New Data Mining Approach. [pdf]

14.  Li Xiong, Pawel Jurczyk and Ling Liu. Mining Distributed Private Databases using Random Response Protocols. [pdf]

15.  Saurav Sahay, Eugene Agichtein, Baoli Li, Ernest V. Garcia and Ashwin Ram. Semantic Annotation and Inference for Medical Knowledge Discovery. [pdf]

16.  Mark Meiss, Filippo Menczer and Alessandro Vespignani. A Framework for Analysis of Anonymized Network Flow Data. [pdf]

17.  Hady Lauw and Ee-Peng Lim. A Multitude of Opinions: Mining Online Rating Data. [pdf]

18. Rodney Martin. Investigation of Optimal Alarm System Performance for Anomaly Detection. [pdf]

19. Kamalika Das, Kun Liu and Hillol Kargupta. A Game Theoretic Perspective Toward Practical Privacy Preserving Data Mining. [pdf]

20. Murat Kantarcioglu, Bowei Xi and Chris Clifton. A Game Theoretical Framework for Adversarial Learning. [pdf]