Graduate
PhD proposal: A Semantic Resolution Framework for Manufacturing Capability Data Integration

Ph.D. Dissertation Proposal

A Semantic Resolution Framework for
Manufacturing Capability Data Integration

10:30am Tuesday, May 14, 2013, ITE 346, UMBC

Yan Kang

Building flexible manufacturing supply chains requires interoperable and accurate manufacturing service capability (MSC) information of all supply chain participants. Today, MSC information, which is typically published either on the supplier’s web site or registered at an e-marketplace portal, has been shown to fall short of the interoperability and accuracy requirements. This issue can be addressed by annotating the MSC information using shared ontologies. However, ontology-based approaches face two main challenges: 1) lack of an effective way to transform a large amount of complex MSC information hidden in the web sites of manufacturers into a representation of shared semantics and 2) difficulties in the adoption of ontology-based approaches by the supply chain managers and users because of their unfamiliar of the syntax and semantics of formal ontology languages such as OWL and RDF and the lack of tools friendly for inexperienced users.

The objective of our research is to address the main challenges of ontology-based approaches by developing an innovative approach that can effectively extract a large volume of manufacturing capability instance data, accurately annotate these instance data with semantics and integrate these data under a formal manufacturing domain ontology. To achieve the objective, a Semantic Resolution Framework is proposed to guides every step of the manufacturing capability data integration process and to resolve semantic heterogeneity with minimal human supervision. The key innovations of this framework includes 1) three assisting systems, including a Triple Store Extractor, a Triple Store to Ontology Mapper and a Ontology-based Extensible Dynamic Form, that can efficiently and effectively perform the automatic processes of extracting, annotating and integrating manufacturing capability data.; 2) a Semantic Resolution Knowledge Base (SR-KB) that incrementally filled with, among other things, rules/patterns learned from errors. This SR-KB together with an Upper Manufacturing Domain Ontology (UMO) provide knowledge for resolving semantic differences in the integration process; 3) an evolution mechanism that enables SR-KB to continuously improve itself and gradually reduce the human involvement by learning from mistakes.

Committee: Yun Peng (chair), Charles Nicholas, Tim Finin, Yaacov Yesha, Boonserm Kulvatunyou (NIST)

UMBC’s 2013 summer cybersecurity courses

The UMBC Cybersecurity Masters in Professional Studies (MPS) program will offer the following courses over the Summer 2013 session:

  • CYBR 620: Introduction to Cybersecurity
  • CYBR 621: Cyber Warfare
  • CYBR 691: Special Topics in Cybersecurity: Application Security Principles/Practices

Each class will meet one or two days a week in the late afternoon or evening, depending on the length of the session where the course is offered.

For those living in Washington, D.C., Northern Virginia, Frederick, MD, and points west, UMBC's Cybersecurity MPS will launch at the Universities at Shady Grove (USG) in Fall 2013.  Courses offered the first semester at that campus will be:

  • CYBR 620: Introduction to Cybersecurity
  • CYBR 623: Cybersecurity Law & Policy

The deadline to apply for Fall 2013 admission to the UMBC Graduate Cybersecurity Program is August 1, 2013.

Omar Shehab (CS Ph.D) awarded NSF travel grants for upcoming conferences

shehab-front-face

Congratulations to Omar Shehab (CS Ph.D.), who has been awarded two NSF travel grants to attend research conferences this June.

First, Omar has received an NSF travel grant to attend the IEEE Conference on Computational Complexity. The conference celebrates research in all areas of computation complexity theory, taking a look at the absolute and relative power of computational models under resource constraints. Specific topics include probalistic and interactive proof systems, proof complexity, and descriptive complexity. The conference will be held in Palo Alto California, June 5-7.

Omar has also received an NSF travel grant to attend the 45th ACM Symposium on the Theory of Computing (STOC 2013). Here, he will be presenting a poster entited: "Hamiltonian complexity of Trefoil knot transformations." The conference is sponsored by the ACM Special Interest Group on Algorithms and Computation Theory (SIGACT). It will explore original research on the theory of computation. The conference will be held in Palo Alto California, June 1-4.

Omar started UMBC’s Computer Science Ph.D. program in 2010. He is currently pursuing research under the supervision of Dr. Samuel J. Lomonaco Jr. Omar’s doctoral work exlpores adiabatic quantum Hamiltonian complexity, quantum computational simulation of topology and use of quantum optics to understand device independent cryptography. He is currently a Teaching Assistant for CMSC 641: Design and Analysis of Algorithms.

MS defense: Modeling Individual Nodes in Dynamic Link Prediction

MS Defense

Modeling Individual Nodes In Dynamic Link Prediction

Maksym Morawski

2:00pm Thursday, 25 April 2013, ITE325b, UMBC

The question of how to predict which links will form in a graph, given the graph’s history, is an open research problem in computer science. There are many different approaches to the link prediction problem, one of which involves building a set of features for pairs of nodes and using supervised learning to build a model that predicts when these pairs of nodes will link. Typically, this model is learned over the entire graph. In this thesis, I investigate building this model over each individual node in an attempt to learn the particular ways in which that node behaves before making predictions about it. In addition, research into link prediction to date lacks intelligent ways of utilizing the graph over large timespans. To address this, I introduce a variety of ways to include temporality into the link prediction process by introducing new ways of using existing features.

Committee: Dr. Marie desJardins (Chair), Dr. Tim Oates, Dr. Tim Finin

MS Defense: Text and Ontology Driven Clinical Decision Support System

MS Thesis Defense

Text and Ontology Driven
Clinical Decision Support System

Deepal Dhariwal

9:00am Tuesday 23 April 2013, ITE325b, UMBC

This thesis discusses our ongoing research in the domain of text and ontology driven clinical decision support system. The proposed framework uses text analytics to extract clinical entities from electronic health records and semantic web analytics to generate a domain specific knowledge base (KB) of patients’ clinical facts. Clinical Rules expressed in the Semantic Web Language OWL are used to reason over the KB to infer additional facts about the patient. The KB is then queried to provide clinically relevant information to the physicians. In the first phase, standard text pre processing techniques such as section tagging, dependency parsing, gazetteer lists are used filter clinical terms from the raw data.

In the second phase, a domain specific medical ontology is used to establish relation between the extracted clinical terms. The output of this phase is a Resource Description Framework KB that stores all possible medical facts about the patient. In the final phase, an OWL reasoner and clinical rules are used to infer additional facts about patient and generate a richer KB. This KB can then be queried for a variety of clinical tasks. To demonstrate a proof of concept of this framework, we have used discharge summaries from the cardiovascular domain and determined the TIMI Risk Score and San Francisco Syncope Score for a patient. The goal of this research is to combine factual knowledge about patients, procedural knowledge (clinical rules), and structured knowledge (medical ontologies) to develop a clinical decision support system.

Committee: Dr. Anupam Joshi (chair), Dr. Michael Grasso, Dr. Tim Finin, Dr. Yelena Yesha

UMBC Cybersecurity MPS program now in Shady Grove

We are now offering the UMBC Cybersecurity MPS program at Shady Grove in Montgomery County, MD.

The Cybersecurity Master’s in Professional Studies degree provides students the essential knowledge required to serve in leadership and operational roles throughout the industry. Through the program, students will learn how to analyze cybersecurity risks and assess available countermeasures. The program will expose students to practical managerial and operational considerations needed to conduct cybersecurity activities for large organizations.

PhD defense: Analysis of brain network connectivity using spatial information

PhD Dissertation Defense

Analysis of brain network connectivity
using spatial information

Sai Ma

1:00pm Thursday, 18 April 2013, ITE 325b

In current functional magnetic resonance imaging (fMRI) research, one of the most active areas involves exploring statistical dependencies among brain regions, known as functional connectivity analysis. Data-driven methods, especially independent component analysis (ICA), have been successfully applied to fMRI data to extract distributed brain networks and offer an opportunity to investigate functional connectivity on a network level, thus at a multivariate level. However, the independence assumption in ICA is neither necessarily nor typically satisfied in real applications and an extension is desirable. Furthermore, most current ICA-based studies focus on the use of temporal information and second-order statistics for functional connectivity analysis. Taking spatial information and higher-order statistics in fMRI data into account is expected to lead to better understanding of the overall brain network connectivity in healthy controls and also in patients with mental disorders, such as schizophrenia.

We develop a dependent component analysis (DCA) framework to generalize the ICA-based connectivity analysis methods by grouping components into maximally independent clusters. First, we define functional network connectivity as the statistical dependence among spatial components, instead of the typically used temporal correlation. Based on this definition, we use a hypothesis test to automatically generate functional connectivity structure for a large number of brain networks. After that, we separate dependent components within a given cluster using prior information, such as sparsity and experimental paradigm information, to achieve a better decomposition. We also combine this DCA-based clustering analysis with graph-theoretical analysis to discover significant group differences in topological properties of functional connectivity structure. To extend the methodologies currently available for functional connectivity, we propose an independent vector analysis (IVA) based scheme to extract and analyze dynamic functional connectivity.

The methods we develop offer advantages for effective and efficient examination of not only static, but also dynamic functional connectivity among different brain networks. We identify significant differences in functional connectivity structure between healthy controls and patients with schizophrenia, which may prove useful to serve as potential biomarkers for diagnosis. We also find task-induced modulations in functional connectivity when comparing different active states in the brain. Furthermore, we observe temporal variability in functional connectivity structure and physiologically meaningful group differences in dynamic connectivity among several brain networks. Our methods can provide insights to understanding of functional characteristics of the brain network organization in healthy individuals and patients with schizophrenia.

Committee: Dr. Adali (Chair), Dr. Morris, Dr. Rutledge, Dr. LaBerge, Dr. Phlypo, Dr. Calhoun, and Dr. Westlake

PhD defense: Data-driven group analysis of complex-valued fMRI data

image_sixhund

PhD Dissertation Defense

Data-driven group analysis of complex-valued fMRI data

Pedro A. Rodriguez

11:00am Tuesday, 16 April 2013, ITE 346, UMBC

Analysis of functional magnetic resonance imaging (fMRI) data in its native, complex form has been shown to increase the sensitivity of the analysis both for data-driven techniques such as independent component analysis (ICA) and for model-driven techniques. The promise of an increase in sensitivity and specificity in clinical studies provides a powerful motivation for utilizing both the phase and magnitude data; however, the unknown and noisy nature of the phase poses a challenge for successful study of the fMRI data. In addition, complex-valued analysis algorithms, such as ICA, suffer from an inherent phase ambiguity, which introduces additional difficulty for group analysis and visualization of the results. We present solutions for these issues, which have been among the main reasons phase information has been traditionally discarded, and show their effectiveness when used as part of a complex-valued group ICA algorithm application. The developed methods become key components of a framework that allows the development of new fully complex data-driven and semi-blind methods to process, analyze, and visualize fMRI data.

In this dissertation, we first introduce the methods developed as part of the fully complex framework for ICA of fMRI data. We introduce a physiologically motivated de-noising method that uses phase quality maps to successfully identify and eliminate noisy voxels—3D pixels—in the fMRI complex images so they can be used in individual and group studies. We also introduce a phase correction scheme that can be either applied sub-sequent to ICA of fMRI data or can be incorporated into the ICA algorithm in the form of prior information to eliminate the need for further processing for phase correction. Finally, we present two visualization methods that are used to augment the sensitivity and specificity in the detection of activated voxels. We show the benefits of using the developed methods on actual complex-valued fMRI data.

In the remainder of the dissertation, we focus on developing constrained ICA (C-ICA) algorithms for complex-valued fMRI data. C-ICA uses prior information, hence providing a balance between model-based and data-driven approaches such as ICA to improve the source estimation performance and robustness to noise. C-ICA algorithms have been used to improve the estimation performance in real-valued fMRI data, but—to our knowledge—have not been applied to complex-valued fMRI data. We develop the first C-ICA algorithm that uses complex-valued references to constrain either the sources or the mixing coefficients. The designed algorithm is not restricted to having a unitary demixing matrix, which is a major assumption in existing C-ICA algorithms. We show, on both simulated and actual fMRI data, how the performance of ICA improves by using prior information about the fMRI paradigm.

Committee: Dr. Adali (Chair), Dr. Morris, Dr. Rutledge, Dr. Laberge, Dr. Phlypo, and Dr. Calhoun

PhD defense: Independent Vector Analysis: Theory, Algorithms and Applications, 4/17

datafusion

PhD Dissertation Defense

Independent Vector Analysis:
Theory, Algorithms, and Applications

Matthew Anderson

1:45pm Wednesday, 17 April 2013, ITE 325B

The field of blind source separation (BSS) is a well studied discipline within the signal processing community due to its applicability to a variety of problems when the data observation model is poorly known or difficult to model. For example, in the study of the human brain with functional magnetic resonance imaging (fMRI), a neuroimaging sensor, BSS algorithms are able to provide medical researchers and practitioners with a decomposition of a three-dimensional ‘movie’ of the brain that is amenable to analysis. BSS algorithms achieve this decomposition with only a few justifiable assumptions; this is contrary to methods based on the general linear model, which require prespecified models of the expected or desired response to achieve analysis of fMRI data.

Most BSS algorithms consider just a single dataset, but it also desirable to have methods that can analyze multiple subjects or data collections in fMRI jointly, so as to provide insights beyond that achieved with individual analysis of single datasets. Several frameworks for using BSS on multiple datasets jointly have been proposed. The subject of this dissertation is the study of one of these frameworks, which has been termed independent vector analysis (IVA). IVA is a recent extension of the classical independent component analysis (ICA) model to BSS of multiple datasets and it has been the subject of significant research interest. In this dissertation, we provide a formulation of IVA that accounts for sources which possess properties such as a) following Gaussian or non-Gaussian distributions; b) samples are independently and identically distributed (iid) or are dependent; and c) having either linear or nonlinear dependence of sources between datasets. The proposed IVA formulation utilizes the likelihood to define the objective function. This formulation admits to theoretical analysis. In particular, we provide the identification conditions, i.e., we determine when the sources can be ‘blindly’ recovered by IVA, and give a lower bound on the source separation performance.

Several algorithms exist for achieving IVA. We provide several new approaches to developing IVA algorithms and apply these approaches using a Gaussian distribution source model and a more general Kotz distribution model. The former, in addition to leading to efficient IVA algorithms, serves as the distribution model that directly connects canonical correlation analysis (CCA) and ICA.  

Committee: Dr. Tulay Adali (Chair), Dr. Joel Morris, Dr. Aninyda Roy, Dr. Ronald Phlypo, and Dr. Mike Novey

PhD defense: Digital Forensics for Infrastructure-as-a-Service Cloud Computing

Dissertation Defense

Digital Forensics for
Infrastructure-as-a-Service Cloud Computing

Josiah Dykstra

10:00am Tuesday, 16 April 2013, ITE 325b

We identify important issues in the application of digital forensics to Infrastructure-as-a-Service cloud computing and develop new practical forensic tools and techniques to facilitate forensic exams of the cloud. When investigating suspected cases involving cloud computing, forensic examiners have been poorly equipped to deal with the technical and legal challenges. Because data in the cloud are remote, distributed, and elastic, these challenges include understanding the cloud environment, acquiring and analyzing data remotely, and applying the law to a new domain. Today digital forensics for cloud computing is challenging at best, but can be performed in a manner consistent with federal law using the tools and techniques we developed.

The first problem is understanding how and why criminal and civil actions in and against cloud computing are unique and difficult to prosecute. We analyze a digital forensic investigation of crime in the cloud, and present two hypothetical case studies that illustrate the unique challenges of acquisition, chain of custody, trust, and forensic integrity. Understanding these issues introduces legal challenges which are also important for federal, state, and local law enforcement who will soon be called upon to conduct cloud investigations.

The second problem is the lack of practical technical tools to conduct cloud forensics. We examine the capabilities for forensics today, evaluate the use of existing tools including EnCase and FTK, and discuss why these tools are incapable of trustworthy cloud acquisition. We design consumer-driven forensic capabilities for OpenStack, including new features for acquiring trustworthy firewall logs, API logs, and disk images.

The third problem is a deficit of legal instruments for seizing cloud-based electronically-stored information. We analyze the application of existing policies and laws to the new domain of cloud computing by analyzing case law and legal opinions about digital evidence discovery, and suggest modifications that would enhance cloud the prosecution of cloud-based crimes. We offer guidance about how to author a search warrant for cloud data, and what pertinent data to request.

This dissertation enhances our understanding of technical, trust, and legal issues needed to investigate cloud-based crimes and offers new tools and techniques to facilitate such investigations.

Committee: Dr. Alan T. Sherman (Chair), Dr. Charles Nicholas, Dr. Richard Forno, Dr. Simson Garfinkel (Naval Postgraduate School), Mr. Donald Flynn, JD (Department of Defense Cyber Crime Center)