ÿþ<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"> <!-- saved from url=(0036)http://chenwsdb.fulton.ad.asu.edu/xseek/home.htm --> <!-- saved from url=(0038)http://chenwsdb.fulton.ad.asu.edu/smartflow/home.htm --><!-- saved from url=(0028)http://extract.asu.edu/home/ --><HTML><HEAD><TITLE>XSEEK : Intelligent Search Engine for Semi-Structured Data</TITLE> <META content="text/html; charset=unicode" http-equiv=Content-Type> <META name=description content="A floating menu that stays put even when the page is scrolled."> <META name=keywords content="slide in menu, float menu"><LINK rel=stylesheet type=text/css href="XSEEK%20%20Intelligent%20Search%20Engine%20for%20Semi-Structured%20Data_files/mainstyle.css"><LINK rel=stylesheet type=text/css href="XSEEK%20%20Intelligent%20Search%20Engine%20for%20Semi-Structured%20Data_files/styles.css" media=screen><LINK rel=stylesheet type=text/css href="XSEEK%20%20Intelligent%20Search%20Engine%20for%20Semi-Structured%20Data_files/print.css" media=print><LINK rel=stylesheet type=text/css href="XSEEK%20%20Intelligent%20Search%20Engine%20for%20Semi-Structured%20Data_files/widget-02.css" media=all> <META name=GENERATOR content="MSHTML 9.00.8112.16437"> <STYLE type=text/css>.STYLE1 { FONT-SIZE: 18px } .STYLE2 { COLOR: #000000; FONT-SIZE: 120%; FONT-WEIGHT: bold } .STYLE4 { FONT-SIZE: 100% } .STYLE5 { COLOR: #999999 } .STYLE6 { COLOR: #333333 } .STYLE7 { COLOR: #666666 } .STYLE8 { COLOR: #99cc66 } .STYLE10 { COLOR: #ff9900 } .STYLE11 { COLOR: #ffcc33 } .STYLE12 { COLOR: #000000 } .STYLE13 { FONT-WEIGHT: bold } .STYLE14 { FONT-SIZE: 100%; FONT-WEIGHT: bold } </STYLE> </HEAD> <BODY> <DIV id=top class=hide name="top"> <TABLE style="MARGIN-LEFT: 200px; MARGIN-RIGHT: 200px" cellSpacing=0 cellPadding=0> <TBODY> <TR width="100%"> <TD width=800 align=left><A href="http://www.asu.edu/" target=blank><IMG border=0 src="XSEEK%20%20Intelligent%20Search%20Engine%20for%20Semi-Structured%20Data_files/asu.png" height=50></A></TD> <TD width=500 align=center><IMG border=0 src="XSEEK%20%20Intelligent%20Search%20Engine%20for%20Semi-Structured%20Data_files/xseek2.png" height=80></TD> <TD width=1000 align=right></TD></TR></TBODY></TABLE> <P></P><!--<TABLE style="MARGIN-LEFT: 200px; MARGIN-RIGHT: 200px" cellSpacing=0 cellPadding=0> <TBODY> <TR width="100%"> <TD align=center width="33%"> <DIV><A href="http://www.asu.edu/"><IMG border=0 src="wise_files/asu1.jpg" height=50></A></DIV></TD> <TD align=center width="70%"> <DIV align=center> <H1 class=STYLE1> <H1><IMG border=0 src="wise_files/xseek2.png" height=200></H1><SPAN class=STYLE7> <H1></H1></H1></DIV></SPAN></TD> <TD align=center width="100%"> <DIV><A href="http://chenwsdb.fulton.ad.asu.edu/xseek/home.htm"><IMG border=0 src="wise_files/nsf1.png" height=70></A></DIV></TD></TR></TBODY></TABLE> --></DIV><FONT size=2 face=tahoma> <DIV style="BACKGROUND-COLOR: #f4f4f4; MARGIN-LEFT: 200px; MARGIN-RIGHT: 200px" id=content> <DIV> <TABLE> <TBODY> <TR> <TD> <DIV id=tblock class=titleblock name="tblock"> <H2>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; XSEEK : <SPAN class=STYLE7>Intelligent Search</SPAN> <SPAN class=STYLE8>Engine for</SPAN> <SPAN class=STYLE11>Semi-Structured</SPAN> <SPAN class=STYLE10>Data</SPAN></H2> <H4>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Developed by <A href="http://chenwsdb.fulton.ad.asu.edu:88/"><U><FONT color=blue>WSDB</FONT></U></A>&nbsp;@&nbsp;ASU</H4></DIV></TD></TR></TBODY></TABLE></DIV> <DIV style="BORDER-BOTTOM: double; BORDER-LEFT: double; BACKGROUND-COLOR: #ffffff; BORDER-TOP: double; BORDER-RIGHT: double" id=divOverview name="divOverview"> <DIV style="BACKGROUND-COLOR: #f75d59"> <H2><A id=motivation class=STYLE6 name=motivation>Motivation</A></H2></DIV> <P>Information search is an indispensable component of our lives. Web search engines, such as Google, Yahoo! and Bing, are widely used for searching textual documents, images, and video. However, there are also vast collections of structured and semi-structured data both on the Web and in enterprises, such as relational databases, XML data, etc. </SPAN>The classical way of accessing these data sources is through issuing structured queries, such as SQL/XPath/XQuery. However, this demands users to learn these query languages and comprehend the possibly complex and fast-evolving data schema, which is inconvenient or impossible for users in many applications. <SPAN>To relieve web and scientific users from the learning curve and enable them to easily access (semi-)structured and semi-structured data, </SPAN>supporting keyword search on such data is highly desirable.</P></DIV> <P>&nbsp;</P> <DIV style="BORDER-BOTTOM: double; BORDER-LEFT: double; BACKGROUND-COLOR: #ffffff; BORDER-TOP: double; BORDER-RIGHT: double" id=divOverview name="divOverview"> <DIV style="BACKGROUND-COLOR: #f88017"> <H2><A id=approach class=STYLE6 name=approach>Approach</A></H2></DIV> <P>We have been identifying a spectrum of problem space in the domain of supporting keyword search on semi-structured data, ranging from evaluation framework of various search strategies, generating high-quality results, to helping users to analyze results.</P> <P>We have been developing techniques to address these problems for users to achieve better search quality and enhanced search experience than searching unstructured data (e.g. web pages), by exploiting the rich meta-information embedded in (semi-) structured data, as outlined below.&nbsp;</P> <UL> <LI> <P>Initiating an axiomatic framework to evaluate the quality of XML search engines (<A href="http://xseek.asu.edu/maxmatch.pdf"><FONT color=blue>pdf</FONT></A>).&nbsp; Using axioms for quality evaluation is a cost-effective and easy to use complement to the traditional empirical evaluation through user-study which often involves a long time and extensive work from domain experts and users.</P> <LI> <P>Identifying explicit relevant nodes (among all nodes that match keywords) (<A href="http://xseek.asu.edu/maxmatch.pdf"><FONT color=blue>pdf</FONT></A>) and implicit relevant nodes (among all nodes that do not match keywords) (<A href="http://xseek.asu.edu/xseek.pdf"><FONT color=blue>pdf</FONT></A>) for keyword search on XML. This is the first work on identifying implicit relevant nodes in structured data search.</P> <LI> <P>Composing ranking-friendly results from explicit and implicit relevant nodes for keyword search on XML (<A href="http://xseek.asu.edu/target.pdf"><FONT color=blue>pdf</FONT></A>). We propose a result composition approach driven by inferred user search target, based on the observation that each result should have exactly one target instance along with all associated evidence, so that ranking and top-k query processing can be meaningfully based on target instances. </P> <LI> <P>Initiating the first study of generating query-biased snippets for XML search (<A href="http://xseek.asu.edu/snippets.pdf"><FONT color=blue>pdf</FONT></A>). To compensate the inaccuracy of ranking functions, snippets are highly helpful for users to quickly get the essence of a result and to retrieve relevant results.</P> <LI> <P>Initiating the first study of differentiating search results on structured (relational or XML) data (<A href="http://xseek.asu.edu/xred.pdf"><FONT color=blue>pdf</FONT></A>). Due to the absence of general tools that can effectively analyze and differentiate multiple results, a user has to manually read and comprehend potentially large results in an exploratory search. We proposed an efficient and effective way that helps users for result comparison and differentiation. </P> <LI> <P>Initiating the first study of query-dependent result clustering on structured data. We propose algorithms for automatically generating a user-specified number of describable clusters given a set of query results (<A href="http://portal.acm.org/citation.cfm?id=1735886.1735889&amp;coll=ACM&amp;dl=ACM&amp;idx=J777&amp;part=transaction&amp;WantType=Transactions&amp;title=ACM%20Transactions%20on%20Database%20Systems%20(TODS)&amp;CFID=96264689&amp;CFTOKEN=88882853"><FONT color=blue>link</FONT></A>), as well as studying the effectiveness and efficiency of clustering query results using their snippets (<A href="http://tods.acm.org/Upcoming.html"><FONT color=blue>link</FONT></A>). </P> <LI> <P>Initiating the first study of processing keyword searches on workflow hierarchies (to appear in VLDB 2010). We define an informative, self-contained and concise search result on workflows to be a projection of a workflow hierarchy on a two dimensional viewing plane inferred from user queries. We then design and develop an efficient keyword search engine for workflows. Experimental evaluation demonstrates the effectiveness of our approach.</P> <LI> <P>Initiating the first study of answering keyword queries on XML data using materialized views (<A href="http://xseek.asu.edu/view.pdf"><FONT color=blue>pdf</FONT></A>). Semantic caching, or generally, materialized views, have been proved successful for performance optimization in evaluating structured queries on XML and databases. We investigated the feasibility and present a general framework for answering XML keyword searches using materialized views. </P></LI></UL></DIV> <P>&nbsp;</P> <DIV style="BORDER-BOTTOM: double; BORDER-LEFT: double; BACKGROUND-COLOR: #ffffff; BORDER-TOP: double; BORDER-RIGHT: double" id=divPublications name="divPublications"> <DIV style="BACKGROUND-COLOR: #fffc17"> <H2><A id=publication class=STYLE6 name=publication>Publications</A></H2></DIV> <UL>Ziyang Liu, and Yi Chen. Differentiating Search Results on Structured Data , TODS, 2012. <HR> Ziyang Liu, and Yi Chen: <A href="http://www.springerlink.com/content/h616036m21k51417/"><FONT color=blue> Processing Keyword Search on XML: a Survey</FONT></A> World Wide Web Journal 14(5-6), (2011) <HR> Brian Ackerman, and Yi Chen: <A href="http://www.public.asu.edu/~ychen127/ucersti11.pdf"><FONT color=blue> Evaluating Rank Accuracy based on Incomplete Pairwise Preferences.</FONT></A> Workshop of User-Centric Evaluation of Recommender Systems and Their Interfaces (UCERSTI), 2011 <HR> Ziyang Liu, Sivaramakrishnan Natarajan, and Yi Chen: <A href="http://www.public.asu.edu/~ychen127/pvldb11.pdf"><FONT color=blue> Query Expansion Based on Clustered Results</FONT></A> PVLDB 4(6), 2011 <HR> Ziyang Liu, Qihong Shao, Yi Chen: <A href="http://www.public.asu.edu/~ychen127/pvldb10_wise.pdf"><FONT color=blue> WISE: Searching Workflow Hierarchies.</FONT></A>To Appear in VLDB 2010. <HR> Ziyang Liu, Yu Huang, Yi Chen: <A href="http://tods.acm.org/accepted/2010/LiuImproving.pdf"><FONT color=blue>Improving XML Search by Generating and Utilizing Informative Result Snippets. </FONT></A>ACM Trans. Database Syst. 35(3): (2010) <HR> Ziyang Liu, Yi Chen: <A href="http://dl.acm.org/citation.cfm?doid=1735886.1735889"><FONT color=blue>Return specification inference and result clustering for keyword search on XML. </FONT></A>ACM Trans. Database Syst. 35(2): (2010) <HR> Ziyang Liu, SivaramaKrishnan Natarajan, Stephen Booher, Tim Meehan, Robert Winkler, Yi Chen: <A href="http://www.public.asu.edu/~ychen127/pvldb10_xsact.pdf"><FONT color=blue>XSACT: A Structured Search Result Comparison Tool. </FONT></A>To Appear in VLDB 2010. <HR> Ziyang Liu, Yi Chen: <A href="http://chenwsdb.fulton.ad.asu.edu/xsact/invited2.pdf"><FONT color=blue>Query Results Ready, Now What?</FONT></A> IEEE Data Eng. Bull. 33(1): 46-53 (2010) <HR> Ziyang Liu, Yichuan Cai, Yi Chen: <A href="http://xseek.asu.edu/target.pdf"><FONT color=blue>TargetSearch: A Ranking Friendly XML Keyword Search Engine</FONT></A>. ICDE 2010 <HR> Ziyang Liu, Peng Sun, Yu Huang, Yichuan Cai, Yi Chen: <A href="http://xseek.asu.edu/invited.pdf"><FONT color=blue>Challenges, Techniques and Directions in Building XSeek: an XML Search Engine</FONT></A>. IEEE Data Eng. Bull. 32(2): 36-43 (2009) <HR> Ziyang Liu, Peng Sun, Yi Chen: <A href="http://xseek.asu.edu/xred.pdf"><FONT color=blue>Structured Search Result Differentiation</FONT></A>. PVLDB 2(1): 313-324 (2009) <HR> Qihong Shao, Peng Sun, Yi Chen: <A href="http://chenwsdb.fulton.ad.asu.edu/xsact/wisedemo.pdf"><FONT color=blue>WISE: A Workflow Information Search Engine</FONT></A>. ICDE 2009: 1491-1494 <HR> Ziyang Liu, Yi Chen: <A href="http://xseek.asu.edu/view.pdf"><FONT color=blue>Answering Keyword Queries on XML Using Materialized Views</FONT></A>. ICDE 2008: 1501-1503 <HR> Yu Huang, Ziyang Liu, Yi Chen: <A href="http://xseek.asu.edu/snippets.pdf"><FONT color=blue>Query Biased Snippet Generation in XML Search</FONT></A>. SIGMOD Conference 2008: 315-326 <HR> Ziyang Liu, Yi Chen: <A href="http://xseek.asu.edu/maxmatch.pdf"><FONT color=blue>Reasoning and Identifying Relevant Matches for XML Keyword Search</FONT></A>. PVLDB 1(1): 921-932 (2008) <HR> Yu Huang, Ziyang Liu, Yi Chen: <A href="http://xseek.asu.edu/snippetsdemo.pdf"><FONT color=blue>eXtract: A Snippet Generation System for XML Search</FONT></A>. PVLDB 1(2): 1392-1395 (2008). VLDB 2007: 1330-1333 <HR> Ziyang Liu, Yi Chen: <A href="http://xseek.asu.edu/xseek.pdf"><FONT color=blue>Identifying Meaningful Return Information for XML Keyword Search</FONT></A>. SIGMOD Conference 2007: 329-340 <HR> Ziyang Liu, Jeffrey Walker, Yi Chen: <A href="http://xseek.asu.edu/xseekdemo.pdf"><FONT color=blue>XSeek: A Semantic XML Search Engine Using Keywords</FONT></A>. VLDB 2007: 1330-1333 </UL></DIV> <P>&nbsp;</P> <DIV style="BORDER-BOTTOM: double; BORDER-LEFT: double; BACKGROUND-COLOR: #ffffff; BORDER-TOP: double; BORDER-RIGHT: double" id=divPublications name="divPublications"> <DIV style="BACKGROUND-COLOR: #6afb92"> <H2><A id=tutorial class=STYLE6 name=tutorial>Tutorial</A></H2></DIV> <UL>Yi Chen, Wei Wang, and Ziyang Liu.: <A href="http://www.public.asu.edu/~ychen127/icde11.pdf"><FONT color=blue> Keyword-based Search and Exploration on Databases</FONT></A>. Slides<A href="http://www.public.asu.edu/~ychen127/icde11_tutorial.pptx"><FONT color=blue></FONT></A> ICDE Conference 2011 </UL> <UL>Yi Chen, Wei Wang, and Ziyang Liu.: <A href="http://www.public.asu.edu/~ychen127/dasfaa11.pdf"><FONT color=blue> Searching, Analyzing and Exploring Databases</FONT></A>. DASFAA Conference 2011 </UL> <UL>Yi Chen, Wei Wawng, Ziyang Liu, Xuemin Lin: <A href="http://chenwsdb.fulton.ad.asu.edu/xsact/sigmod09tutorial.pdf"><FONT color=blue>Keyword Search on Structured and Semi-structured Data</FONT></A>. Slides<A href="http://www.public.asu.edu/~ychen127/keyword_sigmod09_tutorial.pptx"><FONT color=blue></FONT></A> SIGMOD Conference 2009: 1005-1010 </UL></DIV> <P>&nbsp;</P> <DIV style="BORDER-BOTTOM: double; BORDER-LEFT: double; BACKGROUND-COLOR: #ffffff; BORDER-TOP: double; BORDER-RIGHT: double" id=divPeople name="divPeople"> <DIV style="BACKGROUND-COLOR: #82caff"> <H2><A id=people class=STYLE6 name=people>People</A></H2></DIV> <P class=STYLE2><STRONG>Faculty</STRONG>:</P> <P><STRONG>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</STRONG><A href="http://www.public.asu.edu/~ychen127"><STRONG><FONT color=blue>Yi Chen</FONT></STRONG></A><STRONG> &lt; yi@asu.edu &gt;</STRONG></P> <P class=STYLE2> <STRONG>Students</STRONG>:</P> <P><STRONG>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Yi Shan &lt; yishan@asu.edu &gt;</STRONG></P> <P><STRONG>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Brian Ackerman &lt; bjackerm@asu.edu &gt;</STRONG></P> <P><STRONG>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Doug Stoeckmann &lt; Douglas.Stoeckmann@asu.edu &gt;</STRONG></P> <P class=STYLE2><STRONG>Alumni</STRONG>:</P> <P><STRONG>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Stephen Booher, Yichuan Cai, Ziyang Liu, Tim Meehan, SivaramaKrishnan Natarajan, Peng Sun, Jeffrey Walker </STRONG></P></DIV> <P>&nbsp;</P> <DIV style="BORDER-BOTTOM: double; BORDER-LEFT: double; BACKGROUND-COLOR: #ffffff; BORDER-TOP: double; BORDER-RIGHT: double" id=divDemo name="divDemo"> <DIV style="BACKGROUND-COLOR: #736aff"> <H2><A id=Acknowledgement class=STYLE6 name=acknowledgement>Acknowledgement</A></H2></DIV> <P><A href="http://www.nsf.gov/" target=blank><IMG border=0 src="XSEEK%20%20Intelligent%20Search%20Engine%20for%20Semi-Structured%20Data_files/nsf1.png" height=50> <A href="http://www.google.com/" target=blank> <IMG border=0 src="XSEEK%20%20Intelligent%20Search%20Engine%20for%20Semi-Structured%20Data_files/google.png" height=50></A> This project is supported by <A href="http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0845647"><FONT color=blue>NSF CAREER Award 0845647</FONT></A> and a Google Research Award.</P></DIV> <P></P> <DIV style="BACKGROUND-COLOR: #ffffff" class=footer align=center>Copyright © 2006 - 2010 <A href="http://chenwsdb.fulton.ad.asu.edu:88/"><FONT color=blue>WSDB</FONT></A> @ Arizona State University</DIV></DIV> <SCRIPT> if (!document.layers) document.write('<div id="divStayTopLeft" style="position:absolute">') </SCRIPT> <DIV style="POSITION: absolute; TOP: 3px; LEFT: -30px" id=divStayTopLeft><LAYER id=divStayTopLeft left="100"><!--EDIT BELOW CODE TO YOUR OWN MENU--> <TABLE border=0 cellSpacing=0 cellPadding=0 width=100> <TBODY> <TR> <TD colSpan=3> <DIV id=avmenu> <UL> <LI><A class=STYLE5 href="http://chenwsdb.fulton.ad.asu.edu/xseek/home.htm#motivation">Motivation</A> <LI><A class=STYLE5 href="http://chenwsdb.fulton.ad.asu.edu/xseek/home.htm#approach">Approach</A> <LI><A class=STYLE5 href="http://chenwsdb.fulton.ad.asu.edu/xseek/home.htm#publication">Publications</A> <LI><A class=STYLE5 href="http://chenwsdb.fulton.ad.asu.edu/xseek/home.htm#tutorial">Tutorial</A> <LI><A class=STYLE5 href="http://chenwsdb.fulton.ad.asu.edu/xseek/home.htm#people">People</A> <LI><A class=STYLE5 href="http://chenwsdb.fulton.ad.asu.edu/xseek/home.htm#acknowledgement">Acknowledgement</A> </LI></UL></DIV></TD></TR></TBODY></TABLE><!--END OF EDIT--></LAYER></FONT> <SCRIPT type=text/javascript> /* Floating Menu script- Roy Whittle (http://www.javascript-fx.com/) Script featured on/available at http://www.dynamicdrive.com/ This notice must stay intact for use */ var verticalpos="fromtop" if (!document.layers) document.write('</div>') function JSFX_FloatTopDiv() { var startX = 60; var startY = 280; var ns = (navigator.appName.indexOf("Netscape") != -1); var d = document; function ml(id) { var el=d.getElementById?d.getElementById(id):d.all?d.all[id]:d.layers[id]; if(d.layers) el.style=el; el.sP=function(x,y){this.style.left=x;this.style.top=y;}; el.x = startX; if (verticalpos=="fromtop") el.y = startY; else{ el.y = ns ? pageYOffset + innerHeight : document.body.scrollTop + document.body.clientHeight; el.y -= startY; } return el; } window.stayTopLeft=function() { if (verticalpos=="fromtop"){ var pY = ns ? pageYOffset : document.body.scrollTop; ftlObj.y += (pY + startY - ftlObj.y)/8; } else { var pY = ns ? pageYOffset + innerHeight : document.body.scrollTop + document.body.clientHeight; ftlObj.y += (pY - startY - ftlObj.y)/8; } ftlObj.sP(ftlObj.x, ftlObj.y); setTimeout("stayTopLeft()", 10); } ftlObj = ml("divStayTopLeft"); stayTopLeft(); } JSFX_FloatTopDiv(); </SCRIPT> </DIV></BODY></HTML>