DERI Galway
National University of Ireland, Galway   Science Foundation Ireland
 

On this page past reading groups are listed.

Managing Information in Online Product Review Communities: Two Approaches

based on Managing Information in Online Product Review Communities: A Comparison of Two Approaches by Jahna Otterbacher

Maciej Dabrowski

Abstract

Virtual communities often suffer from a number of problems, including questionable information quality and information overload, which threaten their utility and stability. To address this, social filtering techniques may be used, in which users rate the postings, guiding others to the important ones. This method is contrasted to information retrieval techniques, in which intrinsic properties of texts, such as length or keywords, are used to rank them by perceived relevance to a topic. Each approach has advantages and disadvantages. Social navigation assumes that users actively rate messages, however, soliciting sufficient participation is a known challenge. Additionally, what is interesting for one user may not be for others. Currently, we compare these approaches in the context of an e-commerce product review forum at Amazon.com. We find that while a significant proportion of reviews go unrated, these reviews are typically of low quality. Interestingly, we also find that the rankings produced using reader-assigned “helpful votes” are correlated to the rankings assigned by some simple information retrieval algorithms. The conclusion is that a number of approaches for filtering product reviews could effectively be used in such online communities in order to accommodate user preferences, and thus, in reinforcing the utility of the community.


Date:

10th of March 2010


Original Paper:

is2.lse.ac.uk/as...


Sketching User Experiences - getting the design right and the right design

based on Sketching User Experiences - getting the design right and the right design by Bill Buxton

Krystian Samp

Abstract

There is almost a fervor in the way that new products, with their rich and dynamic interfaces, are being released to the public-typically promising to make lives easier, solve the most difficult of problems, and maybe even make the world a better place. The reality is that few of these products survive, much less deliver on their promise. The folly? An absence of design, and an over reliance on just technology and/or traditional practice.
We need design. But design as described here depends on the skills of a number of different communities-each essential, but on their own, none sufficient. In this rich ecology, designers are faced with new challenges-challenges that build on, rather than replace, existing skills and practice.
Sketching User Experiences approaches design and design thinking as something distinct that needs to be better understood by both designers and the people with whom they need to work in order to achieve success with these new types of products and systems. So while the focus is on design, the approach is holistic. Hence, the book speaks to designers, usability specialists, people from HCI, product managers and business executives. There is an emphasis on balancing the back-end concern with usability and engineering excellence (getting the design right) with an up-front investment in sketching and ideation (getting the right design). Overall, the objective is building the notion of informed design, molding emerging technology into a form that serves our society and reflects its values. Grounded in both practice and scientific research, Bill Buxton's engaging work aims to spark the imagination while encouraging the use of new techniques, breathing new life into user experience design.


Date:

24th of February 2010


Crowd-Coordination: 2 Studies of Wikipedia Talk Pages

based on 2 papers by several authors

Jodi Schneider

Abstract

Wikipedia, the 6th most visited website in the world according to Alexa, is a success story in massive collaboration. Despite tremendous growth and high traffic, Wikipedia is resilient to malicious editing. The fastest growing areas of Wikipedia are devoted to coordination and organization. Based on two qualitative and quantitative studies of Wikipedia, we describe the coordination work happening in Talk pages, and how volunteer editors use these pages to plan editing, enforce standards, and ensure the quality of information in this online encyclopedia.

Based on:

[1] F.B. Viegas, M. Wattenberg, J. Kriss, and F.V. Ham, “Talk Before You Type: Coordination in Wikipedia,” 40th Annual Hawaii International Conference on System Sciences, 2007. HICSS 2007. pp. 78-87.
http://doi.ieeecomputersociety.org/10.1109/HICSS.2007.511

[2] B. Stvilia, M.B. Twidale, L.C. Smith, and L. Gasser, “Information Quality Work Organization in Wikipedia,” Journal of the American Society for Information Science and Technology, vol. 59, 2008, pp. 983-1001.
http://dx.doi.org/10.1002/asi.20813



Date:

10th of February 2010


Populating the Semantic Web with Relations from Text

based on "Populating the Semantic Web with Relations from Text" and "Populating the Semantic Web—Combining Text and Relational Databases as RDF Graphs" by Kate Byrne, Claire Grover, Ewan Klein

Behrang Qasemizadeh

Abstract

Great hopes are cherished for the Semantic Web. It is intended to make linking data as easy as HTML makes linking Web documents. The basic information is in easily searchable databases, but huge amounts of content are locked in text that is difficult to query systematically. Extracting coherent facts from related but separate collections is another difficulty. The papers discuss how the semantic web can offer solution to both of these problems, and suggests RDF graph as unified format for combining all relevant contents. The focus will be more on the process of text to RDF.


Date:

03rd of February 2010


Mining Web Data for Competency Management

based on Mining Web Data for Competency Management by J.Zhu, A.L.Gonçalves, V.S.Uren, E.Motta, R.Pacheco

Georgeta Bordea

Abstract

We present CORDER (COmmunity Relation Discovery by named Entity Recognition) an un-supervised machine learning algorithm that exploits named entity recognition and co-occurrence data to associate individuals in an organization with their expertise and associates. We discuss the problems associated with evaluating unsupervised learners and report our initial evaluation experiments.


Date:

27th of January 2010


Building the Santa Fe Artificial Stock Market

based on Building the Santa Fe Artificial Stock Market by LeBaron, B.

Daniel Paraschiv

Abstract

This short summary presents an insider's look at the construction of the Santa Fe artificial stock market. The perspective considers the many design questions that went into building the market from the perspective of a decade of experience with agent-based financial markets. The market is assessed based on its overall strengths and weaknesses.



Date:

20th of January 2010


Original Paper:

www.citeulike.or...


Pioneering Theories for Diffusion of Information in Networks

based on 2 papers by various authors

Partha Basuchowdhuri

Abstract

based on:
[1] Frank M. Bass. A New Product Growth for Model Consumer Durables. In Management Science, Vol. 15, No. 5, Theory Series (Jan., 1969), pp. 215-227.
[2] Mark S. Granovetter. The Strength of Weak Ties. In The American Journal of Sociology, Vol. 78, No. 6 (May, 1973), pp. 1360-1380.

Abstract:
The talk is presenting two key theories [1][2] that have made significant contribution to the understanding of the spread of an information or an influence within a network.

Abstract of [1]:
A growth model for the timing of initial purchase of new products is developed and tested empirically against data for eleven consumer durables. The basic assumption of the model is that the timing of a consumer's initial purchase is related to the number of previous buyers. A behavioral rationale for the model is offered in terms of innovative and imitative behavior. The model yields good predictions of the sales peak and the timing of the peak when applied to historical data. A long-range forecast is developed for the sales of color television sets.

Abstract of [2]:
Analysis of social networks is suggested as a tool for linking micro and macro levels of sociological theory. The procedure is illustrated by elaboration of the macro implications of one aspect of small-scale interaction: the strength of dyadic ties. It is argued that the degree of overlap of two individuals' friendship networks varies directly with the strength of their tie to one another. The impact of this principle on diffusion of influence and information, mobility opportunity, and community organization is explored. Stress is laid on the cohesive power of weak ties. Most network models deal, implicitly, with strong ties, thus confining their applicability to small, well-defined groups. Emphasis on weak ties lends itself to discussion of relations between groups and to analysis of segments of social structure not easily defined in terms of primary groups.

Papers available at:
[1] http://math.asu.edu/~dieter/courses/mat451/fall08/References/bass_1969.pdf
[2] http://www.sna.pl/teksty/Granovetter73.pdf


Date:

06th of January 2010


Mapping the World's Photos

based on Mapping the World's Photos by David Crandall, Lars Backstrom, Daniel Huttenlocher, Jon Kleinberg

Gabriela Vulcu

Abstract

We investigate how to organize a large collection of geotagged photos, working with a dataset of about 35 million images collected from Flickr. Our approach combines content analysis based on text tags and image data with structural analysis based on geospatial data. We use the spatial distribution of where people take photos to define a relational structure between the photos that are taken at popular places. We then study the interplay between this structure and the content, using classification methods for predicting such locations from visual, textual and temporal features of the photos. We find that visual and temporal features improve the ability to estimate the location of a photo, compared to using just textual features. We illustrate using these techniques to organize a large photo collection, while also revealing various interesting properties about popular cities and landmarks at a global scale.



Date:

16th of December 2009


Original Paper:

www.cs.cornell.e...


Social phishing

based on Social phishing by Tom N. Jagatic, Nathaniel A. Johnson, Markus Jakobsson, Filippo Menczer

Slawomir Grzonkowski

Abstract

Phishing is a form of deception in which an attacker attempts to fraudulently acquire sensitive information from a victim by impersonating a trustworthy entity. Phishing attacks typically employ generic “lures.” For instance, a phisher misrepresenting himself as a large banking corporation or popular online auction site will have a reasonable yield, despite knowing little to nothing about the recipient. In a study by Gartner Group [9], about 19% of all those surveyed reported having clicked on a link in a phishing email message, and 3% admitted to giving up financial or personal information. The research project described here was designed to provide us with a baseline success rate for individual phishing attacks, and was, when it was performed in 2005, the first study to achieve this goal.


Date:

09th of December 2009


Original Paper:

doi.acm.org/10.1...


Thinking inside the box: System-level failures of Tamper Proofing

based on Thinking inside the box: System-level failures of Tamper Proofing by Saar Drimer, Steven J. Murdoch, Ross Anderson

Oana-Elena Ureche

Abstract

PIN entry devices (PEDs) are critical security components in EMV smartcard payment systems as they receive a customer's card and PIN. Their approval is subject to an extensive suite of evaluation and certification procedures. In this paper, we demonstrate that the tamper proofing of PEDs is unsatisfactory, as is the certification process. We have implemented practical low-cost attacks on two certified, widely-deployed PEDs -- the Ingenico i3300 and the Dione Xtreme. By tapping inadequately protected smartcard communications, an attacker with basic technical skills can expose card details and PINs, leaving cardholders open to fraud. We analyze the anti-tampering mechanisms of the two PEDs and show that, while the specific protection measures mostly work as intended, critical vulnerabilities arise because of the poor integration of cryptographic, physical and procedural protection. As these vulnerabilities illustrate a systematic failure in the design process, we propose a methodology for doing it better in the future. These failures also demonstrate a serious problem with the Common Criteria. So we discuss the incentive structures of the certification process, and show how they can lead to problems of the kind we identified. Finally, we recommend changes to the Common Criteria framework in light of the lessons learned.


Date:

02nd of December 2009


Original Paper:

ieeexplore.ieee....


The role of architecture in solving problems

based on The end of an architectural era: (it's time for a complete rewrite) by Michael Stonebraker, Samuel Madden, Daniel J. Abadi, Stavros Harizopoulos, Nabil Hachem, Pat Helland

Benjamin Heitmann

Abstract

In previous papers [SC05, SBC+07], some of us predicted the end of "one size fits all" as a commercial relational DBMS paradigm. These papers presented reasons and experimental evidence that showed that the major RDBMS vendors can be outperformed by 1--2 orders of magnitude by specialized engines in the data warehouse, stream processing, text, and scientific database markets.
Assuming that specialized engines dominate these markets over time, the current relational DBMS code lines will be left with the business data processing (OLTP) market and hybrid markets where more than one kind of capability is required. In this paper we show that current RDBMSs can be beaten by nearly two orders of magnitude in the OLTP market as well. The experimental evidence comes from comparing a new OLTP prototype, H-Store, which we have built at M.I.T. to a popular RDBMS on the standard transactional benchmark, TPC-C.

We conclude that the current RDBMS code lines, while attempting to be a "one size fits all" solution, in fact, excel at nothing. Hence, they are 25 year old legacy code lines that should be retired in favor of a collection of "from scratch" specialized engines. The DBMS vendors (and the research community) should start with a clean sheet of paper and design systems for tomorrow's requirements, not continue to push code lines and architectures designed for yesterday's needs.


Date:

01st of December 2009


Original Paper:

portal.acm.org/c...


Designing experiments and performing statistical analysis using ANOVA - Introduction

based on Comparing the effects of text size and format on the readability of computer-displayed Times New Roman and Arial text by Bernard, M.L., Chaparro, B.S., Mills, M.M., Halcomb, C.G.

Jacek Jankowski

Abstract

Jacek will make a short and simple tutorial on how to design an experiment and how to analyze evaluation results with analysis of variance (ANOVA). He will also show how to evaluate pairwise comparisons using Bonferroni procedure. Presentation will be based on the evaluation described in the related paper.



Date:

25th of November 2009


Original Paper:

portal.acm.org/c...


Evaluation in Human Computer Interaction

based on 2 papers by various authors

VinhTuan Thai

Abstract

The talk will be based on two papers from the CHI conference series:

Greenberg, S. and Buxton, B. Usability evaluation considered harmful (some of the time). In Proceeding of CHI '08.

Abstract:
Current practice in Human Computer Interaction as encouraged by educational institutes, academic review processes, and institutions with usability groups advocate usability evaluation as a critical part of every design process. This is for good reason: usability evaluation has a significant role to play when conditions warrant it. Yet evaluation can be ineffective and even harmful if naively done 'by rule' rather than 'by thought'. If done during early stage design, it can mute creative ideas that do not conform to current interface norms. If done to test radical innovations, the many interface issues that would likely arise from an immature technology can quash what could have been an inspired vision. If done to validate an academic prototype, it may incorrectly suggest a design's scientific worthiness rather than offer a meaningful critique of how it would be adopted and used in everyday practice. If done without regard to how cultures adopt technology over time, then today's reluctant reactions by users will forestall tomorrow's eager acceptance. The choice of evaluation methodology - if any - must arise from and be appropriate for the actual problem or research question under consideration.

Barkhuus, L. and Rhode, J.A., From Mice to Men - 24 years of Evaluation in CHI. In Extended Proceedings of CHI '07

Abstract:
This paper analyzes trends in the approach to evaluation taken by CHI papers in the last 24 years. A set of papers was analyzed according to our schema for classifying type of evaluation. Our analysis traces papers’ trend in type and scope of evaluation. Findings include an increase in the proportion of papers that include evaluation, and a decrease in the median number of subjects in quantitative studies. We also critique the types of subjects, in particular an over reliance on students, and lack of appropriately gender balanced samples. We contextualize these findings in historical trends as we move from machines intended for the technical elite in laboratories to computers integrated into the daily life of everyone.



Date:

11th of November 2009


To Buy or Not to Buy: Mining Airfare Data to Minimize Ticket Purchase Price

based on To Buy or Not to Buy: Mining Airfare Data to Minimize Ticket Purchase Price by Oren Etzioni, Rattapoom Tuchinda, Craig A. Knoblock, Alexander Yates

Maciej Zaremba

Abstract

As product prices become increasingly available on the World Wide Web, consumers attempt to understand how corporations vary these prices over time. However, corporations change prices based on proprietary algorithms and hidden variables (e.g., the number of unsold seats on a fight). Is it possible to develop data mining techniques that will enable consumers to predict price changes under these conditions? This paper reports on a pilot study in the domain of airline ticket prices where we recorded over 12,000 price observations over a 41 day period. When trained on this data, Hamlet - our multi-strategy data mining algorithm - generated a predictive model that saved 341 simulated passengers $198,074 by advising them when to buy and when to postpone ticket purchases. Remarkably, a clairvoyant algorithm with complete knowledge of future prices could save at most $320,572 in our simulation, thus Hamlet's savings were 61.8% of optimal. The algorithm's savings of $198,074 represents an average savings of 23.8% for the 341 passengers for whom savings are possible. Overall, Hamlet saved 4.4% of the ticket price averaged over the entire set of 4,488 simulated passengers. Our pilot study suggests that mining of price data available over the web has the potential to save consumers substantial sums of money per annum.

This paper is an excellent example of how interesting research can be turned into practical and valuable business. The research paper has been published in 2003, in the same year Oren Etizioni established airline fare prediction company called Hamlet, later on rebranded to Farecast. Over the years price prediction methods offered by Farecast became more mature and number of different US routes and airlines has been supported. Farecast was acquired for $115 million by Microsoft in 2008 and airline ticket price prediction is part of the Bing travel [2]. Price prediction functionality is currently in beta version and it is planned to support many more international routes. Online price prediction proposed by Etizioni et al. is protected by the international patent filed in 2006 [3].

[1] http://portal.acm.org/citation.cfm?doid=956750.956767
[2] http://www.bing.com/travel/about/howAirPredictions.do
[3] http://www.freepatentsonline.com/7010494.pdf


Date:

04th of November 2009


Original Paper:

portal.acm.org/c...


Data Integration in Mashups

based on Data Integration in Mashups by Giusy Di Lorenzo, Hakim Hacid, Hye-young Paik, Boualem Benatallah

Wassim Derguech

Abstract

Mashup is a new application development approach that allows users to aggregate multiple services to create a service for a new purpose. Even if the Mashup approach opens new and broader opportunities for data/service consumers, the development process still requires the users to know not only how to write code using programming languages, but also how to use the different Web APIs from different services. In order to solve this problem, there is increasing effort put into developing tools which are designed to support users with little programming knowledge in Mashup applications development. The objective of this study is to analyze the richnesses and weaknesses of the Mashup tools with respect to the data integration aspect.


Date:

21st of October 2009


Original Paper:

portal.acm.org/c...


Evaluation of Software Systems

based on Evaluation of Software Systems by Günther Gediga, Kai-Christoph Hamborg

Simon Scerri

Abstract

Evaluation as an aid for software development has been applied since the last decade, when the comprehension of the role of evaluation within Human-Computer Interaction had changed. Software can be evaluated with respect to different aspects, for example, functionality, reliability, usability, efficiency, maintainability and portability.
In this survey, we concentrate on the aspect of usability from an ergonomic point of view. This aspect has gained particular importance during the last decade with the increasing use of interactive software. Whereas in earlier times evaluation of software took place at the end of the developing phase, using experimental designs and statistical analysis, evaluation is nowadays used as a tool for information gathering within iterative design.
Within this context, instruments for evaluation are not primarily used for global evaluation of an accomplished product, but these instruments are applied during the development of a product. Indeed, most experts agree nowadays that the development of usable software can only be done by a systematic consideration of usability aspects within the life-cycle model. One prominent part is the evaluation of prototypes with respect to usability aspects, employing suitable evaluation techniques in order to find usability errors and weaknesses of the software at an early stage.


Date:

07th of October 2009


Material (Slides):

fileadmin/docume...


Representing, Querying and Transforming Social Networks with RDF/SPARQL

based on Representing, Querying and Transforming Social Networks with RDF/SPARQL by Mauro San Martín, Claudio Gutierrez

Sheila Kinsella

Abstract

As social networks are becoming ubiquitous on the Web, the Semantic Web goals indicate that it is critical to have a standard model allowing exchange, interoperability, transformation, and querying of social network data. In this paper we show that RDF/SPARQL meet this desiderata. Building on developments of social network analysis, graph databases and Semantic Web, we present a social networks data model based on RDF, and a query and transformation language based on SPARQL meeting the above requirements. We study its expressive power and complexity showing that it behaves well, and present an illustrative prototype.


Date:

30th of September 2009


Original Paper:

www.dcc.uchile.c...


Opinion Mining and Sentiment Analysis - Two relevant directions

based on Various papers by various authors

Tudor Groza

Abstract

Argumentative discussions can be found in a variety of domains from traditional scientific publishing to today's modern social software. An interactive argumentative discussion usually consists of an initial proposition stated by a single creator, followed by supporting propositions or counter-propositions from other contributors. These propositions intrinsically externalize the contributors' sentiments and opinions, with the current Web 2.0 technologies enabling more and more their free expression. Consequently, this transforms the Web into an extremely valuable source for mining people's positions and sentiments. Lately, the topic of opinion mining and sentiment analysis gained a lot of interest and several research directions have been followed. In this presentation I will focus, in particular, on two such directions: document level and sentence level opinion mining and sentiment analysis.


Based on the following collection of papers:

(Liu 2007): Bing Liu, Web Data Mining - Exploring Hyperlinks, Contents and Usage Data, Springer, December 2006

(Turney, 2002): Peter D. Turney - Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews, ACL 2002

(Pang et al., 2002): Bo Pang, Lillian Lee and Shivakumar Vaithyanathan - Thumbs up? Sentiment Classification using Machine Learning Techniques, EMNLP 2002

(Dave et al., 2003): Kushal Dave, Steve Lawrence, David M. Pennock - Mining the peanut gallery: opinion extraction and semantic classification of product reviews, WWW 2003

(Mullen and Collier, 2004): Tony Mullen and Nigel Collier - Sentiment Analysis using Support Vector Machines with Diverse Information Sources, EMNLP 2004

(Pang and Lee, 2005): Bo Pang and Lillian Lee - Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales, ACL 2005

(Chaovalit and Zhou, 2005): Pimwadee Chaovalit, Lina Zhou - Movie Review Mining: a Comparison between Supervised and Unsupervised Classification Approaches, HICSS 2005

(Wiebe et al, 1999): Janyce Wiebe, Rebecca F. Bruce, Thomas P. O'Hara - Development and Use of a Gold-Standard Data Set for Subjectivity Classifications.

(Yu and Hazivassiloglou, 2003): Hong Yu and Vasileios Hatzivassiloglou - Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences, EMNLP 2003

(Rilloff and Wiebe, 2003): Ellen Riloff and Janyce Wiebe - Learning Extraction Patterns for Subjective Expressions, EMNLP 2003

(Hatzivassiloglou and Wiebe, 2000): Vasileios Hatzivassiloglou and Janyce Wiebe - Effects of Adjective Orientation and Gradability on Sentence Subjectivity, Coling 2000

(Wiebe and Rilloff, 2005): Janyce Wiebe and Ellen Riloff - Creating Subjective and Objective Sentence Classifiers from Unannotated Texts, CICLing 2005

(Kim and Hovy, 2004): Soo-Min Kim and Eduard Hovy - Determining the sentiment of opinions, COLING 2004



Date:

23rd of September 2009


Tesseract: Interactive Visual Exploration of Socio-Technical Relationships in Software Development

based on Tesseract: Interactive Visual Exploration of Socio-Technical Relationships in Software Development by Anita Sarma, Larry Maccherone, Patrick Wagstrom,, Jim Herbsleb

Aftab Iqbal

Abstract

Software developers have long known that project success requires a robust understanding of both technical and social linkages. However, research has largely considered these independently. Research on networks of technical artifacts focuses on techniques like code analysis or mining project archives. Social network analysis has been used to capture information about relations among people. Yet, each type of information is often far more useful when combined, as when the “goodness” of social networks is judged by the patterns of dependencies in the technical artifacts. To bring such information together, we have developed Tesseract, a socio-technical dependency browser that utilizes cross-linked displays to enable exploration of the myriad relationships between artifacts, developers, bugs, and communications. We evaluated Tesseract by (1) demonstrating its feasibility with GNOME project data (2) assessing its usability via informal user evaluations, and (3) verifying its suitability for the open source community via semi-structured interviews.



Date:

16th of September 2009


Original Paper:

portal.acm.org/c...


Optimizing Peer Relationships in a Super-Peer Network

based on Optimizing Peer Relationships in a Super-Peer Network by Pawe Garbacki, Dick H. J Epema, Maarten Van Steen

Sanaullah Nazir

Abstract

Super-peer architectures exploit the heterogeneity of nodes in a P2P network by assigning additional responsi- bilities to higher-capacity nodes. In the design of a super- peer network for file sharing, several issues have to be addressed: how client peers are related to super-peers, how super-peers locate files, how the load is balanced among the super-peers, and how the system deals with node failures. In this paper we introduce a self-organizing super-peer network architecture (SOSPNET) that solves these issues in a fully decentralized manner. SOSPNET maintains a super- peer network topology that reflects the semantic similarity of peers sharing content interests. Super-peers maintain semantic caches of pointers to files which are requested by peers with similar interests. Client peers, on the other hand, dynamically select super-peers offering the best search performance. We show how this simple approach can be em- ployed not only to optimize searching, but also to solve generally difficult problems encountered in P2P architectures such as load balancing and fault tolerance. We evaluate SOSPNET using a model of the semantic structure derived from the 8-month traces of two large file-sharing communi- ties. The obtained results indicate that SOSPNET achieves close-to-optimal file search performance, quickly adjusts to changes in the environment (node joins and leaves), sur- vives even catastrophic node failures, and efficiently distributes the system load taking into account peer capacities.



Date:

09th of September 2009


Case-based reasoning - short introduction

based on Case-Based Reasoning: Foundational Issues, Methodological Variations, and System Approaches by A. Aamodt, E. Plaza

Adam Gzella

Abstract

Case-based reasoning is a recent approach to problem solving and learning that has got a lot of attention over the last few years. Originating in the US, the basic idea and underlying theories have spread to other continents, and we are now within a period of highly active research in case-based reasoning in Europe, as well. This paper gives an overview of the foundational issues related to case-based reasoning, describes some of the leading methodological approaches within the field, and exemplifies the current state through pointers to some systems. Initially, a general framework is defined, to which the subsequent descriptions and discussions will refer. The framework is influenced by recent methodologies for knowledge level descriptions of intelligent systems. The methods for case retrieval, reuse, solution testing, and learning are summarized, and their actual realization is discussed in the light of a few example systems that represent different CBR approaches. We also discuss the role of case-based methods as one type of reasoning and learning method within an integrated system architecture.


Date:

02nd of September 2009


Original Paper:

www2.iiia.csic.e...


An overview of RDB2RDF techniques and tools

based on several papers by various authors

Nuno Lopes

Abstract

Many science archive centres publish very large volumes of image, simulation, and experiment data. In order to integrate and analyse the available data, scientists need to be able to (i) identify and locate all the data relevant to their work; (ii) understand the multiple heterogeneous data models in which the data is published; and (iii) interpret and process the data they retrieve. RDF has been shown to be a generally successful framework within which to perform such data integration work. It can be equally successful in the context of scientific data, if it is demonstrably practical to expose that data as RDF. In this paper we investigate the capabilities of RDF to enable the integration of scientific data sources. Specifically, we discuss the suitability of sparql for expressing scientific queries, and the performance of several triple stores and RDB2RDF tools for executing queries over a moderately sized sample of a large astronomical data set. We found that more research and improvements are required into SPARQL and RDB2RDF tools to efficiently expose existing science archives for data integration.

Based on:

A. J. G. Gray, N. Gray, and I. Ounis. Can RDB2RDF Tools Feasibly Expose Large Science Archives for Data Integration? In L. Aroyo, P. Traverso, F. Ciravegna, P. Cimiano, T. Heath, E. Hyvönen, R. Mizoguchi, E. Oren, M. Sabou, and E. P. B. Simperl, editors, ESWC, volume 5554 of Lecture Notes in Computer Science, pages 491–505. Springer, 2009.

A. Malhotra. W3C RDB2RDF Incubator Group Report. http://www.w3.org/2005/Incubator/rdb2rdf/XGR-rdb2rdf/, January 2009.

S. S. Sahoo, W. Halb, S. Hellmann, K. Idehen, T. T. Jr, S. Auer, J. Sequeda, and A. Ezzat. A Survey of Current Approaches for Mapping of Relational Databases to RDF. W3C RDB2RDF XG Report, W3C, 2009.



Date:

26th of August 2009


Original Paper:

www.w3.org/2005/...


Bokode: Imperceptible Visual tags for Camera Based Interaction from a Distance

based on Bokode: Imperceptible Visual tags for Camera Based Interaction from a Distance by Ankit Mohan, Grace Woo, Shinsaku Hiura, Quinn Smithwick, Ramesh Raskar

Gearoid Hynes

Abstract

We show a new camera based interaction solution where an ordinary camera can detect small optical tags from a relatively large distance. Current optical tags, such as barcodes, must be read within a short range and the codes occupy valuable physical space on products. We present a new low-cost optical design so that the tags can be shrunk to 3mm visible diameter, and unmodified ordinary cameras several meters away can be set up to decode the identity plus the relative distance and angle. The design exploits the bokeh effect of ordinary cameras lenses, which maps rays exiting from an out of focus scene point into a disk like blur on the camera sensor. This bokeh-code or Bokode is a barcode design with a simple lenslet over the pattern. We show that a code with 15μm features can be read using an off-the-shelf camera from distances of up to 2 meters. We use intelligent binary coding to estimate the relative distance and angle to the camera, and show potential for applications in augmented reality and motion capture. We analyze the constraints and performance of the optical system, and discuss several plausible application scenarios.


Date:

19th of August 2009


Original Paper:

delivery.acm.org...


RDF-3X: a RISC-style engine for RDF

based on RDF-3X: a RISC-style engine for RDF by Thomas Neumann, Gerhard Weikum

Aidan Hogan

Abstract

RDF is a data representation format for schema-free structured information that is gaining momentum in the context of Semantic-Web corpora, life sciences, and also Web 2.0 platforms. The "pay-as-you-go" nature of RDF and the flexible pattern-matching capabilities of its query language SPARQL entail efficiency and scalability challenges for complex queries including long join paths. This paper presents the RDF-3X engine, an implementation of SPARQL that achieves excellent performance by pursuing a RISC-style architecture with a streamlined architecture and carefully designed, puristic data structures and operations. The salient points of RDF-3X are: 1) a generic solution for storing and indexing RDF triples that completely eliminates the need for physical-design tuning, 2) a powerful yet simple query processor that leverages fast merge joins to the largest possible extent, and 3) a query optimizer for choosing optimal join orders using a cost model based on statistical synopses for entire join paths. The performance of RDF-3X, in comparison to the previously best state-of-the-art systems, has been measured on several large-scale datasets with more than 50 million RDF triples and benchmark queries that include pattern matching and long join paths in the underlying data graphs.



Date:

12th of August 2009


Material (Slides):

sw.deri.org/~aid...


Original Paper:

www.vldb.org/pvl...


Inferences under time pressure: how opportunity costs affect strategy selection

based on Inferences under time pressure: how opportunity costs affect strategy selection by Rieskamp, J., Hoffrage, U.

Anna Dabrowska

Abstract

Do the inference strategies people select depend on the magnitude of time pressure? Is this dependency modified by the type of time pressure? These questions are addressed in three experimental studies in which participants made inferences after having searched for information on a computerized information board. In Study 1, time pressure was induced indirectly by imposing opportunity costs of being slow, a form of time pressure that is common in daily life but that has rarely been examined in the literature. A simple lexicographic heuristic (LEX) achieved the best fit in predicting participants' inferences. Studies 2 and 3 induced high time pressure either indirectly by imposing opportunity costs in terms of time or directly by limiting the time for each choice. Regardless of how time pressure was induced, under high time pressure the inferences could be best predicted with LEX, whereas under low time pressure a weighted linear model that integrates all available information predicted the inferences best. We conclude that people select strategies adaptively depending on characteristics of the situation.


Date:

05th of August 2009


Original Paper:

www.sciencedirec...


Human computation and the ESP Game story

Ilko Grigorov

Abstract

Based on three papers.

“We introduce a new interactive system: a game that is fun and can be used to create valuable output. When people play the game they help determine the contents of images by providing meaningful labels for them.” [1]

“In this paper, we propose a competitive human computation game, KissKissBan (KKB), for image annotation. KKB is different from other human computation games since it integrates both collaborative and competitive elements in the game design.” [2]

“The ESP Game was designed to harvest human intelligence to assign labels to images - a task which is still difficult for even the most advanced systems in image processing. However, the ESP Game as it is currently implemented encourages players to assign “obvious” labels, which are most likely to lead to an agreement with the partner.” “We present a language model which, given enough instances of labeled images as training data, can assign probabilities to the next label to be added. This model is then used in a program, which plays the ESP game without looking at the image. Even without any understanding of the actual image, the program manages to agree with the randomly assigned human partner on a label for 69% of all images, and for 81% of images which have at least one “off-limits” term assigned to them.” [3]


[1]. Luis von Ahn and Laura Dabbish (Carnegie Mellon University, USA), Labeling images with a computer game. In Conference on Human Factors in Computing Systems (CHI’04), pages 319–326, 2004.

[2]. Chien-Ju Ho, Tao-Hsuan Chang, Jong-Chuan Lee, Jane Yung-Jen Hsu, Kuan-Ta Chen, KissKissBan: A Competitive Human Computation Game for Image Annotation, In Human Computation Workshop KDD2009

[3]. Ingmar Weber (EPFL - Lausanne, Switzerland), Stephen Robertsonand Milan Vojnovic (Microsoft Research, Cambridge UK), Rethinking the ESP Game, Technical Report MSR-TR-2008-132



Date:

29th of July 2009


Semantic Grounding of Tag Relatedness in Social Bookmarking Systems by Smitashree Choudhury

based on Semantic Grounding of Tag Relatedness in Social Bookmarking Systems by Smitashree Choudhury by Ciro Cattuto, Dominik Benz, Andreas Hotho,, Gerd Stumme

Smitashree Choudhury

Abstract

Collaborative tagging systems have nowadays become important data sources for populating semantic web applications. For tasks like synonym detection and discovery of concept hierarchies, many researchers introduced measures of tag similarity. Eventhough most of these measures appear very natural, their design often seems to be rather ad hoc, and the underlying assumptionson the notion of similarity are not made explicit. A more systematic characterization and validation of tag similarity interms of formal representations of knowledge is still lacking. Here we address this issue and analyze several measures oftag similarity: Each measure is computed on data from the social bookmarking system del.icio.us and a semantic grounding isprovided by mapping pairs of similar tags in the folksonomy to pairs of synsets in Wordnet, where we use validated measuresof semantic distance to characterize the semantic relation between the mapped tags. This exposes important features of theinvestigated similarity measures and indicates which ones are better suited in the context of a given semantic application.


Date:

15th of July 2009


Material (Slides):

www.deri.ie/file...


Original Paper:

isiosf.isi.it/~c...


Fresnel: a Browser-Independent Presentation Vocabulary for RDF

based on Fresnel: a Browser-Independent Presentation Vocabulary for RDF by Christian Bizer, Ryan Lee, Emmanuel Pietriga

Stéphane Corlosquet

Abstract

SemanticWeb browsers and other tools aimed at displaying RDF data to end users are all concerned with the same problem: presenting content primarily intended for machine consumption in a human-readable way. Their solutions differ but in the end address the same two high-level issues, no matter the underlying representation paradigm: specifying (i) what information contained in RDF models should be presented (content selection) and (ii) how this information should be presented (content formatting and styling). However, each tool currently relies on its own ad hoc mechanisms and vocabulary for specifying RDF presentation knowledge, making it difficult to share and reuse such knowledge across applications. Recognizing the general need for presenting RDF content to users and wanting to promote the exchange of presentation knowledge, we designed Fresnel as a browser-independent vocabulary of core RDF display concepts. In this paper we describe Fresnel’s main concepts and present several RDF browsers and visualization tools that have adopted the vocabulary so far.


Date:

24th of June 2009


Original Paper:

iswc2006.semanti...


A survey on wireless multimedia sensor networks

based on A survey on wireless multimedia sensor networks by Ian F. Akyildiz, Tommaso Melodia, Kaushik R. Chowdhury

Lei Shu

Abstract

The availability of low-cost hardware such as CMOS cameras and microphones has fostered the development of Wireless Multimedia Sensor Networks (WMSNs), i.e., networks of wirelessly interconnected devices that are able to ubiquitously retrieve multimedia content such as video and audio streams, still images, and scalar sensor data from the environment. In this paper, the state of the art in algorithms, protocols, and hardware for wireless multimedia sensor networks is surveyed, and open research issues are discussed in detail. Architectures for WMSNs are explored, along with their advantages and drawbacks. Currently off-the-shelf hardware as well as available research prototypes for WMSNs are listed and classified. Existing solutions and open research issues at the application, transport, network, link, and physical layers of the communication protocol stack are investigated, along with possible cross-layer synergies and optimizations.


Date:

17th of June 2009


Original Paper:

www.eng.buffalo....


IRLbot: scaling to 6 billion pages and beyond

based on IRLbot: scaling to 6 billion pages and beyond by Hsin-Tsang Lee, Derek Leonard, Xiaoming Wang, Dmitri Loguinov

Jürgen Umbrich

Abstract

This paper shares our experience in designing a web crawler that can download billions of pages using a single-server implementation and models its performance. We show that with the quadratically increasing complexity of verifying URL uniqueness, BFS crawl order, and fixed per-host rate-limiting, current crawling algorithms cannot effectively cope with the sheer volume of URLs generated in large crawls, highly-branching spam, legitimate multi-million-page blog sites, and infinite loops created by server-side scripts. We offer a set of techniques for dealing with these issues and test their performance in an implementation we call IRLbot. In our recent experiment that lasted 41 days, IRLbot running on a single server successfully crawled 6.3 billion valid HTML pages ($7.6$ billion connection requests) and sustained an average download rate of 319 mb/s (1,789 pages/s). Unlike our prior experiments with algorithms proposed in related work, this version of IRLbot did not experience any bottlenecks and successfully handled content from over 117 million hosts, parsed out 394 billion links, and discovered a subset of the web graph with 41 billion unique nodes.


Date:

10th of June 2009


Material (Slides):

umbrich.net/irlb...


Original Paper:

www2008.org/pape...


Generality in AI (and its Relation to the Semantic Web)

based on Several papers by Various authors

Vit Novacek

Abstract

Vit will present selected parts from the research and visions of six seminal thinkers active in fields ranging from AI and Semantic Web through cognitive science to psychology and economics. In particular, the talk will be based on four papers, which may seem to be dealing with rather disparate topics, however, they all are related in one way or another to generality, a fundamental feature assumed to be inherent to any truly intelligent system. First Vit will recall the McCarthy's take on generality in AI based on his ACM Turing Award lecture article [1]. Then he will present a sort of alternative and competing position, as given in Hofstadter's Analogical Mind book chapter [2]. He will show that Hofstadter's heretic non-formalist visions might have had at least a partial rigorous support since already long time ago according to the Tversky and Kahneman's Science article on judgment under uncertainty [3]. Following the cognitive science detour, he will get back to more familiar grounds and provide a summary of the major Semantic Web modelling paradigms based on Patel-Schneider and Horrocks' position paper from WWW 2006 [4]. After presenting the core content of the talk in as objective way as possible, he will provide a slightly personal synthesis and ask few questions we should try to answer in case he is right.

[1] John McCarthy. Generality in Artificial Intelligence. Communications
of ACM, 30 (12). ACM Press, 1987.
URL: http://www-formal.stanford.edu/jmc/generality.pdf

[2] Douglas R. Hofstadter. Analogy as the Core of Cognition. The Analogical
Mind: Perspectives from Cognitive Science}, Dedre Gentner, Keith J. Holyoak, and
 Boicho N. Kokinov (eds.). Cambridge MA: The MIT Press/Bradford Book, 2001.
URL: http://prelectur.stanford.edu/lecturers/hofstadter/analogy.html

[3] Amos Tversky, Daniel Kahneman. Judgment under Uncertainty:
Heuristics and Biases. Science, 185 (4157). AAAS, 1974.
URL: http://psiexp.ss.uci.edu/research/teaching/Tversky_Kahneman_1974.pdf

[4] Peter F. Patel-Schneider, Ian Horrocks. Position Paper: A Comparison
of Two Modelling Paradigms in the Semantic Web. In Proceedings of
WWW'06. ACM Press, 2006.
URL: http://portal.acm.org/citation.cfm?doid=1135777.1135784


Date:

27th of May 2009


Original Paper:

140.203.154.209/...


Presentation of POWDER

Jürgen Umbrich, Michael Hausenblas


Date:

20th of May 2009


Material (Slides):

fileadmin/docume...


Original Paper:

www.w3.org/2007/...


Personal Information Management by Note-taking

based on Information scraps: How and why information eludes our personal information management tools by Michael S. Bernstein, Max Van Kleek, David R. Karger, M. C. Schraefel

Laura Dragan


Date:

20th of May 2009


Perceptually grounded meaning creation

based on Perceptually grounded meaning creation by Luc Steels

Antonio Aguilare

Abstract

The paper proposes a mechanism for the spontaneous formation of perceptually grounded meanings under the selectionist pressure of a discrimination task. The mechanism is defined formally and the results of some simulation experiments are reported.

Keywords: origins of meanings, self-organization, distributed agents, open systems.


Date:

13th of May 2009


Material (Slides):

www.antonio-agui...


Original Paper:

Perceptually gro...


Consumer decision making in online shopping environments: The effects of interactive decision aids

based on Consumer decision making in online shopping environments: The effects of interactive decision aids by Häubl, G., Trifts, V.

Maciej Dabrowski

Abstract

Despite the explosive growth of electronic commerce and the rapidly increasing number of consumers who use interactive media (such as the World Wide Web) for prepurchase information search and online shopping, very little is known about how consumers make purchase decisions in such settings. The availability of interactive decision tools for consumers may lead to a transformation of the way in which shoppers search for product information and make purchase decisions. The primary objective of this paper is to investigate the nature of the effects that interactive decision aids may have on consumer decision making in online shopping environments. This paper examines the effects of two decision aids on purchase decision making in an online store. The first interactive tool, a recommendation agent (RA), allows consumers to more efficiently screen the (potentially very large) set of alternatives available in an online shopping environment. Based on self-explicated information about a consumer's own utility function (attribute importance weights and minimum acceptable attribute levels), the RA generates a personalized list of recommended alternatives. The second decision aid, a comparison matrix (CM), is designed to help consumers make in-depth comparisons among selected alternatives.
Based on theoretical and empirical work in marketing, judgment and decision making, psychology, and decision support systems, we develop a set of hypotheses pertaining to the effects of these two decision aids on various aspects of consumer decision making In particular, we focus on how use of the RA and CM affects consumers' search for product information, the size and quality of their consideration sets, and the quality of their purchase decisions in an online shopping environment. In sum, our findings suggest that interactive tools designed to assist consumers in the initial screening of available alternatives and to facilitate in-depth comparisons among selected alternatives in an online shopping environment may have strong favorable effects on both the quality and the efficiency of purchase decisions.


Date:

23rd of April 2009


Original Paper:

www.business.ual...


OWL2: New Features of the Web Ontology Language

based on OWL 2 Web Ontology Language by Various authors

Ratnesh Sahay

Abstract

The Web Ontology Language OWL has been a W3C recommendation since January 2004. However, there have been many requests from both practitioners and theoretician to improve and enhance the features of this language. This led to a reconsideration of the specification which is currently being worked on. The result of this work will be a new recommendation for the second version of the Web Ontology Language, called OWL 2. The specifications are almost in their final state and will be officially recommended by the end of 2009.



Date:

22nd of April 2009


Original Paper:

www.w3.org/2007/...


Hybrid NLG in a Generic Dialog System

based on Hybrid NLG in a Generic Dialog System by Martin Klarner

Brian Davis

Abstract

Natural Language Generation (NLG) systems are increasingly becoming available as "market-ready" products, mainly due to the now-removed boundary between shallow and deep generation and the emergence of hybrid systems as a de-facto standard. In this paper, we present Hyperbug, a novel approach towards hybrid NLG, coupling shallow and deep processing not only with respect to the resources used for parsing and generation, but also on the architectural level to increase the generative power of the shallow generation branch and the processing efficiency of the whole generation system. The architecture is discussed both in theory and in practice, using a comprehensive example spanning the complete output part of our dialog system.


Date:

08th of April 2009


Introduction to Description Logics

Antoine Zimmermann

Abstract

Description Logics (DLs) are considered one of the most important knowledge representation formalisms. Their flexibility and modularity, together with the amount of theoretical results and practical work on them, have made DLs the formalism of choice for the Web Ontology Language (OWL), which is used to provide meaning to Linked Data on the Semantic Web.


Date:

01st of April 2009


Flickr Tag Recommendation based on Collective Knowledge

based on Flickr Tag Recommendation based on Collective Knowledge by Börkur Sigurbjörnsson, Roelof van Zwol

Peyman Nasirifard

Abstract

Online photo services such as Flickr and Zooomr allow users to share their photos with family, friends, and the online community at large. An important facet of these services is that users manually annotate their photos using so called tags, which describe the contents of the photo or provide additional contextual and semantical information. In this paper we investigate how we can assist users in the tagging phase. The contribution of our research is twofold. We analyse a representative snapshot of Flickr and present the results by means of a tag characterisation focusing on how users tags photos and what information is contained in the tagging. Based on this analysis, we present and evaluate tag recommendation strategies to support the user in the photo annotation task by recommending a set of tags that can be added to the photo. The results of the empirical evaluation show that we can effectively recommend relevant tags for a variety of photos with different levels of exhaustiveness of original tagging.


Date:

25th of February 2009


Original Paper:

www2008.org/pape...


L2R: A Logical method for Reference Reconciliation

based on L2R: A Logical method for Reference Reconciliation by Fatiha Saïs, Nathalie Pernelle, Marie-Christine Rousset

Renaud Delbru

Abstract

The reference reconciliation problem consists in deciding whether different identifiers refer to the same data, i.e., correspond to the same world entity. The L2R system exploits the semantics of a rich data model, which extends RDFS by a fragment of OWL-DL and SWRL rules. In L2R, the semantics of the schema is translated into a set of logical rules of reconciliation, which are then used to infer correct decisions both of reconciliation and no reconciliation. In contrast with other approaches, the L2R method has a precision of 100% by construction. First experiments show promising results for recall, and most importantly significant increases when rules are added.



Date:

18th of February 2009


Original Paper:

www-lsr.imag.fr/...


Mixed Initiative Reasoning

Krystian Samp

Abstract

based on papers:
[1] J. F. Allen, Mixed-Initiative Interaction, IEEE Intelligent Systems, 1999
[2] E. Horvitz, Uncertainty, action, and interaction: in pursuit of mixed-initiative computing, IEEE Intelligent Systems, 1999
[3] C. I. Guinn, Evaluating mixed-initiative dialog, IEEE Intelligent Systems, 1999
[4] E. Horvitz, Principles of Mixed-Initiative User Interfaces, CHI 1999
[5] G. Tecuci, M. Boicu, M. T. Cox, Seven Aspects of Mixed-Initiative Reasoning, AAAI special issue on mixed-initiative assistants, 2007


Date:

11th of February 2009


Challenges on Distributed Web Retrieval

based on Challenges on Distributed Web Retrieval by R. Baeza-Yates, C. Castillo, F. Junqueira, V. Plachouras, F. Silvestri

Michele Catasta


Date:

17th of December 2008


Original Paper:

ieeexplore.ieee....


RDFa

based on RDFa W3C Recommendation by W3C

Benjamin Heitmann


Date:

03rd of December 2008


Original Paper:

www.w3.org/TR/rd...


SBVR Use cases

based on SBVR Use cases by Mark H. Linehan

Raluca Zaharia


Date:

19th of November 2008


Original Paper:

www.springerlink...


Index structures and algorithms for querying distributed RDF repositories

Brahmananda Sapkota

Abstract

based on the paper: "Index structures and algorithms for querying distributed RDF repositories". Proceedings of the 13th international conference on World Wide Web (WWW2004), 2004, by Stuckenschmidt, H., Vdovjak, R., Houben, G.J., Broekstra, J. http://portal.acm.org/citation.cfm?id=988672.988758


Date:

05th of November 2008


Index structures and algorithms for querying distributed RDF

based on Index structures and algorithms for querying distributed RDF by Heiner Stuckenschmidt, Richard Vdovjak, GeertJan, Jeen Broekstra

Brahmananda Sapkota


Date:

05th of November 2008


Original Paper:

portal.acm.org/f...


Extended Workflow Patterns

based on Workflow Control-Flow Patterns: A Revised View by N. Russell, A.H.M. ter Hofstede, W.M.P. van der Aalst, N. Mulyar

Armin Haller


Date:

08th of October 2008


Original Paper:

www.workflowpatt...


Categorization and Optimization of Synchronization Dependencies in Business Processes

based on Categorization and Optimization of Synchronization Dependencies in Business Processes by Quini Wu, Calton Pu, Akhil Sahai, Roger Barga

Gabriela Vulcu


Date:

17th of September 2008


Material (Slides):

ieeexplore.ieee....


Supporting the dynamic evolution of Web service protocols in service-oriented architectures

Maria Jose

Abstract

This presentation is based on the paper "Supporting the dynamic evolution of Web service protocols in service-oriented architectures". ACM Transactions on the Web (TWEB), Volume 2 , Issue 2 (April 2008) by Seung Hwan Ryu, Fabio Casati, Halvard Skogsrud, Boualem Benatallah, and Régis Saint-Paul.


Date:

10th of September 2008


Material (Slides):

portal.acm.org/c...


RESTful Web Services vs. ``Big'' Web Services: Making the Right Architectural Decision.

Maciej Zaremba

Abstract

based on the paper: Cesare Pautasso, Olaf Zimmermann, Frank Leymann. RESTful Web Services vs. ``Big'' Web Services: Making the Right Architectural Decision. WWW 2008.


Date:

27th of August 2008


Material (Slides):

www2008.org/pape...


DECLARE: Full Support for Loosely-Structured Processes

based on DECLARE: Full Support for Loosely-Structured Processes by Maja Pesic, Helen Schonenberg, Wil M.P. van der Aalst

ZhangBing Zhou


Date:

13th of August 2008


Original Paper:

is.tm.tue.nl/res...


Wishful Search: Interactive Composition of Data Mashups

based on Wishful Search: Interactive Composition of Data Mashups by Anton V. Riabov, Eric Bouillet, Mark D. Feblowitz, Zhen Liu, Anand Ranganathan

Sami Bhiri


Date:

30th of July 2008


Original Paper:

www2008.org/pape...


SA-REST and (S)mashups: Adding Semantics to RESTful Services

based on SA-REST and (S)mashups: Adding Semantics to RESTful Services by Amit Sheth, Karthik Gomadam, Jon Lathem

Nikos Loutas


Date:

09th of July 2008


Original Paper:

ieeexplore.ieee....


Process and Pitfalls in Writing Information Visualization Research Papers

based on Process and Pitfalls in Writing Information Visualization Research Papers by Tamara Munzner

Cosmin Basca


Date:

30th of April 2008


Original Paper:

www.cs.ubc.ca/la...


The Symbol Grounding Problem Has Been Solved. So What's next?

based on The Symbol Grounding Problem Has Been Solved. So What's next? by L. Steels

Antonio Aguilar


Date:

23rd of April 2008


Original Paper:

www.csl.sony.fr/...


Benchmarking Database Representations of RDF/S Stores

based on Benchmarking Database Representations of RDF/S Stores by Y. Theoharis, V. Christophides, G. Karvounarakis

Stéphane Corlosquet


Date:

16th of April 2008


Original Paper:

www.springerlink...


Empowering Software Maintainers with Semantic Web Technologies

based on Empowering Software Maintainers with Semantic Web Technologies by René Witte, Yonggang Zhang, Juergen Rilling

Tudor Groza


Date:

26th of March 2008


Original Paper:

www.eswc2007.org...


Efficient Search in Large Textual Collections with Redundancy

based on Efficient search in large textual collections with redundancy by Jiangong Zhang, Torsten Suel

Renaud Delbru


Date:

12th of March 2008


Original Paper:

portal.acm.org/c...


Minimal Deductive systems for RDF

based on Minimal Deductive Systems for RDF by Sergio Munoz, Jorge Pérez, Claudio Gutierrez

Ratnesh Sahay


Date:

27th of February 2008


Original Paper:

www.eswc2007.org...


Semantics and Complexity of SPARQL

Richard Cyganiak

Abstract

Perez et al., Semantics and Complexity of SPARQL. SPARQL is the W3C candidate recommendation query language for RDF. In this paper we address systematically the formal study of SPARQL, concentrating in its graph pattern facility. We consider for this study simple RDF graphs without special semantics for literals and a simplified version of filters which encompasses all the main issues. We provide a compositional semantics, prove there are normal forms, prove complexity bounds, among others that the evaluation of SPARQL patterns is PSPACE-complete, compare our semantics to an alternative operational semantics, give simple and natural conditions when both semantics coincide and discuss optimization procedures.


Date:

30th of January 2008


Material (Slides):

iswc2006.semanti...


Introducing Time into RDF

Thomas Krennwallner

Abstract

The talk will investigate the following paper:

Claudio Gutierrez, Carlos A. Hurtado, Alejandro A. Vaisman: Introducing Time into RDF. IEEE Trans. Knowl. Data Eng. 19(2): 207-218. 2007

http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4039284 



Date:

16th of January 2008


Ontology mining, evaluation and embedding in a larger modeling framework

Dr. Peter SpynsFree University of Brussels (VUB)


Date:

17th of December 2007


Answer Set Programming for the Semantic Web

Axel Polleres

Abstract

This is a fast-forward version of the Tutorial given at ESWC 2006 (see http://www.eswc2006.org/tutorials.html#tutorial1).  From this original full-day tutorial, we plan to cover Unit1, parts of Unit2, and Unit 4. This shall give a gentle introduction to Answer Set programming and also clarify some differences/relations between Answer Set Programming (and Logic
Programming in general) to SW languages like OWL and RDF. DERI


Date:

21st of November 2007


Material (Slides):

asptut.gibbi.com


Semantic Web technologies in life science and health care

Matthias Samwald

Abstract

The life science and health care sector has the potential to become a driving force for the development of the Semantic Web. The problems with information integration in the vast field spanning basic research on cells and proteins up to the development of new clinical therapies are starting to put a noticeable halt to scientific progress – putting the lives of patients waiting for new treatments in danger. The Semantic Web technologies are seen by many experts as one of the best long-term solutions to this problem.
The W3C Semantic Web Health Care and Life Sciences Interest Group has made
good progress in demonstrating the value of the Semantic Web for
large-scale information integration of heterogeneous information
resources, which will be presented in this talk.
My presentation will consist of three parts: First, I will showcase the
"HCLS Demo", a large infrastructure of linked RDF/OWL data from the life
sciences with hundreds of millions of RDF triples.
Second, I will highlight some of the specific approaches, needs and
problems encountered when dealing with Semantic Web technologies in the
life science area.
Third, I will point out similarities between current projects at DERI and
projects in the life science community – with the goal of identifying
overlaps and possibilities for future collaborations.

http://www.w3.org/2001/sw/hcls/
http://neuroscientific.net/curriculumW3C Semantic Web Health Care and Life Sciences


Date:

09th of October 2007


Reward Oriented Packet Filtering Algorithm for Wireless Sensor Networks

Lei Shu


Date:

03rd of October 2007


VIP Bridge: Leading Ubiquitous Sensor Networks to the Next Generation

Lei Shu


Date:

26th of September 2007


Stream Data Gathering in Wireless Sensor Networks within Expected Lifetime

Lei Shu


Date:

05th of September 2007


Foster, ME, White, M (2004). Techniques for text planning with XSLT.

Brian Davis


Date:

25th of July 2007


Material (Slides):

www.hcrc.ed.ac.u...


Software Testing

Radu Ciora


Date:

11th of July 2007


Material (Slides):

books.google.com...


Towards European Patient Summaries based on Triple Space Computing

Doug Foxvog


Date:

16th of May 2007


Material (Slides):

www.tripcom.org/...


Exploiting Similarity for Multi-Source Downloads using File Handprints

Sanaullah Nazir


Date:

09th of May 2007


Material (Slides):

www.cs.cmu.edu/~...


Analysis, Dissemination, Visualization, Insight and Semantic Enhancement

Andreas Harth


Date:

18th of April 2007


Material (Slides):

portal.acm.org/c...


Context-Awareness in Health Care: A Review

Ratnesh Sahay


Date:

11th of April 2007


Material (Slides):

dx.doi.org


Web Service Mining and Verification of Properties: An Approach Based on Event Calculus

Walid Gaaloul


Date:

04th of April 2007


Material (Slides):

www.springerlink...


Introduction to Social Network Analysis

Sheila Kinsella


Date:

28th of March 2007


Material (Slides):

sw.deri.org/wiki...


Sensemaking - Information Understanding in Large Document Collections Note: rescheduled to Thursday!

Tomasz Woroniecki


Date:

22nd of March 2007


Material (Slides):

csdl.computer.or...


Human Computation: Play a Game to Develop an Ontology

Peyman Nasirifard


Date:

14th of March 2007


Material (Slides):

www.cs.cmu.edu/~...


The Scientific Method in Software Evaluation

Benjamin Heitmann, Eyal Oren


Date:

06th of March 2007


Material (Slides):

sw.deri.org/wiki...


On (proper) Annotation Methodology or How Linguistics Overcame the Arbitrary Age

Alexander Schutz


Date:

28th of February 2007


Material (Slides):

sw.deri.org/wiki...


Towards Semantic-driven, Flexible and Scalable Framework for Peering and Querying e-Catalog Communities (Pt. II)

Sami Bhiri


Date:

21st of February 2007


Spreading Activation

Smitashree Choudhury


Date:

07th of February 2007


Material (Slides):

sw.deri.org/wiki...


Speech Acts

Simon Scerri


Date:

31st of January 2007


Material (Slides):

sw.deri.org/wiki...


Introduction to eLearning

Siobhán Dervan


Date:

24th of January 2007


Material (Slides):

internettime.com...


Introduction to P2P Systems (Pt III)

Manfred Hauswirth


Date:

13th of December 2006


IT Competitive Strategy: A General Overview of Strategic Alignment

Tadhg Nagel


Date:

06th of December 2006


Material (Slides):

sw.deri.org/wiki...


Paper: "Toward Human-Level Machine Intelligence,"

Laurentiu Vasiliu


Date:

29th of November 2006


Introduction to P2P Systems (Pt II) - starts at 13:00!!

Manfred Hauswirth


Date:

22nd of November 2006


Introduction to P2P Systems (Pt I)

Manfred Hauswirth


Date:

15th of November 2006


Material (Slides):

lsirpeople.epfl....


Paper: "A Survey of Approaches to Automatic Schema Matching"

Xia Wang


Date:

01st of November 2006


Material (Slides):

sw.deri.org/wiki...


Paper: Towards Ontologies for Formalizing Modularization and Communication in Large Software Systems

Matthew Moran


Date:

11th of October 2006


Material (Slides):

www.aifb.uni-kar...


Paper: The Semantic Web - The Roles of XML and RDF

Knud Möller


Date:

04th of October 2006


Material (Slides):

citeseer.ist.psu...


Paper: Content-Based Multimedia Information Retrieval: State of the Art and Challenges

Fergal Monaghan


Date:

20th of September 2006


Material (Slides):

portal.acm.org/f...


The Role of Myth in Information System Design

Bill McDaniel


Date:

13th of September 2006


Browsing information with TreeMaps

Sebastian Ryszard Kruk


Date:

06th of September 2006


Material (Slides):

del.icio.us/skru...


A user-interface framework for text searches

Kieran Hannon


Date:

30th of August 2006


Material (Slides):

www.dlib.org/dli...


Book: Designing Scalable Websites: Building, Scaling, and Optimizing the Next Generation of Web Applications; Cal Henderson

Gearoid Hynes


Date:

16th of August 2006


Material (Slides):

sw.deri.org/wiki...


Web Service Registries

Kashif Iqbal


Date:

26th of July 2006


Material (Slides):

www.infosys.tuwi...


CLIE - Controlled Language for Information Extraction

Brian Davis


Date:

19th of July 2006


Material (Slides):

gate.ac.uk/sale/...


Linking in the Wild

Ann Johnston


Date:

11th of July 2006


Material (Slides):

dx.doi.org/10.10...


Human Memory Augmentation

Cliodhna Hurst


Date:

07th of July 2006


Material (Slides):

sw.deri.org/wiki...


The Social Web: Creating An Open Social Network with XDI

Ina O'Murchu


Date:

03rd of July 2006


Material (Slides):

journal.planetwo...


Complexity and expressive power of logic programming

Andreas Harth


Date:

30th of June 2006


Business Process Management

Armin Haller


Date:

14th of June 2006


Ways of Capturing and Calculating Trust in On-line Communities

Slawomir Grzonkowski


Date:

31st of May 2006


The database research self-assessment

Sami Bhiri


Date:

24th of May 2006


Material (Slides):

portal.acm.org/c...


Expanding the Notion of Links

Pat Croke


Date:

17th of May 2006


Material (Slides):

portal.acm.org/c...


XUL - XML User Interface Language

Uldis Bojars


Date:

10th of May 2006


Material (Slides):

sw.deri.org/wiki...


PageRank and Related Methods

John Breslin


Date:

12th of April 2006


Material (Slides):

sw.deri.org/~jbr...


Ontologizing EDI Semantics

Doug Foxvog


Date:

04th of April 2006


History of Programming Languages

Tudor Groza


Date:

28th of March 2006


Material (Slides):

sw.deri.org/wiki...


Domain Modelling

Antonio Aguilar


Date:

21st of March 2006


History of the WWW

Eyal Oren


Date:

14th of March 2006


Material (Slides):

sw.deri.org/wiki...


Visualization Providers for ontologies elements, based on wikis. A practical approach with demo included

Mariano Rico


Date:

21st of February 2006


Material (Slides):

sw.deri.org/wiki...


Firefox Extensions

Kim Tighe


Date:

14th of February 2006


Paper: D. Maynard, M. Yankova, A. Kourakis, and A. Kokossis. Ontology-based information extraction for market monitoring and technology watch. In ESWC Workshop ?End User Aspects of the Semantic Web?, Heraklion, Crete, 2005.

VinhTuan Thai


Date:

07th of February 2006


Material (Slides):

kmi.open.ac.uk/e...


DEMO - Design & Engineering Methodology for Organizations

Eyal Oren


Date:

31st of January 2006


Semantic Peer-to-Peer Overlay Networks

Brahmananda Sapkota


Date:

24th of January 2006


Material (Slides):

sw.deri.org/wiki...


Metadata Discourse in Business Reports

Sean O'Riain


Date:

13th of December 2005


Context

Ann Johnston


Date:

06th of December 2005


Automated Business-to-Business Integration of a Logistics Supply Chain using Semantic Web Services Technology

Paavo Kotinurmi


Date:

29th of November 2005


Material (Slides):

sw.deri.org/wiki...


Metadata-enabled File Systems

Knud Möller


Date:

22nd of November 2005


Material (Slides):

sw.deri.org/wiki...


Annotea

Pat Croke


Date:

01st of November 2005


WSMO Ontology Management

Mick Kerrigan


Date:

25th of October 2005


Distributed Query Processing

Matteo Magni


Date:

27th of September 2005


Conversations with Web Services

Juan Miguel Gomez


Date:

20th of September 2005


Collaborative Ontology Management

Michal Wozniak, Pawel Szczecki, Piotr Piotrowski


Date:

13th of September 2005


Semantic Wikis

Eyal Oren


Date:

06th of September 2005


Jack

Ann Johnston


Date:

12th of July 2005


SWS Digital Libraries

Sebastian Ryszard Kruk


Date:

05th of July 2005


Spatial Predicates for Semantic Web Metadata

Marco Neumann


Date:

28th of June 2005


AJAX

Kim Tighe


Date:

16th of June 2005


SIOC

Uldis Bojars


Date:

17th of May 2005


Speech Acts and Semantic Communication

Lars Ludwig


Date:

10th of May 2005


Co-location

Cliodhna Hurst


Date:

03rd of May 2005


Social Networking and Collaboration Tools

Ina O'Murchu


Date:

26th of April 2005


RSS Blogging on the SW

John Breslin


Date:

19th of April 2005


WS Reliability

Aine Leddy


Date:

05th of April 2005


Jump to top.
Valid XHTML 1.0 TransitionalRDF Resource Description Framework Icon
(C) Copyright 2004-2009 by the Digital Enterprise Research Institute (DERI). All rights reserved.
DERI Locations
Events
image: Linked AI: AAAI Spring Symposium - Linked Data Meets Artificial Intelligence 1
AAAI Spring Symposium
March 22-24, 2010
Stanford, CA
image: 25th Symposium On Applied Computing 1
SAC
March 22-26, 2010
Sierre, Switzerland
image: 19th International WWW Conference 1
WWW2010
April 26-30, 2010
Raleigh Convention Center, NC, U.S.A.