Andreas Heß // Publications // 2022 :: 2021 :: 2020 :: 2018 :: 2010 :: 2009 :: 2008 :: 2007 :: 2006 :: 2005 :: 2004 :: 2003 :: 2001
Journal of Intelligent Manufacturing volume 33, pages 555–573 (2022)
Proceedings of the 24th International Conference on Enterprise Information Systems (ICEIS 2022) : Volume 1
Proceedings of the 5th International Conference on Computer-Human Interaction Research and Applications - CHIRA, 28-29 October 2021
Data Analytics 2021 : The Tenth International Conference on Data Analytics, October 3 - 7, 2021, Barcelona, Spain
22nd International Conference on Enterprise Information Systems, May 5-7, 2020.
GNDCon 2018, Frankfurt, Germany, December 3-4, 2018.
World Library and Information Congress: 76th IFLA General Conference and Assembly, Gothenburg, Sweden, August 10-15, 2010.
Catalog entry at the German National Library
Full Paper:
Presentation slides:
The ever-growing amount of content and knowledge published online makes it possible for libraries to complement their own data and to present their collections in novel ways. Conceptually related information can be semantically linked so that users may benefit from richer data collections and novel search possibilities that capitalize on the inherent relationships between media, local metadata and external information sources. This paper presents potential solutions for the fundamental challenges of integrating heterogeneous data sources and providing innovative semantic search approaches, as they are developed for libraries and multimedia archives within the CONTENTUS project.
Advances in Focused Retrieval, 7th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2008, Dagstuhl Castle, Germany, December 15-18, 2008. Revised and Selected Papers
© Springer Verlag, Lecture Notes in Computer Science
(The market for internet search engines, in German)
In: Handbuch Internet-Suchmaschinen, Dirk Lewandowski (Hrsg.), AKA Verlag Heidelberg
Catalog entry at the German National Library
31st Annual German Conference on Artificial Intelligence (KI 2008), Kaiserslautern, Germany
© Springer Verlag, Lecture Notes in Computer Science
We introduce a new stacking-like approach for multi-value classification. We apply this classification scheme using Naive Bayes and Rocchio classifiers on the well-known Reuters dataset. We use part-of-speech tagging for stopword removal. We show that our setup performs just as well as other approaches that use the full article text even though we only classify headlines. Finally, we apply a Rocchio classifier on a dataset from a Web 2.0-site and show that it is suitable for semi-automated labelling (often called tagging) of short texts and is faster than other approaches.
Workshop Nutzerinteraktion im Social Semantic Web bei der Tagung Mensch & Computer, Lübeck, Germany
ESWC 2008 Workshop on Collective Semantics: Collective Intelligence and the Semantic Web (CISWeb 2008), Tenerife, Spain
Full Paper:
Presentation slides:
Web 2.0 and Semantic Web are regarded as two complementary paradigms that will probably converge in the future. However, whereas the Semantic Web is an established field of research, there has been little analysis devoted to Web 2.0 applications. For this reason it remains unclear how the advantages of both paradigms could be merged. In this paper we make three contributions in this direction. First, we discuss why merging Web 2.0 and the Semantic Web is beneficial and propose five approaches. Second, we show that (semi-) automated tagging of content improves the quality of annotations. Third, we present an automatic approach for improving the tag quality by using duplicate detection techniques. We verify our approach on a large-scale data set from the social search service Lycos iQ (now COSMiQ).
Software, Agents and Services for Business, Research and
E-Sciences — Conference on Social Semantic Web
(SABRE/CSSW 2007), Leipzig,
Germany
In recent years social bookmark systems like del.icio.us or Furl have become increasingly popular. These systems sometimes are regarded as alternatives to algorithmic search engines like Google. In this paper we develop seven theses on the potential of these systems in order to establish a conceptual basis for future research in this area. Thereby it becomes clear that social bookmarking systems complement rather than threaten algorithmic search engines.
3rd European Semantic Web Conference (ESWC 2006), Budva, Montenegro
© Springer Verlag, Lecture Notes in Computer Science
We present a new iterative algorithm for ontology mapping where we combine standard string distance metrics with a structural similarity measure that is based on a vector representation. After all pairwise similarities between concepts have been calculated we apply well-known graph algorithms to obtain an optimal matching. Our algorithm is also capable of using existing mappings to a third ontology as training data to improve accuracy. We compare the performance of our algorithm with the performance of other alignment algorithms and show that our algorithm can compete well against the current state-of-the-art.
PhD Thesis, University College Dublin, School of Computer Science and Informatics
Catalog entry at the German National Library
Advisor: Nicholas Kushmerick
The World Wide Web offers an ocean of information and services for everyone, and many of us rely on it on a daily basis. One of the reasons for the Web's success is that the threshold for creating content on the Web was always very low, because comfortable and easy to use HTML editors are available. However, today's Web is limited by the fact that it is machine-readable, but not machine-understandable. The Semantic Web promises a solution to this problem by adding explicit semantics with the goal of making the Web machine-understandable by using description logics and ontologies. However, the threshold for creating this extra markup is very high, and no comfortable tools exist at present. The goal of this work is to develop such tools. Machine Learning techniques have been used in literature for various classification tasks. The central claim of this thesis is that such algorithms, supervised and unsupervised, can be used for this purpose.
Chapter in Stuckenschmidt et al.: Alignment implementation and benchmarking results, pages 7—10, Knowledge Web Network of Excellence, Deliverable D2.2.4
The 22nd International Conference on Machine Learning (ICML 2005), Bonn, Germany
We propose a novel ensemble learning algorithm called Triskel, which has two interesting features. First, Triskel learns an ensemble of classifiers, each biased to have high precision on instances from a single class (as opposed to, for example, boosting, where the ensemble members are biased to maximise accuracy over a subset of instances from all classes). Second, the ensemble members' voting weights are assigned so that certain pairs of biased classifiers outweigh the rest of the ensemble, if their predictions agree. Our experiments demonstrate that Triskel often outperforms boosting, in terms of both accuracy and training time. We also present an ROC analysis, which shows that Triskel's iterative structure corresponds to a sequence of nested ROC spaces. The analysis predicts that Triskel works best when there are concavities in the ROC curves; this prediction agrees with our empirical results.
Please see also the Triskel project pages for a reference implementation and more empirical results.
6th International Workshop on Multiple Classifier Systems (MCS 2005), Monterey Bay, California, USA
© Springer Verlag, Lecture Notes in Computer Science
Please see also the Triskel project pages for a reference implementation and more empirical results.
Dagstuhl Seminar on Machine Learning for the Semantic Web
3rd International Semantic Web Conference (ISWC2004), Hiroshima, Japan
© Springer Verlag, Lecture Notes in Computer Science
Full paper:
Presentation slides:
RDF annotated abstract:
Demonstration Paper:
Poster:
Flyer:
The Semantic Web Services vision requires that each service be annotated with semantic metadata. Manually creating such metadata is tedious and error-prone, and many software engineers, accustomed to tools that automatically generate WSDL, might not want to invest the additional effort. We therefore propose ASSAM, a tool that assists a user in creating semantic metadata for Web Services. ASSAM is intended for service consumers who want to integrate a number of services and therefore must annotate them according to some shared ontology. ASSAM is also relevant for service producers who have deployed a Web Service and want to make it compatible with an existing ontology. ASSAM's capabilities to automatically create semantic metadata are supported by two machine learning algorithms. First, we have developed an iterative relational classification algorithm for semantically classifying Web Services, their operations, and input and output messages. Second, to aggregate the data returned by multiple semantically related Web Services, we have developed a schema mapping algorithm that is based on an ensemble of string distance metrics.
15th European Conference on Machine Learning (ECML2004), Pisa, Italy
© Springer Verlag, Lecture Notes in Computer Science
Full paper:
Presentation slides:
For the classification of relational data, iterative algorithms that feed back predicted labels of associated objects have been used. In this paper we show two extensions to existing approaches. First, we propose to use two separate classifiers for the intrinsic and the relational (extrinsic) attributes and vote their predictions. Second, we introduce a new way of exploiting the relational structure. When the extrinsic attributes alone are not sufficient to make a prediction, we train specialised classifiers on the intrinsic features and use the extrinsic features as a selector. We apply these techniques to the task of semi-automated Web Service annotation, a task with a rich relational structure.
Workshop on Information
Integration on the Web (IIWeb2004),
held in conjunction with the
30th International Conference on Very Large Data Bases (VLDB2004), Toronto, Canada
AAAI Spring Symposium Semantic Web Services 2004, Stanford, California, USA
Full paper:
Presentation slides:
2nd International Semantic Web Conference (ISWC2003), Sanibel Island, Florida, USA
© Springer Verlag, Lecture Notes in Computer Science
Full paper:
Presentation slides:
RDF annotated abstract:
Emerging Web standards promise a network of heterogeneous yet interoperable Web Services. Web Services would greatly simplify the development of many kinds of data integration and knowledge management applications. Unfortunately, this vision requires that services describe themselves with large amounts of semantic metadata "glue". We explore a variety of machine learning techniques to semi-automatically create such metadata.
We make three contributions. First, we describe a Bayesian learning and inference algorithm for classifying HTML forms into semantic categories, as well as assigning semantic labels to the form's fields. These techniques are important as legacy HTML interfaces are migrated to Web Services. Second, we describe the application of the Naive Bayes and SVM algorithms to the task of Web Service classification. We show that an ensemble approach that treats Web Services as structured objects is more accurate than an unstructured approach. Finally, we describe a clustering algorithm that automatically discovers the semantic categories of Web Services. All of our algorithms are evaluated using large collections of real HTML forms and Web Services.
Workshop
on Information Integration on the Web (IIWeb2003)
at the 18th
International Joint Conference on Artificial Intelligence (IJCAI2003),
Acapulco, Mexico
(Topic extraction from semantic networks, in German)
Diplomarbeit (Diploma Thesis), Fachhochschule Darmstadt (University of Applied Sciences), Fachbereich Informatik (Department of Computer Science)