KEOD 2022 Abstracts


Full Papers
Paper Nr: 4
Title:

Evaluation of a System for Named Entity Recognition in a Knowledge Management Ecosystem

Authors:

Philippe Tamla, Florian Freund, Matthias Hemmje and Paul M. Kevitt

Abstract: In this research paper, the evaluation of a new system for machine learning-based Named Entity Recognition is presented. After introducing our approach supporting two fundamental tasks for training Named Entity Recognition models using machine learning (data cleanup and data annotation), our features, prototype and evaluation methodologies are described. Also, the results of our performed quantitative and qualitative experiments validating our approach and user interface are shown. Finally, our evaluation results are discussed to derive challenges for future work.
Download

Paper Nr: 7
Title:

Knowledge Graph-based Product Recommendations on e-Commerce Platforms

Authors:

André G. Regino, Rodrigo O. Caus, Victor Hochgreb and Julio D. Reis

Abstract: The amount of data generated in e-commerce sales has expressively grown in the last few years. Online stores often receive questions about products related to price, guarantee, and shipping price. By reducing time for prompt answering, stores can improve customer satisfaction and sales conversion rate. The recommendation of available alternative products in case of product unavailability intended by the customer plays a key role in sales growth in this context. This article defines and evaluates a technique for product recommendation based on the product’s facts stored in Knowledge Graphs (KGs). Our KG is filled with facts from natural language questions and answers processed from the e-commerce platform. We exemplify our proposal in a real-world solution, using data from online stores processed by GoBots, a leading e-commerce chatbot business in Latin America. Online sellers assessed the results of the recommendations to evaluate their quality.
Download

Paper Nr: 9
Title:

Inferring New Information from a Knowledge Graph in Crisis Management: A Case Study

Authors:

Julie Bu Daher, Tom Huygue, Nathalie Hernandez and Patricia Stolf

Abstract: Natural crises are dangerous events that can threaten lives and lead to severe damages. Crisis-related data can be heterogeneous and be provided from multiple data sources. These data can be formally described using ontologies and then integrated and structured forming knowledge graphs. Inferring new information from knowledge graphs can strongly assist in the various phases of the crisis management process. Different approaches exist in the literature for inferring new information from knowledge graphs. In this paper, we present a case study of a flood crisis where we discuss three approaches for inferring flood-related information, and we experimentally evaluate these approaches using real flood-related data and synthetic data for further analysis. We discuss the interest of using each of these approaches and detail its advantages as well as its limitations.
Download

Paper Nr: 12
Title:

QART: A Framework to Transform Natural Language Questions and Answers into RDF Triples

Authors:

André G. Regino, Rodrigo O. Caus, Victor Hochgreb and Julio D. Reis

Abstract: Knowledge Graphs (KGs) model real-world things and their interactions. Several software systems have recently adopted the use of KGs to improve their data handling. E-commerce platforms are examples of software exploring the power of KGs in diversified tasks, such as advertisement and product recommendation. In this context, generating trustful, meaningful and scalable RDF triples for populating KGs remains an arduous and error-prone task. The automatic insertion of new knowledge in e-commerce KGs is highly dependent on data quality, which is often not available. In this article, we propose a framework for generating RDF triple knowledge from natural language texts. The QART framework is suited to extract knowledge from Q&A regarding e-commerce products and generate triples associated with it. QART produces KG triples reliable to answer similar questions in an e-commerce context. We evaluate one of the key steps in QART to generate summary sentences and identify product Q&A intents and entities using templates. Our research results reveal the major challenges faced in building and deploying our framework. Our contribution paves the way for the development of automatic mechanisms for text-to-triple transformation in e-commerce systems.
Download

Paper Nr: 20
Title:

An Ontology for Data Regulation

Authors:

Guillaume Delorme, Guilaine Talens and Eric Disson

Abstract: The recent upsurge enactment of regulations seeking to regulate data processing induces a complexification of compliance management for regulated firms. Firms wishing to implement efficient, cost effective and compliant information security and risk management require an increased comprehension of regulatory requirements. Following a previous paper defining Data Regulation Risk, this paper describes an ontology to apprehend the business and operational impacts of regulatory requirements. The ontology is structured to handle various firms’ legal context while remaining agnostic of risk management methodologies.
Download

Paper Nr: 31
Title:

Toward Building a Bilingual Dictionary for Libyan Dialect-modern Standard Arabic Machine Translation

Authors:

Husien Alhammi and Kais Haddar

Abstract: In this paper, a method for building a bilingual dictionary that can be used to translate words and phrases from one dialect to the native language is described. Obviously, dialects and their main languages have many features in common in terms of linguistic structure, lexicon, morphology, and so on. As a result, the method of creating a bilingual dictionary to translate texts from one language into another differs significantly from a method that is used to build a bilingual dictionary for translating dialects into their native languages. To this end, a specific dictionary including some linguistic information must be built to translate dialects into their native languages. In this paper, we discuss the main idea of the method that is used to build a bilingual dictionary to translate Libyan Dialect (LD) into its original language, Modern Standard Arabic (MSA). The advantages of the method are discussed.
Download

Paper Nr: 32
Title:

Debunking the Stereotypical Ontology Development Process

Authors:

Achim Reiz and Kurt Sandkuhl

Abstract: Ontologies facilitate meaning between human and computational actors. On the one hand, the underlying technology can be considered mature. It has a standardized language, established tools for editing and sharing, and broad adoption in practice and research. On the other hand, we still know little about how these artifacts evolve over their lifetime, even though knowledge of the development process could influence quality control. It would enable us to give knowledge engineers better modeling or selection guidelines. This paper examines the evolution of computational ontologies using ontology metrics. First, we gathered hypotheses on the ontology development process. We assume that groups of ontologies follow a similar development pattern and that a stereotypical development process exists. Afterward, these hypotheses are tested against historical metric data from 7053 versions from 69 dormant ontologies. We will show that ontology development processes are highly heterogeneous. While the made hypotheses are partly true for a slight majority of ontologies, concluding the bigger picture of ontology development down to the individual ontologies is mostly not possible.
Download

Paper Nr: 35
Title:

A System to Correct Toxic Expression with BERT

Authors:

Motonobu Yoshida, Kazuyuki Matsumoto, Minoru Yoshida and Kenji Kita

Abstract: This paper describes a system for converting posts with toxic expression on social media, such as those containing slander and libel, into less-toxic sentences. In recent years, the number of social media users as well as the cases of online flame wars has been increasing. Therefore, to prevent flaming, we first use a prediction model based on Bidirectional Encoder Representations from Transformers (BERT) to determine whether a sentence is likely to be flamed before it is posted. The highest classification accuracy recorded 82% with the Japanese Spoken Language Field Adaptive BERT Model (Japanese Spoken BERT model) as a pre-trained model. Then, for sentences that are judged to be toxic, we propose a system that uses BERT’s masked word prediction to convert toxic expressions into safe expressions, thereby converting them into sentences with mitigated aggression. In addition, the BERTScore is used to quantify whether the meaning of the converted sentence has changed in meaning compared to the original sentence and evaluate whether the modified sentence is safe while preserving the meaning of the original sentence.
Download

Short Papers
Paper Nr: 3
Title:

Named Entity Recognition for the Extraction of Emerging Technological Knowledge from Medical Literature

Authors:

Sabrina Lamberth-Cocca, Bernhard Maier, Christian Nawroth, Paul M. Kevitt and Matthias Hemmje

Abstract: In this paper, we show the results of an experimental Information Retrieval System (IRS) prototype to support the detection of emerging medical technology using the method of Named-Entity Recognition (NER). The overall goal is to automatically identify and classify entities and structures in scientific medical articles, which represent the concept of Medical Technologies (MedTech) with high topicality. As a first approach, we combine learning-based NER with rule-based emerging Named-Entity Recognition (eNER). We train a machine-learning model on manually annotated NER candidates representing medical devices. We then match the results with entries from vocabularies containing medical devices according to our definition, using a handcrafted rule-based approach and fuzzy functions. The main outcome is an experimental prototype which we call, MedTech-eNER-IRS, which shows that such an approach works in general, including pointers for further research and prototype improvements.
Download

Paper Nr: 5
Title:

Ontological View-driven Intensional Semantic Integration for Information Systems in a Decentralized Environment

Authors:

Fateh A. Adhnouss, Husam A. El-Asfour, Kenneth McIsaac, Idris El-Feghia, Raafat Aburukba and AbdulMutalib Wahaishi

Abstract: Ontologies are an essential component of semantic integration approaches for information systems . In a decentralized environment, each specification of the domain reflects an Ontological view. However, the semantics characterization of information systems in such decentralized environment poses a significant issue related to their integration. Information systems are viewed as independent intensional entities with their own beliefs, distinct from those held by others. Such autonomy is distorted by traditional extensional semantics. Other entities’ beliefs are introduced into a given entity, thus affecting their beliefs. Additionally, the information that one entity provides to another entity may not be consistent with the information known by the latter. We need an alternative semantics for information integration, which is not dependent on the extension, but rather on the underlying conceptualization. This paper proposes a classification of the environment where the information systems lives and a novel modelling paradigm for information integration using intensional logic to model the ontological views. The model comprises a formal modeling approach for the conceptualization as well as for the semantic integration process.
Download

Paper Nr: 10
Title:

Towards a Unified Multilingual Ontology for Rhetorical Figures

Authors:

Yetian Wang, Ramona Kühn, Randy A. Harris, Jelena Mitrović and Michael Granitzer

Abstract: Formal ontologies for rhetorical figures have been developed to improve the computational detection for different applications in the area of Natural Language Processing, such as hate speech and fake news detection, argumentation mining, and sentiment analysis. The existing ontologies all model different aspects of rhetorical figures, thus creating a variety of formalisms and in the worst case, creating incompatibilities and contradictory representations. In this paper, we focus on figures of perfect lexical repetition and their representation in three ontologies in three different languages: The Ploke ontology, the Serbian RetFig, and the German GRhOOT ontology. We combine those ontologies to benefit from synergy effects and create a multilingual, coherent, robust, and modular ontology for rhetorical figures of perfect lexical repetition.
Download

Paper Nr: 13
Title:

Enterprise Transformation within Enterprise Engineering: Literature Review and Research Directions

Authors:

Shoji Konno and Junichi Iijima

Abstract: In response to the rapid changes in the world, such as digital transformation, there is a growing demand for efficient and effective enterprise transformation. Research artifacts related to the transformation have been increasingly emerging as new standardized several description artifacts suitable to provide practices for particular enterprise transformations. Enterprise has many aspects, such as their architecture, processes, and organizational form, but their efforts in transformation are focused on silos such as enterprise system modelling, and dynamic capabilities. In this study, we assess the availability of topics that support the transformation and the fitness of enterprise engineering for fulfilling the modelling and managing requirements. The review was carried out, finding 349 relevant papers and a list of the few aspects and topics for classifying the focus points of enterprise transformation. Based on the analysis and results of the review, brief suggestions to stimulate further research on the design, improvement, and application of the enterprise transformation management framework are also derived.
Download

Paper Nr: 15
Title:

Knowledge Integration for Commonsense Reasoning with Default Logic

Authors:

Priit Järv, Tanel Tammet, Martin Verrev and Dirk Draheim

Abstract: Commonsense reasoning in artificial intelligence is the problem of inferring decisions and answers regarding mundane situations. Several research groups have built large knowledge graphs with the goal of capturing some aspects of commonsense knowledge. Using these knowledge graphs for problem solving and question answering is a subject of active research. Our contribution is encoding and integrating knowledge graphs like Quasimodo, ConceptNet, and Wordnet for symbolic reasoning. A major challenge in symbolic commonsense reasoning is coping with contradictory and uncertain knowledge, which we handle by extending first order logic with numeric confidences and default logic rules. To our knowledge this is the first large scale commonsense knowledge base seriously using default logic. We give several examples of the proposed representation and solving questions on the basis of the knowledge base built.
Download

Paper Nr: 21
Title:

Situational Question Answering using Memory Nets

Authors:

Joerg Deigmoeller, Pavel Smirnov, Viktor Losing, Chao Wang, Johane Takeuchi and Julian Eggert

Abstract: Embodied Question Answering (EQA) is a rather novel research direction, which bridges the gap between intelligence of commonsense reasoning systems and reasoning over actionable capabilities of mobile robotic platforms. Mobile robotic platforms are usually located in random physical environments, which have to be dynamically explored and taken into account to deliver correct response to users’ requests. Users’ requests are mostly related to foreseeable physical objects, their properties and positional relations to other objects in a scene. The challenge here is to create an intelligent system which successfully maps the query expressed in natural language to a set of reasoning stems and physical actions, required to deliver the user a correct answer. In this paper we present an approach called Situational Question Answering (SQA), which enforces the embodied agent to reason about all available context-relevant information. The approach relies on reasoning over an explicit knowledge graph complemented by inference mechanisms with transparent, human-understandable explanations. In particular, we combine a set of facts with basic knowledge about the world, a situational memory, commonsense understanding, and reasoning capabilities, which go beyond dedicated object knowledge. On top, we propose a Semantics Abstraction Layer (SAL) that acts as intermediate level between knowledge and natural language. The SAL is designed in a way that reasoning functions can be executed hierarchically to provide complex queries resolution. To demonstrate the flexibility of the SAL we define a set of questions that require a basic understanding of time, space, and actions including related objects and locations. As an outlook, a roadmap on how to extend the question set for incrementally growing systems is presented.
Download

Paper Nr: 23
Title:

Evaluating and Exploring Text Fields Information Extraction into CIDOC-CRM

Authors:

Davide Varagnolo, Guilherme Antas, Mariana Ramos, Sara Amaral, Dora Melo and Irene P. Rodrigues

Abstract: This paper presents a method for extracting information from ISAD(G) elements, that contain semi-structured text descriptions. Natural language processing is done using Gate environment and defining the set of Jape rules necessary to process the text and extract the intended information. The evaluation of the information extraction processes is done in a sample of 800 records for each type of information, and a dataset that is manually built for each type of information considered, such as baptisms, passport requisitions testaments, etc. The implementation of several automatic information extraction processes enables the population of the CIDOC-CRM knowledge base with new linked events and entities automatically. The exploration of the information, migrated from DigitArq and extracted from text descriptions represented in CIDOC-CRM, is done through SPARQL queries enabling new visualisations of the archival records and the retrieval of information collected in different records from different archives.
Download

Paper Nr: 24
Title:

Knowledge Capture for the Design of a Technology Assessment Tool

Authors:

Daniela Oliveira and Kimiz Dalkir

Abstract: The design of technology assessment tools is an important Knowledge Management endeavour. Technology assessment is a subject where consensus is far from being achieved. Any project intended to create a technology assessment tool is expected to generate a lot of discussion or criticism. Among the most critical kinds of technology, Artificial Intelligence (AI) is a highly polemic kind of technology. Its impacts are important and multidisciplinary. Moreover, the technology evolves quickly and so do the attitudes toward that technology. Therefore, business owners intending to produce an AI assessment tool should expect extensive discussion of different points of view, but also support the continuation of the discussion throughout time and with different stakeholders. Surprisingly, technology assessment tools developed by business owners have been particularly neglected in the coalescent discussion about AI documentation, not to mention the support to create those tools. To foster a continuous innovation flow, business owners should pay particular attention to how discussions are captured. This paper explores the foundations of knowledge management initiatives to support the design of an artificial intelligence assessment tool at the business owner, in a process that supports continuous discussion and innovation. This article also suggests project aspects and supporting document structure.
Download

Paper Nr: 25
Title:

An Ontology for Ontology Metrics: Creating a Shared Understanding of Measurable Attributes for Humans and Machines

Authors:

Achim Reiz and Kurt Sandkuhl

Abstract: Measuring ontologies using metrics requires specialized software. While the past years saw various developments regarding tools and frameworks, these efforts mainly stayed isolated in their applied assessments. A paper measuring an ontology using the oQual framework is hardly comparable to one that applies the metrics from OntoQA. First, the performed calculations are often bound to the used tools, and second, the correct interpretation of ontology metrics requires a deep understanding of their measured aspects. Our research tackles these challenges by providing an ontology for ontology metrics. This artifact (A.) collects the various proposed ontology measurement frameworks with human-readable descriptions. It lets users quickly inform themselves on the assessments and aspects one can measure. (B) it formalizes the metric calculations. The framework metrics are connected to shared measurable elements, homogenizing the notations and languages. At last, (C.) the ontology is the backbone of the newly developed NEOntometrics application. The software uses the formalized metric descriptions to set up the calculations for the various frameworks. We believe our research can break the silos of different measurements, enable knowledge engineers to calculate various metrics quickly, and researchers to put new measurements into use through simple adaption of the metric ontology.
Download

Paper Nr: 28
Title:

Trustworthy Intelligent Systems: An Ontological Model

Authors:

J. I. Olszewska

Abstract: Nowadays, there is an increased use of AI-based technologies in applications ranging from smart cities to smart manufacturing, from intelligent agents to autonomous vehicles. One of the main challenges posed by all these intelligent systems is their trustworthiness. Hence, in this work, we study the attributes underlying Trustworthy Artificial Intelligence (TAI), in order to develop an ontological model providing an operational definition of trustworthy intelligent systems (TIS). Our resulting Trustworthy Intelligent System Ontology (TISO) has been successfully applied in context of computer vision applications.
Download

Paper Nr: 29
Title:

Extracting Business Insights through Dynamic Topic Modeling and NER

Authors:

Muhammad Arslan and Christophe Cruz

Abstract: Companies need the data from multiple sources for analysis and to find meaning in deriving valuable business insights. Online news articles are one of the main data sources that present up-to-date business news offered by various companies in the market. Topic modeling, i.e. a key player in the Natural Language Processing (NLP) domain, helps businesses to drive value from news articles. Also, it supports the extraction of business insights from news articles to facilitate the identification of trends (topics) in the market and their evolution over time. These insights can help businesses not only automate routine tasks but also in building new marketing policies, decision-making, and customer support. It is also important to find the linked semantic information (i.e. key persons, organizations, or regions called named entities) involved in generating these topics for the identification of news sources. This paper presents the application of a hybrid approach based on dynamic topic modeling and Named-Entity Recognition (NER) for extracting business trends along with the related entities. To show the functionality of the proposed approach, the news articles collected from the websites that published the content related to company interests were from 2017 to 2021 inclusive. The proposed approach can serve as the foundation for future exploratory trend analysis to study the evolution of information not only in the business domain but also applicable in other domains.
Download

Paper Nr: 34
Title:

Genetic Algorithm and Latent Semantic Analysis based Documents Summarization Technique

Authors:

Imen Tanfouri and Fethi Jarray

Abstract: Automatic text summarization (ATS) is the process of generating or extracting a shorter text of the original document while preserving relevant and important information. Nowadays, it is a hot research topic in natural language processing with various applications, including social networks and the healthcare domain. The task of summarizing can be divided into two categories, extractive and abstractive. In this paper, we are concerned with extractive summarization for a single Arabic document. In this contribution, we propose a combination of semantic and combinatorial methods to summarize a document by clustering its content through topic modeling techniques and subsequently generating an extractive summary for each of the identified topics using genetic algorithms. This approach ensures that the final summary covers all important topics in the document. We achieve state-of-the-art performance on the common Arabic summarization benchmark datasets. The obtained results show the effectiveness of combining genetic algorithms (GA) and latent semantic analysis (LSA) for document summarization.
Download

Paper Nr: 39
Title:

Simulation Model Validation based on Ontological Engineering Methods

Authors:

Elena Zamyatina, Denis Churin, Viacheslav Lanin, Lyudmila Lyadova and Nada Matta

Abstract: A task of the simulation models examination (verification and validation, V&V) is considered. At the V&V process the correspondence degree of the simulation model created by developers to the simulated object, that description is presented in the form of a conceptual model built by customers, is determined. An ontological approach is proposed to determine the semantic proximity of the simulation model and the conceptual model, whose descriptions are presented in the form of ontologies. Matching rules can also be defined with ontology based on the metrics chosen by the customer. The approach has been tested using the simulation system Triads. The results of the matching algorithm execution are illustrated by an example. The article provides description of the simulation model ontology created in TriadNS and conceptual model ontology, developed with MASK method. The metrics used for proximity assessment are described.
Download

Paper Nr: 41
Title:

Emotional States Management for an Advanced Intelligent Tutoring System

Authors:

Domenico Redavid, Stefano Ferilli, Liza Loop and Liudmyla Matviichuk

Abstract: One of the result of the application of Artificial Intelligence (AI) to e-learning environments are Intelligent Tutoring Systems (ITSs). A crucial aspect in the field of e-learning concerns emotional states, which importance is increasingly felt also at the level of educational institutions. On the other side, the Management of moods at Information Technology level is becoming more and more important because enable new scenarios where new innovative applications can be proposed. In this paper is described a possible framework able to manage emotional states that could be adopted as a solution to address the Personal, social and learning to learn competence, one of the eight key competences lifelong learning indicated by EU COUNCIL.
Download

Paper Nr: 11
Title:

Building Eurostat Knowledge Graph

Authors:

Alexandros Vassiliades, Nick Bassiliades, Georgios Meditskos and Kimon Spiliopoulos

Abstract: The evolution of Knowledge Graphs (KGs) has encouraged developers to create more and more context related KGs. This advance is extremely important because Artificial Intelligence (AI) applications can access domain specific information in a machine understandable format. In this paper, we present the conceptual model and semantics of the OWL ontology developed to capture information about the Eurostat website. The KG also contains some knowledge from the Organisation for Economic Co-operation and Development (OECD) website. We also describe how we constructed the ontology schema in order to capture all the data in Eurostat and some of the data in OECD, such as, articles, datasets, and internal connections between them, among others. Moreover, we show how we populated the KG with an automated process, resulting into a KG with more than 820K triples.
Download

Paper Nr: 16
Title:

Comparing Arguments in Discussions of Two Parliaments

Authors:

Mare Koit

Abstract: Estonian argument corpus includes verbatim records (in Estonian) of sessions held in the Parliament of Estonia (Riigikogu). Arguments used in discussions and inter-argument relations are annotated in the corpus. By using the corpus, argument structures (basic, convergent, divergent, linked, and hybrid) and inter-argument relations (rebuttal, attack, and support) are studied. For comparison, a discussion in the UK Parliament House of Commons is analysed. Similarities and differences are considered between arguments of both parliaments. Our further aim is extending the corpus in order to make it possible the automatic recognition of arguments in Estonian political texts and comparison of discussions in the Riigikogu with other parliaments and other languages.
Download

Paper Nr: 18
Title:

Large Class Arabic Sign Language Recognition

Authors:

Zakia Saadaoui, Rakia Saidi and Fethi Jarray

Abstract: Sign languages are as rich, complex and creative as spoken languages, and consist of hand movements, facial expressions and body language. Today, sign language is the language most commonly used by many deaf people and is also learned by hearing people who wish to communicate with the deaf community. Arabic sign language has been the subject of research activities to recognize signs and hand gestures using a deep learning model. A vision-based system by applying a deep neural network for letters and digits recognition based on Arabic hand signs is proposed in this paper.
Download

Paper Nr: 27
Title:

Ontology-based Representation of Digital Devices Used in Educational Guidance

Authors:

Marie Gribouval, Cecilia Zanni-Merk and Davy Monticolo

Abstract: School counseling is a decision-making process in which high school students have to decide what higher education course they will register for. Many digital devices support high school students in this guidance process. Given the diversity of architectures and services offered by existing devices, the main goal of this project is to predict their impact on high school students. It is necessary, therefore, to propose a formal characterization of digital devices for school counseling. This paper proposes an ontology to represent knowledge of digital guidance devices. An instantiation of the ontology is performed for each device to be studied. The ontology is built to be able to identify the different functions of the digital devices, their types, and the relationships between the tools of the digital devices and the guidance actors. The ontology includes the tools that the user may use, the topics covered by the device, the elementary components, and the support used (mobile application, computer application, or website).
Download

Paper Nr: 36
Title:

GLUON: A Reasoning-based and Natural Language Generation-based System to Explicit Ontology Design Choices

Authors:

Zakaria Mejdoul and Gaëlle Lortal

Abstract: The Semantic Web (SW) is an enhancement to the World Wide Web (WWW). It allows Humans to find, share and integrate information more easily. One of the Knowledge Representation technologies related to the SW standards is the ontology, which is an inventory of knowledge defining a universal or specific domain. Ontology construction requires expertise related to logics and to the expert domain the SW application is applied to. We aim to enable ontology wide adoption by bringing the end-users closer to their own ontology building choices, providing them with the possibility to build their ontology, to validate its consistency and to formally represent their knowledge without formal methods knowledge. In this paper, we detail our tool architecture combining SW technologies and Natural Language Generation (NLG) to support users in creating consistent ontologies.
Download

Paper Nr: 40
Title:

A Text Similarity Study: Understanding How Differently Greek News Media Describe News Events

Authors:

Nikos Kapellas and Sarantos Kapidakis

Abstract: Online news media usually cover the same news events. It would be interesting to understand, how similar are the descriptions of these events. In other words to explore how a distinct news event, is described by different news media. Can we safely conclude when two or more event descriptions refer to the same event? This research aims to investigate similarities in Greek news articles and provide new data on the field, as there is no relevant research. To do so, news article’s text is extracted, processed and analyzed with automated methods. To establish an understanding of similarity, three similarity settings are explored and special focus is given in examining the degree of similarity between news articles that cover the same event. In some cases results are inconclusive, while in others results show that groups of news media share the exact same articles with little or no modifications at all. The similarity analysis of news articles is a complex task and relies on many different factors that need to be addressed.
Download

Paper Nr: 42
Title:

Ontology for a Georeferencing Mobile System for Real Time Detection and Monitoring of Wildfires

Authors:

Dulce Pacheco, David Aveiro, Valentim Caires and Duarte Pinto

Abstract: This paper presents the Georeferencing Mobile Wildfire Detection System Ontology (GeMoWilDSOn). This ontology served as a base for implementing software for a mobile and georeferencing system for real-time detection and monitoring of wildfires in steep mountainous territories. On average, about 65,000 fires occur in Europe annually, burning approximately half a million hectares of wild land and forest areas. This growing tragedy directly reduces the forest biomass and biodiversity, causing severe damage to the ecosystems. Ontologies help developers speed up the requirements’ analysis in the design of a new system. Our work results in a streamlined ontology focused on fire prevention and fighting with mobile sensors, automatically georeferenced polygon data, and visible and thermal image captures, specially designed for steep mountainous terrain, where firefighting can be complex. Our research fills gaps found in related state-of-the-art and provides innovative contributions such as the concepts of manually drawn areas of fire and shadow, which are of utmost importance regarding this particularity of steep terrain. Our ontology was validated in three real-world tests where experts were delighted with the features, captured information, and its representation in the GUI of the developed system.
Download