Selected Publications
Book Section
- Transformer-Encoder-Based Mathematical Information Retrieval 2022
- RetroLive: Analysis of Relational Retrofitted Word Embeddings 2020
- A Genetic-Based Search for Adaptive Table Recognition in Spreadsheets 2019
- DECO: A Dataset of Annotated Spreadsheets for Layout and Table Recognition 2019
- Cardinality estimation with local deep learning models 2019
- Automatically Configuring Parallelism for Hybrid Layouts 2019
- XLIndy: Interactive Recognition and Information Extraction in Spreadsheets 2019
- Table Recognition in Spreadsheets via a Graph Representation 2018
- DebEAQ - Debugging Empty-Answer Queries On Large Data Graphs 2016
- A Machine Learning Approach for Layout Inference in Spreadsheets 2016
- Building the Dresden Web Table Corpus: A Classification Approach 2015
- Relaxation of Subgraph Queries Delivering Empty Results 2015
- SCIT: A Schema Change Interpretation Tool for Dynamic-Schema Data Warehouses 2015
- Top-k Entity Augmentation using Consistent Set Covering 2015
- A Framework for User-Centered Declarative ETL 2014
- Top-k Differential Queries in Graph Databases 2014
- Publish-time data integration for open data platforms 2013
- DeExcelerator: A Framework for Extracting Relational Data From Partially Structured Documents 2013
- A Domain-Specific Language for Do-It-Yourself Analytical Mashups 2012
- A Flexible Graph-Based Data Model Supporting Incremental Schema Design and Evolution 2012
- Energy-aware Data Stream Management 2011
- Evaluation of Load Scheduling Strategies for Real-Time Data Warehouse Environments 2010
- Cardinality estimation in ETL processes 2009
- Optimistic Coarse-Grained Cache Semantics for Data Marts 2006
- ATUN-HL: Auto Tuning of Hybrid Layouts Using Workload and Data Characteristics
- Cell Classification for Layout Recognition in Spreadsheets
- Context Similarity for Retrieval-Based Imputation
- Exploratory Ad-Hoc Analytics for Big Data
- IMITAL: Learned Active Learning Strategy on Synthetic Data
- Leveraging flexible data management with graph databases
- Modeling Customers and Products with Word Embeddings from Receipt Data
- WeakAL: Combining Active Learning and Weak Supervision
Article
- Intermediate Results Materialization Selection and Format for Data-Intensive Flows. Fundamenta informaticae, Vol.163(2), pp. 111-138. 2018
- Quality measures for ETL processes: from goals to implementation. Concurrency and computation, Vol.28(15), pp. 3969-3993. 2016
- Answering "Why Empty?" and "Why So Many?" queries in graph databases. Journal of computer and system sciences, Vol.82(1), pp. 3-22. 2016
- Considering User Intention in Differential Graph Queries. Journal of database management, Vol.26(3), pp. 21-40. 2015
- OPEN—Enabling Non-expert Users to Extract, Integrate, and Analyze Open Data. Datenbank-Spektrum: Zeitschrift für Datenbanktechnologie: Organ der Fachgruppe Datenbanken der Gesellschaft für Informatik e.V., Vol.12(2), pp. 121-130. 2012
- Frontiers in Crowdsourced Data Integration. Information technology (Munich, Germany), Vol.54(3), pp. 130-136. 2012
- Echtzeit-Data-Warehouse-Systeme. Datenbank-Spektrum: Zeitschrift für Datenbanktechnologie: Organ der Fachgruppe Datenbanken der Gesellschaft für Informatik e.V., Vol.11(3), pp. 207-211. 2011
- Partition-based workload scheduling in living data warehouse environments. Information systems (Oxford), Vol.34(4-5), pp. 382-399. 2009
Document
- To Softmax, or not to Softmax: that is the question when applying Active Learning for Transformer Models 2022
- A Cost-based Storage Format Selector for Materialization in Big Data Frameworks 2018
- Identifying And Weighting Integration Hypotheses On Open Data Platforms 2012
- ImitAL: Learning Active Learning Strategies from Synthetic Data
- RETRO: Relation Retrofitting For In-Database Machine Learning on Textual Data