Cloze: A Building Metadata Model Generation System based on Information Extraction

Fang He, Dan Wang

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

Abstract

Recently, we have seen a flourish of data-driven building applications. It has also been noted that the main effort in application development today is on data preprocessing. More specifically, buildings have entities, e.g., a chiller. To extract their data values or to control the entities, applications need to refer to the metadata of a building, i.e., the data describe the entities in a building. Data preprocessing organizes the raw metadata of a building into a form that can be easily recognized by applications. Different buildings have different metadata conventions. Data preprocessing today is largely an ad hoc process and is manually done in a building-by-building manner. How to automate data preprocessing is challenging. In this paper, we first formulate a problem on converting building raw metadata with ad hoc conventions into a building metadata model that follows a standard convention, e.g., the Brick metadata schema. Depending on application scenarios, we present three variants of the problem. This problem is intrinsically a text analysis problem. We thus propose to leverage the information extraction paradigm, a type of document processing to extract structured information from unstructured documents/texts. We analyze real-world building metadata and present a set of challenges on corpus denoise, coreference resolution, disambiguity, etc. We develop a system, Cloze with corresponding solutions. We evaluate Cloze with six real-world buildings. Our results show that Cloze can learn and automatically recognize raw metadata and their relations with an accuracy of 96.3%.

Original languageEnglish
Title of host publicationBuildSys 2022 - Proceedings of the 2022 9th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation
PublisherAssociation for Computing Machinery, Inc
Pages109-118
Number of pages10
ISBN (Electronic)9781450398909
DOIs
Publication statusPublished - 9 Nov 2022
Externally publishedYes
Event9th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, BuildSys 2022 - Boston, United States
Duration: 9 Nov 202210 Nov 2022

Publication series

NameBuildSys 2022 - Proceedings of the 2022 9th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation

Conference

Conference9th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, BuildSys 2022
Country/TerritoryUnited States
CityBoston
Period9/11/2210/11/22

Bibliographical note

Publisher Copyright:
© 2022 ACM.

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 8 - Decent Work and Economic Growth
    SDG 8 Decent Work and Economic Growth

Keywords

  • Smart Building
  • Data Model
  • Metadat
  • Information Extraction

Fingerprint

Dive into the research topics of 'Cloze: A Building Metadata Model Generation System based on Information Extraction'. Together they form a unique fingerprint.

Cite this