Building a Construction Project Key-Phrase Network from Unstructured Text Documents
Abstract
During a construction project lifecycle, an extensive corpus of unstructured or semistructured text documents is generated. The nature of unstructured sources impedes users' acquisition, analysis, and reuse of relevant information in an integral form, leading to a possible reduction in project performance because of untimely or inadequate decisions. This paper explores the representation of information from unstructured documents in the form of a key-phrase network, intended to provide users with the possibility to visualize and analyze valuable project facts with less effort. A network of key phrases automatically extracted from various types of unstructured documents, with relations based on contextual similarity, was implemented as a graph database, enabling project participants to extract and visualize various patterns in data. With the objective of constructing a domain-independent key-phrase network with minimal expert involvement, an approach to detect key phrases in a multilingual environment was examined by using measures of association between words while avoiding text content from less informative contexts. A possible application is demonstrated using key-phrase networks generated from two complex international construction projects.
Keywords:
Unstructured data / Key phrase / Key-phrase network / Entropy / Relation / Visualization
Source:
Journal of Computing in Civil Engineering, 2017, 31, 6
Publisher:
- American Society of Civil Engineers (ASCE)
DOI: 10.1061/(ASCE)CP.1943-5487.0000708
ISSN: 0887-3801
WoS: 000426243200005
Scopus: 2-s2.0-85026636219
Collections
Institution/Community
GraFar
TY  - JOUR
AU  - Nedeljković, Đorđe
AU  - Kovačević, Miloš
PY  - 2017
UR  - https://grafar.grf.bg.ac.rs/handle/123456789/898
PB  - American Society of Civil Engineers (ASCE)
T2  - Journal of Computing in Civil Engineering
T1  - Building a Construction Project Key-Phrase Network from Unstructured Text Documents
IS  - 6
VL  - 31
DO  - 10.1061/(ASCE)CP.1943-5487.0000708
ER  -
@article{
  author = "Nedeljković, Đorđe and Kovačević, Miloš",
  year = "2017",
  publisher = "American Society of Civil Engineers (ASCE)",
  journal = "Journal of Computing in Civil Engineering",
  title = "Building a Construction Project Key-Phrase Network from Unstructured Text Documents",
  number = "6",
  volume = "31",
  doi = "10.1061/(ASCE)CP.1943-5487.0000708"
}
Nedeljković, Đ., & Kovačević, M. (2017). Building a Construction Project Key-Phrase Network from Unstructured Text Documents. Journal of Computing in Civil Engineering, 31(6). American Society of Civil Engineers (ASCE). https://doi.org/10.1061/(ASCE)CP.1943-5487.0000708
Nedeljković Đ, Kovačević M. Building a Construction Project Key-Phrase Network from Unstructured Text Documents. Journal of Computing in Civil Engineering. 2017;31(6). doi:10.1061/(ASCE)CP.1943-5487.0000708
Nedeljković, Đorđe, Kovačević, Miloš, "Building a Construction Project Key-Phrase Network from Unstructured Text Documents," Journal of Computing in Civil Engineering 31, no. 6 (2017), https://doi.org/10.1061/(ASCE)CP.1943-5487.0000708