Semantic integration of geospatial concepts - a study on land use land cover classification systems
Publication or External Link
In GI Science, one of the most important interoperability is needed in land use and land cover (LULC) data, because it is key to the evaluation of LULC's many environmental impacts throughout the globe (Foley et al. 2005). Accordingly, this research aims to address the interoperability of LULC information derived by different authorities using different classificatory approaches.
LULC data are described by LULC classification systems. The interoperability of LULC data hinges on the semantic integration of LULC classification systems. Existing works on semantically integrating LULC classification systems has a major drawback in finding comparable semantic representations from textual descriptions. To tackle this problem, we borrowed the method of comparing documents in information retrieval, and applied it to comparing LULC category names and descriptions. The results showed significant improvement comparing to previous works.
However, lexical semantic methods are not able to solve the semantic heterogeneities in LULC classification systems: the confounding conflict - LULC categories under similar labels and descriptions have different LULC status in reality, and the naming conflict - LULC categories under different labels represent similar LULC type. Without confirmation of their actual land cover status from remote sensing, lexical semantic method cannot achieve reliable matching.
To discover confounding conflicts and reconcile naming conflicts, we developed an innovative method of applying remote sensing to the integration of LULC classification systems. Remote sensing is a means of observation on actual LULC status of individual parcels. We calculated parcel level statistics from spectral and textural data, and used these statistics to calculate category similarity. The matching results showed this approach fulfilled its goal - to overcome semantic heterogeneities and achieve more reliable and accurate matching between LULC classifications in the majority of cases.
To overcome the limitations of either method, we combined the two by aggregating their output similarities, and achieve better integration. LULC categories that post noticeable differences between lexical semantics and remote sensing once again remind us of semantic heterogeneities in LULC classification systems that must to be overcome before LULC data from different sources become interoperable and serve as the key to understanding our highly interrelated Earth system.