Semi-Automatic Objects Recognition in Urban Areas Based on Fuzzy Logic

doi:10.4236/jgis.2010.22011

Paper Menu >>

Journal Menu >>

Journal of Geographic Information System, 2010, 2, 55-62

doi:10.4236/jgis.2010.22011 Published Online April 2010 (http://www.SciRP.org/journal/jgis)

Semi-Automatic Objects Recognition in Urban Areas

Based on Fuzzy Logic

Federico Prandi, Raffaella Brumana, Francesco Fassi

Department of B.E.S.T., Politecnico di Milano, P.zza Leonardo da Vinci, Milan, Italy

E-mail: {federico.prandi, raffaella.brumana, francesco.fassi}@polimi.it

Abstract

Three dimensional object extraction and recognition (OER) from geographic data has been definitely one of

more important topic in photogrammetry for quite a long time. Today, the capability of rapid generating

high-density DSM increases the supply of geographic information but the discrete nature of the measuring

makes more difficult to recognize correctly and to extract 3D objects from these surface. The proposed

methodology wants to semi-automate some geographic objects clustering operations, in order to perform the

recognition process. The clustering is a subjective process; the same set of data items often needs to be parti-

tioned differently based on the application. Fuzzy logic gives the possibility to use in a mathematical process

the uncertain information typical of human reasoning. The concept at the base of our proposal is to use the

information contained in Image Matching or LiDAR DSM, and typically understood by the human operator,

in a fuzzy recognition process able to combine the different input in order to perform the classification. So

the object recognition approach proposed in our workflow integrates 3D structural descriptive components of

objects, extracted from DSM, into a fuzzy reasoning process in order to exploit more fully all available in-

formation, which can contribute to the extraction and recognition process and, to handling the object’s

vagueness. The recognition algorithm has been tested with to different data set and different objectives. An

important issue is to apply the typical human process which allows to recognize objects in a range image in a

fuzzy reasoning process. The investigations presented here have given a first demonstration of the capability

of this approach.

Keywords: Objects Recognition, DSM, Fuzzy Logic

1. Introduction

1.1. Motivations

Estimating the research fields involved in the 3D topog-

raphic objects generation for 3D GIS a re very huge.

3D GIS require 3D representations of objects and 3D

object reconstruction of real world is a relatively new

issue in photogrammetry.

Progress in the field of digital image matching for

DSM generation and growth of airborne laser scanning

technologies have improved the derivation of digital sur-

face models.

The basic idea of this work is to use the 3D informa-

tion containe d in these DSM in order to aid the 3D iden-

tification and extraction of features. Using the DSM we

would to overcome some typical difficulties in image in-

terpretation.

Then this work focuses on the automation and accel-

eration of the recognition of 3D topographic objects for

3D GIS. The purpose is to use the 3D information con-

tained in the DSM in order to automatic detect and recog-

nize topographic objects in complex scenes. Furtherm ore, it

is tried to use only LiDAR or Image Match ing DSM. Due

to this limitation our goal becomes even more chal lenging.

Experience and daily practice allow us to have our brain

automatically interpreting what we see. Human recogni-

tion takes advantage of a variety of acquired information

rather than relying on, say, a single descriptor of the ob-

jects. Further, human perception possesses a tremendous

potential for learning and deals perfectly with the fuzzi-

ness of the real world. In the case of objects in geogr aph i c

data an interpretation by human operator may be easier

but, if these processes are replaced by a computer, it is

more than probable that none of the objects will be iden-

tified.

Fuzzy set theory, introduced by L. Zadeh in the 1960s,

resembles human reasoning in its use of approximate

F. PRANDI ET AL.

informati on and uncertainty to generat e decisi ons.

This work contains possible methods that aim at an

automated object recognition based on fuzzy logic. An

important aspect of fuzzy reasoning is that the rule base

should include observations of the important object’s de-

scriptors. Moreover, it reflects the fact that people may

formulate similar “fuzzy statements” to characterize how

they perceive object in range images. Formulating the

rules is more a question of the expertise of an operator

than of a detailed technical modelling approach. This

aspect is very important because we are aware of the fact

that an operator will never be completely replaced.

1.2. Related Works

Three dimensional object extraction and recognition (O

ER) from geographic data has been definitely one of

more important topic in photogrammetry for quite a long

time. However, most of the existing methods for auto-

matic object extraction and recogn ition from data are ju st

based on the range information and employ parametric

methods while object’s vagueness behaviour is basically

neglected [1].

The main task in automatic DTM generation from data

is the separation of terrain points from non-terrain points;

further classification of non-terrain points in buildings,

vegetation and other objects (e.g. bridges) may be nec-

essary, depending o n the appl i cat i on.

Since long years automatic approaches to DTM gen-

eration and building reconstruction based on radiometric

images have long been a research topic. It is shown that

the segmentation and classification problem is not only

an interesting research topic, but also very important in

practice. Multiple, la rgely complem entary, sensor data such

as colour or multi-spectral aerial images and range data

from laser scanners or SAR have been used to achieve

robustness and better performance in 3D ORR. For ex-

ample, colour infrared (CIR) aerial images in comb i n a ti o n

with laser scanner data DTMs can be used for feature

extraction [2].

If multispectral imagery is available the classification

approach is the most convenient way for detecting

building areas or urban regions [3]. Everything that ex-

ceeds a certain threshold in the nDSM will be included,

and vegetation will be afterwards excluded by interpret-

ing the NDVI [4].

In many case however the multispectral information

are absent and techniques in order to segment the DSM

are carried out. Different filtering strategies have been

proposed, based on deviations from a parametric surface,

slope threshold, clustering, etc.

A slope based filtering using mathematical morphol-

ogy has been developed defining a slope threshold as the

maximum allowed height difference between two points

as a function of their spatial distance [5].

A complex iterative procedure, based on the analysis

of gradient, surface orientation and residuals from suc-

cessive spline interpolations has been implemented in

order filtered out non-terrain points [6].

Another interesting approach [7] has been discussed:

the segmentation is carried out by combining region

growing with a principal component analysis (PCA).

A three-stage framework has been implemented for a

complete, robust and automatic classification of LiDAR

data and is composed by: a region-growing technique to

obtain regions with a step edge along their border, a

grouping of connected sets of pixels on the basis of an

8-classes partition of the height gradient orientation and

a rule based scheme applied to the classification of the

regions [8].

Study focuses on the automatic extraction of a DTM

from a high-resolution DEM produced by image correla-

tion in urban or rural areas based on a hybrid approach

has been designed and combines complementary aspects

of both TIN-based and segmentation-based techniques

[9].

This review outlines the considerable efforts which

have been under taken to deal w ith the complex pr oblems

of segmentation Computational theories on human per-

ception and artificial intellig ence are partially included in

these developments.

2. Classification Based on Fuzzy Logic

2.1. Fuzzy Logic and Application in Feature

Extraction

By recognition we refer to the process that assigns a la-

bel (e.g. “building” or “tree”) to a segmentation result, in

particular a region, based on properties (descriptors) of

the region. Segmentation is the process th at partitions the

spatial domain of an image or other raster datasets like

digital elevation models into mutually exclusive parts,

called regions.

The major challenge of this work is to develop a seg-

mentation procedure in connection with an inference

process for object recognition where computational theo-

ries on human perception and artificial intelligence are

partially included.

The bridge to connect the computational theories spec i-

fied above with the human perception can be given by

the Fuzzy Logic.

Fuzzy logic provides a simple way to arrive at a defi-

nite conclusion based upon imprecise, uncertain, am-

biguous, vague or missing input information. Over the

past few decades, fuzzy logic has been used in a wide

range of problem domains. Although the fuzzy logic is

relatively young theory, the areas of applications are very

wide: process control, management and decision making,

operation s res ear ch , econ o mies. Dealing with simple ‘b l a ck’

F. PRANDI ET AL.57

and ‘white’ answers is no longer satisfactory enough; a

degree of membership (suggested by Prof. Zadeh in 1965)

became a new way of solving the problems. The natural

description of problems, in linguistic terms, rather than

in terms of relationships between precise numerical val-

ues, is the another advantage of this theory.

Fuzzy Logic introduces linguistic variables for the

object characteristic descriptors and linguistic labels to

describe the fuzzy sets on the range of all possible values

of the linguistic variables.

The basic idea of the recognition approach presented

in this work is to resemble human reasoning in its use of

imprecise, vague or missing information. This involves

formulating rules with imprecise knowledge and provid-

ing a simple way of arriving at a definite conclusion, i.e.

the recognized objects. The recognition approach based

on fuzzy reasoning and a concept for introducing learn-

ing into fuzzy recognition is in the focus of this work.

2.2. Application in Feature Extraction

Many methods in pattern recognition and feature extrac-

tion have been proposed, based on various approaches,

such as Bayesian inference, neural networks, linear meth-

ods, nearest prototypes, etc. Fuzzy set theory and related

domains have brought new tools for pattern recognition,

either based on the concep t of fuzzy sets as a fundamen-

tal concept for modelling classes, or based on non clas-

sical representations of uncertainty.

These approaches can often be viewed as a generaliza-

tion of some classical method, these generalizations in-

troducing either some fuzziness in the modelling or some

uncertainty measure more general than probability meas-

ures. In other cases, they follow the same kind of phi-

losophy, but with different too ls. Typical examples in the

first category are: the fuzzy k-nearest neighbours and the

fuzzy c-means. In the second one, the fuzzy pattern

matching approach, in the framework of possibility the-

ory, can be viewed as the possible counterpart of the

Bayesian approach to classification.

Applied to raster image, fuzzy classification estimates

the contribution of each class in the pixel. It assumes that

a pixel is not an ind ecomposable unit in the image an aly-

sis and, consequently, works on a new principle: “one

pixel-several classes” to provide more information about

the pixel unlike the hard classification methods which

are poor in information extraction. For instance a fuzzy

c-means clustering algorithm has been developed and

implemented in unsupervised classification of remote

sensing data [10] .

In the proposed fuzzy process (Figure 1), each pixel is

transformed into a matrix of membership degrees repre-

senting the fuzzy inputs. A minimum-reasoning rule is,

then, applied to infer the fuzzy outputs. Finally, a de-

fuzzification step is applied to extract features [11].

The main componen ts of the fuzzy recogn ition process

are as follows:

 a database, which defines the membership func-

tions of the fuzzy sets,

 a rule base, which contains fuzzy if-then rules,

 a fuzzy reasoning procedure, which performs in-

ference operations on the rules.

This should consider 1) all available descriptors of an

object (such as: 3D structure, textural information and

spectral responses), 2) a fuzzy description of object

properties and a fuzzy inference strategy for object rec-

ognition, 3) learning capabilities to modify imprecise

model descriptions and increase the potential of recogni-

tion in particular if new and unrecognized objects are

encountered.

The fuzzy AND or OR operators combine the mem-

bership values of the inputs in each rule for the antece-

dent of that rule. The MIN reasoning rule, applied on the

matrix of produced fuzzy, inputs will consider, for each

class, the membership degrees provided by the different

fuzzy sets and pick out the minimal membership degree

to represent the class extent in the pixel. Then a MAX

operation will be performed, on the consequent of each

rule, to obtain the element with the highest value (fuzzy

output) and the corresp onding class of that feature willbe

considered as associated fuzzy class to that pixel (Fig-

ure 2).

Figure 1. Implementation flowchart in fuzzy step.

F. PRANDI ET AL.

Rule base should include observations of the important

descriptors. Moreover, it reflects the fact that people may

formulate similar “fuzzy statements” to characterize how

they perceive objects in, for instance, aerial colour im-

ages.

The investigations of the works invo lved in classifica-

tion and feature extraction [12-14] have given a demon-

stration of the capability of these fuzzy approaches.

2.3. Proposed Workflow

The proposed methodology wants to semi-automate some

geographic objects clustering operations, in order to per-

form the recognition process.

The clustering is a subjective process; the same set of

data items often needs to b e partitioned differently based

on the application. There is no general algorithm or ap-

proach that solves every clustering problem. In order to

overcome this problem is important to have simple and

semi-automated tools with the possibility to change the

classification based on different situations and different

data.

As seen in the previous, fuzzy logic gives the possibil-

ity to use in a mathematical process the uncertain infor-

mation typical of human reasoning. The concept at the

base of our proposal is to use the information contained

in Image Matching or LiDAR DSM, and typically un-

derstood by the human operator, in a fuzzy recognition

process able to combine the different input in order to

perform the classification.

So the object recognition approach proposed in our

workflow integrates 3D structural descriptive components

of objects, extracted from DSM, into a fuzzy reasoning

process in order to exploit more fully all available infor-

mation, which can contribute to the extraction and recog-

nition process and, to handling the object’s vagueness.

Figure 2. Architecture of the explicit fuzzy method illus-

trated for two descriptor, two variables with three classes.

The proposed objects extraction method requires some

preliminary step that con sists in 1) to locate and separate

all 3D objects from the terrain and 2) to considerate and

generate all the geometric properties that can describe

the objects. Then the challenge is to develop a segmenta-

tion procedure in connection with an inference process

for object recognition.

Basic idea was to perform the classification procedure

using a priori human knowledge about the objects and

then in fuzzy logic manner. Some information, needed

for membership function definition, was taken from hu-

man heuristic knowledge.

More specifically the steps of fuzzy recognition proc-

ess will be:

 input raster data and structural descriptor variables

are introduced;

 membership functions are defined using results

from human heuristic knowledge;

 definition of fuzzy logic inference rules;

 performing Raster data classification.

The OER process starts with the extraction of the

Structural Descriptor (SD). Then the descr iptors are an a-

lyzed to derive the SD membership functions. In the next

stage, the recognition operation is performed with the

application of the sample if-then fuzzy rules and the

MIN-MAX inference process (Figure 3).

Prior to fuzzy recognition the membership functions of

the structural descriptors have to be specified but, before

the system operation is started, it is always necessary to

set up the fuzzy reasoning parameters. For the object

classes, the preliminary Structural descriptor’s member-

ship functions components are defined based on the

knowledge of an expe r ien ce d photogrammetric operator.

The formulation of the fuzzy rules requires a profound

observation of the integrative impact of the descriptors

on the recognition of an object. This again depends on

the experience of an operator but also on the complexity

of an object. In the experiments we observed that for

partition walls a comparatively small number of fuzzy

rules are sufficient, whereas trees require more rules pro-

bably due to their more complex shapes and variety of

appearances.

2.4. Recognition of Little Objects in Urban Areas:

The Partition Walls

In the data-test we investigated the possibility to recog-

nize large scale objects in urban areas. In these areas

large amount of objects are constituted by building and

road feature classes but, th ere are also many little o bjects

which are important i.e. for construction of 3D city mod-

els. In many cases these features are difficult to single

out into images by an expert photogrammetric operator

too. The partition wall is a typical example of this kind

of feature class. In fact due to their geometry is difficult

F. PRANDI ET AL.59

Figure 3. OER (Object Extraction and Recognition) fuzzy

process work-flow.

to detect them into image, beside they may have parts

occluded by the buildings or trees, or may be confused

by the shadow of the walls themselves. Human recogni-

tion takes advantage of a variety of acquired information

and the walls are identifiable on the aerial images more

like as a separation between to different areas then like

as single geometry. In this situation human perception

possesses a tremendous potential for learning and deals

perfectly with the fuzziness of the real world. For these

reasons the recognition of these particular features may

take advantage from a fuzzy logic approach, which is

able to capture the “fuzziness” or “imprecision” of the

real world.

Beside in this particular context use of Structural de-

scriptor are fundamental because the spectral information

is very poor due to shadows, noise and similarity to other

objects such as road, buildings etc. So this is a very im-

portant challenge for our approach which starts only from

3D Structural Descriptors.

The data test is an urban area of Milan, there are many

partition walls which divide road and railways and like

as boundary of big factor y (Figure 4). The input DSM is

obtained trough Image matching technique and has 1 × 1

meter resolution.

The surface model generated by image matching is sh-

own in (Figure 5). Appling segmentation of the surface

model based on morphological filtering in order to the

detected 3D regions, we can obtain the nDSM.

After refinement of the extracted regions the next step

is to derive the structural descriptors: gradient edge de-

tector and others. The membership functions are defined

and applied to these features and the results are shown in

Figure 6.

The trapezoidal membership function, with maximum

equal to 1 and minimum equal to 0, is used in this work.

Special cases like symmetrical trapezoids and triangles

reduce the number of parameters to three. The trapezoi-

dal function are modell ed with four param et ers (α, β, γ, δ).

(,,,, )1

if x

xif x

if x



















































Figure 4. Sample data-set, the more important partition

walls are highlighted. It is possible to see like as the object

identification into the im age is problematic.

F. PRANDI ET AL.

Figure 5. Digital surface model generated by image match-

ing. 1 × 1 meter resolution.

Figure 6. Application of the membership function to the

structural descriptors (Height range, edge) generation of

the fuzzy Data Set.

The linguistic variables, which are variables that as-

sume linguistic terms values, of the object’s structural

descriptors have to be defined in fuzzy logic. Linguistic

variables are associated to each input structural descrip-

tor and for each linguistic variables some linguistic labels

has been assigned. This assignment is mostly a mixture of

expert knowledge and examination of the desired input-

output dat a .

Through definition of linguistic variables and mem-

bership function parameters calculation of the input to

the fuzzy recognition pro cess is given.

In a second step the determined membership values

have to be combined to get a final decision (inference

process). This component of the fuzzy recognition proc-

ess consists in the definition of a set of rule base, which

contains fuzzy if-then rules.

Once defined the sample if-then rules we can start the

inference process. The results of classification are shown

in Figure 7.

This methodology returns as output a very thigh rec-

ognition objects set, due to use of min implication, which

consist in low pass filter. In this situation is enough a

wrong value in an input data set to get worse in the ob-

jects extraction. For instance, inaccurate DTM extraction

may cause the elimination of some objects part from

nDSM. This is all the more probable in case of little ob-

jects such as walls. The result is a chopped classification

set where the objects classified as walls are correctly

understood and the false positive are limited but, we

have many not recognized objects and the geometry re-

sults fragmented.

The second method we have tested a weighted sum

approach [15] and thi s ha s sh own better res ults.

The resulting outpu t value (Figure 8 left) is calculated

by t he su m of the me mbe rsh ip values for each rules con-

cerning an individual weight factor Wi. The weights for

parameters height, edge and height range depend on the

experience and data input/output

( )(),......,()

iii ii

iA ABBNnN

mxW xWxW

 



with 11





Our investigations have shown that the classified ob-

jects are better defined than in fuzzy-min approach and

we may recognize more features than previously.

Finally an accuracy investigation is carried out to

evaluate the recognition results. For this purpose the ex-

Figure 7. Results of fuzzy walls recognition process.

F. PRANDI ET AL.61

Figure 8. Results of fuzzy walls recognition process using

the weighted sum approach. At left is shown the results of

fuzzy inference process and at right the final cluster once

applied a threshold (value 0.6) to the weighted fuzzy output.

tracted objects are compared to the object manually

measured and contained in the 1:1000 digital maps of

Milan.

Due to the particular wall’s geometry (very thigh in

one direction), the classification is influenced by the

quality of input DSM. Where DSM is well defined (Fig-

ure 9) we have a good result.

3. Conclusions and Future Works

In object recognition, human interaction remains an im-

portant part of the work flow even though the amount of

work to be done by the human operator can be reduced

considerably in the global extraction process. Many of

automated and semi-automated methods proposed in lit-

erature focus their efforts on the reconstruction process

or on the feature extraction once recognized the objects

by the operator.

The core of the system is an approach for recognition

of object’s primitives based on fuzzy logic theory which

analyzes the data in order to extract a maximal amount of

information, through an explicit process that uses struc-

tural informatio n of objects and integrates them in a f uz zy

reasoning process.

The methods presented in this work integrate some

typical heuristic human concept and is adaptive to oper-

ate with different data sets and in different applications.

Despite that further investigation and developments migh t

be carried out. In relation to dat a input di fferent descriptor

Figure 9. Comparison between the extracted features and

1:1000 digital maps of Milan. In red the wall features

manually extracted. We can observe a good result for 50%

of elements, while some walls are not recognized and we

have many false-positive due to the presence of many ob-

jects which have wall’s similar characteristics.

of objects, such as textural or multispectral properties,

can be considered in recognition process. Moreover the

generation of structural descriptor have to be improved

considering the sensitivity o f the algorithms to the co arse

input.

Another important improvement is neural-network lea-

rning. The learning capability of neural networks can be

introduced to the fuzzy recognition process by taking

adaptable parameter sets into account which leads to the

neuro-fuzzy approach.

Finally data acquired from different sensor (such as

High density Airborne Laser Scanning) or new systems

(UAV) can be used in recognition process and their po-

tentiality can be evaluated.

4. References

[1] F. Samadzadegan, A. Azizi, M. T. Hahn and C. Lucas,

“Automatic 3D Object Recognition and Reconstruction

Based on Neuro-Fuzzy Modelling,” ISPRS Journal of

Photogrammetry & Remote Sensing, Vol. 59, No. 5, 2005,

pp. 255-277.

[2] M. Hahn and C. Stätter, “A Scene Labeling Strategy for

Terrain Feature Extraction Using Multisource Data,” In-

ternational Archives of Photogrammetry and Remote

Sensing, Vol. 32, Part 3/1, 1998, pp. 435-443.

[3] T. Knudsen and B. Olsen, “Automated Change Detection

for Updates of Digital Map Database s,” Photogrammetric

F. PRANDI ET AL.

Engineering & Remote Sensing, Vol. 69, No. 11, 2003,

pp. 1289-1296.

[4] G. Sohn and I. J. Dowman, “Terrain Surface Reconstruc-

tion by the Use of Tetrahedron Model with the MDL Cri-

terion, International Archives of Photogrammetry,” Re-

mote Sensing and Spatial Information Sciences, Vol. 34,

Part 3A, 2002, pp. 336-344.

[5] G. Vosselmann, “Slope Based Filtering of Laser Altim-

etry Data,” International Archives of Photogrammetry

and Remote Sensing, Amsterdam, Vol. 33, No. B3, 2000,

pp. 935-942.

[6] M. A. Brovelli, M. Cannata and U. M. Longoni, “Man-

aging and Processing LIDAR Data within GRASS,” Pro-

ceedings of the Open Source GIS-GRASS Users Confer-

ence, Trento, September 2002.

[7] M. Roggero, “Object Segmentation with Region Growing

and Principal Component Analysis,” International Ar-

chives of Photogrammetry and Remote Sensing and Spa-

tial Information Sciences, Vol. 34, Part 3A, 2002, pp.

289-294.

[8] G. Forlani, C. Nardinocchi, M. Scaioni and P. Zingaretti,

“Complete Classification of Raw LIDAR Data and 3D

Reconstruction of Buildings,” Pattern Analysis Applica-

tion, Vol. 8, No. 4, 2006, pp. 357-374.

[9] C. Baillard, “A Hybrid Method for Deriving DTMs from

Urban DEMs,” International Archives of Photogram-

metry and Remote Sensing, Vol. 37, Part B3b, 2008, pp.

109-113.

[10] R. L. Cannon, J. V. Dave, J. C. Bezdek and M. M.

Trivedi, “Segmentation of a Thematic Mapper Image

Using the Fuzzy C-Means Clustering Algorithm,” IEEE

Transactions on Geoscience and Remote Sensing, Vol. 24,

No. 3, 1986, pp. 400-408.

[11] F. Melgani, B. A. R. Al Hashemy and S. M. R. Taha, “An

Explicit Fuzzy Supervised Classification Method for Mul-

tispectral Remote Sensing Images,” IEEE Transactions on

Geoscience and Remote Sensing, Vol. 38, No. 1, January

2000, pp. 287-295.

[12] J. Amini, C. Lucas, M. Saradjian, A. Azizi and S. Sa-

deghian, “Fuzzy Logic System for Road Identification

Using Ikonos Images,” Photogrammetric Record, Vol. 17,

No. 99, 2002, pp. 493-503.

[13] A. Mohammadzadeh, A. Tavakoli and M. J. V. Zoej,

“Automatic Linear Feature Extraction of Iranian Roads

from High Resolution Multi-Spectral Satellite Imagery,”

International Archives of the Photogrammetry, Remote

Sensing and Spatial Information Sciences, Vol. 35, Part

B3, 2004, p. 764.

[14] B. Wuest and Y. Zhang, “Region Based Segmentation Of

Quickbird Imagery through Fuzzy Integration,” Interna-

tional Archives of the Photogrammetry, Remote Sensing

and Spatial Information Sciences, Vol. 35, Part B7, 2008,

p. 491.

[15] T. Vögtle and E. Steinle, “On the Quality of Object Clas-

sification and Automated Building Modelling Based on

Laser-Scanning Data,” International Archives of Photo-

grammetry Remote Sensing and photogrammetry, Vol. 32,

No. 4, 2003, pp. 77-84.