A Novel Trajectory Clustering technique for selecting cluster heads in Wireless Sensor Networks

International J ournal of Recen t Tre nds in Engineering, Issue. 1, Vo l. 1, May 2009 A Novel Traject ory Clustering technique for selecting cluster heads in Wireles s Sensor Networks Hazarath Munaga 1 , J.V.R. Murthy 1 , and N.B.Venkateswarlu 2 1 University Colle ge of Engine ering, Dept. of C SE, J.N. T.University Ka kinada, A.P, India Email: {hazarath. munaga, mjonnalagedda}@ gmail.com 2 AITAM, Dept o f CSE, Tekkali, A.P, India Email: venkat_ritc h@yahoo .com Abstract —Wireless sensor netw orks (WSNs) s uffers from the hot spot proble m w here the sensor nodes closest to the base station are n eed to relay m ore packet than the nodes farther away from the base stat ion. Thus, li fetime of sensory network depends on these closest nodes. Clustering methods are used t o extend the lifeti me of a wireless sensor network. However, current cl ustering algorithms usually utilize tw o techniques; selecting cluster heads w ith more residual energy, and rotating cluster heads periodically to distribute the e nergy consumption a mong nodes i n each cluster and lengthen the netw ork lifetime. Most of the algorithms use random selection fo r selecting the cl uster heads. Here, we propose a novel trajectory clustering technique fo r s electing the cluster heads in WSNs. O ur algorithm selects the cl uster heads based on traffic and rotates per iodically. It p rovides the firs t trajectory based clustering technique fo r selecting the cluster heads and to extenuate the h ot spot problem by prolonging the network lifetime. Index Terms —Trajectory c lustering, Wireless sensor networks, Network life time, Cluster head I. I NTRODUCTI ON Wireless sensor networks (hereinafter, WSNs) are networks of wireless nodes that ar e d eployed o ver an area for the purpose of monitoring certain pheno mena of interest. To keep spe cific ar eas under observation, WSNs deploy h undreds or thousands of integrated sensor nodes to sample data from observe d envir onment. The nodes perform certain m easurements , process the m easured data and transmit t he proc essed data to a b ase station over a wireless channel. The base station collects data fro m all the nodes, and analyzes this data to draw conclusion s about the activity in the area of interest. In practice , due to the large q uantit y of sens or node s, it is i nfeasible to recharge the batteries in WSNs. T herefore, sensor network lifeti me is a pri mary concern in sensor net work design. In literature many resear chers c oncerning pr otocols for WSNs have been prop osed to improve the e nergy consumption and the network lifetime. Those protocols can be categorize d into three classes: routing p rotoco ls, sleep-and-awake scheduling protocols, and clustering protoco ls. The ro uting pr otocols [1] [2] d etermine the energy-efficient multi -hop paths fro m eac h node to the base station. In sleep-a nd-awake scheduling protocols [3- 5], every node in the schedule ca n sleep , in order to minimize energ y consumptio n. I n cl ustering p rotoco ls [ 6] [7] data aggregatio n ca n b e used for reducing e nergy consumption. Da ta aggregatio n, also known as data fusion, ca n c ombine multiple data packets received fro m different sensor nod es. It r educes the size of the data packet by eliminating the red undancy. Wireless communication cost is also decreased by the reduction in the d ata packets [ 8]. Therefore, clustering protoc ols improve the energ y consumpti on and the net work lifeti me of the WSNs. Clustering [9 ] is a co mmonly ad opted approach in sensor networks to m anage po wer efficiently. In clustering, senso rs in the monitoring area ar e grouped into clusters; all sensor nodes within the same cluster send t heir data to t he cluster head, which then for wards the a ggregated data to the b ase station. Therefore, cluster heads “typicall y die at a n ear ly stage” [1 0]. This is sometimes calle d as the hot sp ot pro blem [11]. Without adding extra nodes or red istributing the available energ y, this p roble m i s hard to solve. For ex ample, [10] have shown that var ying the tran smission po wer of nodes, e ven considering unlimited tra nsmission ranges, d oes not so lve the hot spot problem. At the sa me time, it is al so envisioned that sensor nodes will become “extremely inexpensive” [ 12]. W hile beyond a certain node density, adding additional nodes doe s not provide any improvement regard ing se nsing, co mmunication or coverage [ 13], adding nod es might obvio usly help to increase the l ifetime of a se nsor network while providing the same service to its users, i. e. leveraging sensor value s from the same n umber of nod es. Ref. [ 14] p roposed LEACH, a well-known cluster ing protoco l for WSNs. LEACH includes d istributed clu ster formation, local processing to reduce global communication and ra ndomized rotation of cluster head s among all t he node s in the ne twork. E ach cluster select s a cluster head, which is respo nsible for aggregating collected data and sending data to base station. LEACH provides a good model that he lped to reduce infor mation overload and provides a reliable data to the end us er. Together, these features allo w LEACH to ac hieve t he desired p roperties. Ref. [ 15] the p roblem of finding a n e nergy-bala nced solution to d ata p ropagation in WS Ns using a probabilistic al gorithm was c onsidered for the first time. The lifespan o f t he networ k is m aximized b y ensurin g International J ournal of Recen t Tre nds in Engineering, Issue. 1, Vo l. 1, May 2009 that the energy consumptio n in each slice is the sa me. Sensors are assumed to be randomly distributed with uniform di stribution in a circular region or, more generally, the sec tor o f a disk. Data have to be pr opagated by t he WSN towards a sink loc ated at the center of t he disk, a nd it is sho wn that e nerg y balance can be achieved if a recurrence relation between the p robab ilities that a slice ejects a message to the si nk is satis fied. Ref. [16] proposed clustering-based routing protocol called base station co ntrolled dynamic cl ustering proto col (BCDCP), w hich utilizes a hi gh e nergy base statio n to set up cluster heads and perfor m other energy-intensive tasks, can noticea bly enhance the lifetime o f a network. Ref. [17 ] pro posed two ne w algorith ms under the na me PEDAP, which are near optimal minimu m spanning tree based wi reless ro uting sche me. T he perfor mance of t he PEDAP was compared with LE ACH a nd PEG ASIS, and showed a slightl y better networ k lifetime than PEGASIS. Ref. [ 18] p roposed a new ro uting scheme (SH ORT) , to achieve higher energy e fficienc y, network life time, and more throughput than PEGASIS, and PEDAP-PA protoco ls. T his scheme used the centralized algorit hms and r equired the p owerful b ase station. T he performance results sho wed t hat S HORT can achieve b etter “ener gy X delay” performance than the exi sting chain based data aggregation pro tocols. Ref. [19] prop osed EECR, which is an energy e fficient clustering routing algorithm. T he p erformance of t he EECR was co mpared w ith LE ACH, and showed a slightly better net work lifeti me than LE ACH. However, the unsolved pro blem of co nsiderable energ y consumption on the cl uster formation still exists. Here, we consider t he path follo wed by the node to transfer d ata to the base station a s the “tra jectory” . We used our proposed novel trajectory clustering algorithm for clustering such paths and obtained “representative trajectory” is u sed to as sign the c luster heads. T hese obtained cluster heads will be used for co mmunicating data to the base station. I n this paper, w e co ncentrated o n the rotatio n of c luster head s among all se nsor nod es to improve the li fetime of t he network b ased on the tra ffic density. We tested our p roposed method a nd found that this method enha nces the lifeti me of the net work. II. N OVEL A LGORIT HM This section consider s the WSNs consisting o f hundreds or thousands of de ployed se nsor nodes in t he sensing field. O n the basis of [20][ 16], it is assumed b y the follo wing pr operties of the WSNs to si mplify the network model. • The base statio n is located far away fro m the sensors, • The nodes have uni form initial e nergy allocation and all sensor nodes have eq ual capab ilities (data processing, wirele ss communi cation, batte ry power). • All sensor node s have various transmission po wer levels, and each nod e can cha nge the po wer level dynamically. • Each node senses t he environmen t at a fixed r ate, and • All nodes are i mmobile. The sensor nodes are geographicall y grouped into clusters and cap able of ope rating in two ba sic modes: the sensing m ode and t he cluste r head mode [ 20]. In the sensing mode, the n ode senses the task a nd sends the sensed data to its cl uster hea d. In cluster head mode, a node gathers d ata from its cl uster members, perfor ms data fusion, and transmit s t he data to the b ase station. T he base station in turn perfor ms the key task of cluster head selection. A. Cluster Head Selection Initially t he nodes will trans mit a he llo packet to the base station. After receiving hello packets fro m the nodes, usi ng the Traje ctory Clusteri ng algorith m, the b ase station computes the re presentative traj ectory b y clustering the traj ectories (here the trajectory is nothing but the pat h used b y t he nod e to tr ansfer its data to the base statio n). T he nodes of the obtained representative trajecto ry are considered as the cluster heads. Then t he base station splits the net work into c lusters (equal to the number of nodes i n the rep resentati ve traj ectory), and identifies the nodes in the repr esentative trajecto ry as t he correspo nding cluster heads. Then, the base station broad cast a m essage to the network m entioning about th e nodes and their corresp onding cl uster heads . Subsequently the nod es will use its cluster head s to transmit data . T his pr ocess will b e pe rformed per iodicall y and the cluster head s will c hange based o n the traffic. Cluster head selection ro utine contains the following stages:- 1. Base station computes the cluster heads using proposed T rajectory Clusteri ng algorith m; 2. Split the net work into N clu sters; and 3. Bro adcast message to all nod es mentioni ng cluster members and their co rrespo nding cluster heads B. Trajectory Clustering The success o f any cl ustering algorithm depe nds on t he adopted dissimilarity measure. Fo llowing section explains about the ad opted dissimilarity mea sure. Ref. [21], proposed th e us age of Eu clidean distance between ti me series of equal length a s t he measure o f their similarity. T he idea has been generalized in [2 2] for subsequence matching. In a similar way [23] used Discrete Wavelet T ransfor m and [24] used Pr incipal Component Analysis for measuring t ime series si milarity. Another approach whic h is b rought fro m i mage processing is Time Warping techniqu e and it is used in [25] to match signals in spee ch rec ognition. A si milar technique is used to find lon gest co mmon subseq uence (LCSS) of t wo sequence s using fast prob abilistic algorithms to co mpute the LCSS, a nd then def ine the distance using the le ngth o f this subsequence [26]. Here w e adop ted Hausdo rff distance [2 7] for calculating dissimilarity between trajecto ries. T he following are so me of t he definitions used in our algorithm. International J ournal of Recen t Tre nds in Engineering, Issue. 1, Vo l. 1, May 2009 Definition 1 : A trajectory (t) is represented as trj(t id ,u 0 ,u 1 ,u 2 ..,u n ) where (t id ) is a unique trajec tory id (data packet), and (u 0 ,u 1 ,u 2 ,..,u n ) is a seque nce o f nodes reflecting the spatial p osition of the nod e. Definition 2 : We d efine the spatial dissimilarity function bet ween two traj ectories t 1 and t 2 as the maximum of one way distances between t wo traj ectorie s. The one w ay distance from a trajectory t 1 to another trajecto ry t 2 is d efined as the integral of t he Hausdorf f distance between po ints of t 1 to trajecto ry t 2 divided b y the number of poin ts in t 1 ( |t 1 | ). dist ow (t 1 ,t 2 ) = dp t p d t t p h ∫ ∈ 1 ) , ( | | 1 2 1 The Hausdorff dista nce f ro m a trajectory p oint p to another traj ectory t 2 is defined as d(p, t 2 ) = min q ∈ t2 {d(p,q)} . T he dis tance bet ween trajector ies t 1 and t 2 is the maximu m of their one way distances, i.e., dist(t 1 ,t 2 ) = max{ dist ow (t 1 ,t 2 ),dist ow (t 2, t 1 ) } Clearly the di st ow (t 1 ,t 2 ) is not symmetric but dist(t 1 , t 2 ) is s ymmetric. Note that dist ow (t 1 ,t 2 ) is the inte gral of t he shortest distance s from points in t 1 and t 2 . 1) Trajecto ry Cluster Ro utine Trajec tories are groupe d into clusters usi ng the threshold . Here the threshold is considered as a maximu m value, such t hat all traj ectories are grouped into a single cluster. The trajectory cluster routine co ntains the following stage s: 1. Dissimilarit y matrix for traj ectories w ill be computed using t he Hausdor ff distance, 2. Using following Initialization Algorithm trajecto ries are groupe d into initial clusters; a. T ake first sa mple a s first cluster. Classify all the remainin g trajectorie s into this cluster if the y are within the th reshold. b. Take a traj ectory (seq uentiall y) which i s not alread y classified into any of the cluster and co nsider it as a new c luster. Take all the o ther trajectories which are not kept in any o f the clusters and keep in this clu ster if t hey s atisf y the threshold limit. c. Repeat step b till no ne w clust er is adde d. 3. Using t he follo wing R epTraj Algorithm representative traj ectories ar e computed. a. For each T rajector y o f cluster C calculate cumulative dissimilarity w ith all other trajecto ries of the same clust er C. Select the trajecto ry which i s havin g minimum cumulative d issimilarit y and take this as representative traj ectory of that cluster. 4. By co nsidering the traj ectories received fro m step 3, a s initial clus ter ca nters, using t he follo wing Re- cluster Algorithm re co mpute clusters and their representative traj ectories until there is no c hange in the represe ntative traje ctories. a. For each Trajec tory calculate dissimilarity with all the K r epresentative trajecto ries a nd classi fy to th e cluster for which dissimilari ty is lo w. b. Re-calculate repr esentative trajecto ries using RepTraj Algorithm. C. Data co mmunica tion phase There are three steps during the data co mmunication phase: data collection, data fusion and data trans mission. Initially eac h se nsor nod e tr ansmits t he se nsed information to its clus ter head at the time slot assigned b y its cluster head. In o rder to save itself ener gy, the node will close transmit part durin g the time slot, which is not required to it. Once data from all sensor nod es hav e been received, the cluster h ead performs data fusion o n the collected data and reduces the a mount of raw data that need to send to the ba se stati on. Once the data gat hering and data fusion are completed , the cluster head sends the compressed d ata to the base station. As mention pr eviously, a ll the nodes c an work as a cluster head s. Due to this, an y nod e can be come a clu ster head or a cluster member . At ea ch turn the cluster head calculates available po wer and compares with the cluster members. W henever the cluster heads p ower b ecomes less than the minimum p owe r holding, then the cluster heads infor ms to it s cluster members and a ssigns the maximum p ower holding cl uster member a s t he c luster head and in tur n co mmunicates to the base station . Whenever the cl uster head is changed base station r epeats the cluster findi ng process a nd modifies the cl usters. III. E XPERIMENTAL W ORK To e valuate the perfor mance o f o ur algorit hm, it has been simulated and co mpared its performance with energy efficie nt clustering r outing (hereafter , EECR). Before the si mulation and r esults are introd uced, the radio model and so me important p ara meters [19] used i n simulation have b een describe d. A. The ra dio model We have used bo th the free-space propagation model and the two-ra y ground p ropagation model to appro ximate path loss susta ined because o f wireles s channel tra nsmission. Given a threshold transmission distance of d 0 , the free-space m odel i s used wh en d < do, and the two-ray model is applied for cases when d ≥ d 0 . Using these two models, the transmit energy costs for the transfer of a b-bit data message between two nodes separated b y a distance of d meters is give n: if d

Original Paper

Loading high-quality paper...

Comments & Academic Discussion

Loading comments...

Original Paper

Related Papers

Comments & Academic Discussion

Leave a Comment