Our Brothers Keepers: Secure Routing with High Performance

Our Brothers’ K eepers: Secure Routing with High Performance ∗ Alex Brodsk y Uni versity of W innipe g W innipeg, MB, Canada, R3B 2E9 abrodsky@acs.uwinnipeg.ca Scott Lindenberg Uni versity of W innipe g W innipeg, MB, Canada, R3B 2E9 slindenb@acs.uwinnipeg.ca Nov ember 9, 2018 Abstract The T rinity [BB07] spam classiﬁcation system is based on a distributed hash table that is imple- mented using a structured peer-to-peer ov erlay . Such an ov erlay must be capable of processing hundreds of messages per second, and must be able to route messages to their destination even in the presence of failures and malicious peers that misroute packets or inject fraudulent routing information into the system. T ypically there is tension between the requirements to route messages securely and ef ﬁciently in the ov erlay . W e describe a secure and efﬁcient routing extension that we developed within the I3 [SAZ + 04] implementation of the Chord [SMK + 01] overlay . Secure routing is accomplished through se veral com- plementary approaches: First, peers in close proximity form overlapping groups that police themselves to identify and mitigate fraudulent routing information. Second, a form of random routing solves the problem of entire pack et ﬂows passing t hrough a malicious peer . Third, a message authentication mech- anism links each message to it sender , pre venting spooﬁng. Fourth, each peer’ s identiﬁer links the peer to its network address, and at the same time uniformly distrib utes the peers in the key-space. Lastly , we present our initial ev aluation of the system, comprising a 255 peer ov erlay running on a local cluster . W e describe our methodology and show that the ov erhead of our secure implementation is quite reasonable. keywords: secure routing, peer authentication, distrib uted hash tables ∗ This research was supported by an NSERC Discov ery grant. 1 Intr oduction Systems such as T rinity [BB07], LOCKSS [MRR + 03], and others are based on distributed hash tables that are implemented on top of peer-to-peer structured overlays. These ov erlays differ from better kno wn peer- to-peer systems such as BitT orrent in three fundamental ways. First, these ov erlays are closed, meaning that only authorized hosts may join the overlay . Second, these overlays must be secure and function even in the presence of failures, denial of service attacks, and malicious peers. Third, performance is paramount, meaning that each peer in the these ov erlays must be able to forward hundreds of messages per second. Although securing closed o verlays seems more manageable than the task of securing open o verlays, the task presents sev eral challenges. First, identifying, authenticating and authorizing peers and authenticating the messages that they send is not easy because the mechanisms must be fault tolerant, allow rev ocation, and must not signiﬁcantly impact performance. Second, securely routing messages, dealing with host and network failures, and most importantly , dealing with malicious peers and the fraudulent routing information that they inject into the o verlay is challenging in itself, let alone without signiﬁcantly impacting performance. As part of the T rinity project [BB07], we ha ve designed, implemented, and tested a secure closed ov erlay based on the I3 [SAZ + 04] Chord [SMK + 01] implementation. Our design comprises a distributed and fault tolerant identiﬁcation, authentication, and authorization mechanism; a key assignment scheme that encodes a peer’ s netw ork location yet ensures that the ke ys are uniformly distributed in the ke y space; a self-policing scheme based on groups of local peers; and a form of random routing that ensures that no (malicious) peer is a choke-point between an y two other peers. In addition to describing our approaches, we present a performance e valuation, which was performed on a local cluster that hosted overlays consisting of 255 peers. W e compare the performance of our system in “secure” and “insecure” modes, and sho w that the performance penalty for secure operation is acceptable. The rest of the paper is or ganized as follows: Section 2 describes our assumptions and the Chord proto- col. Section 3 describes the three parts of our approach and Section 4 describes our e v aluation of the system. Lastly , Section 5 and 6 describe related work, and discuss future work. 2 Pr eliminaries W e selected the Chord [SMK + 01] structured ov erlay to provide lookup services for the Trinity [BB07] system because Chord has good performance characteristics and provides control ov er the location of peers within the ov erlay , which makes securing the overlay easier [SM02, CDG + 02]. The Chord [SMK + 01] ov erlay structure assigns each peer a unique ke y , k , from a 160 -bit ke y-space and org anizes the peers into a single ring in order of their keys. The predecessor and successor of key k are the ke ys k p and k s , respecti vely , belonging to peers in the ring, such that k − k p mo d 2 160 and k s − k mo d 2 160 , respecti vely , are minimal. Intuiti vely , the peer to whom ke y k is assigned is located between its predecessor and successor , the peers to whom the ke ys k p and k s are assigned. If a key k is not assigned to a peer in the ring, then the peer whose k ey is the successor to k is responsible for the k ey . Consequently , each peer is responsible for all the possible ke y values between it and its predecessor . When a peer joins the ring, it locates its position within the ring by sending a “ﬁnd successor” request with its o wn key , k , to a “well known” peer that is already in the ring. The request is routed to the current predecessor of k , whose successor is therefore also the successor of k . The predecessor replies to the new peer , informing it of both the successor and itself. The ne w peer then informs the successor and predecessor of its e xistence and assumes its location in ring. Lastly , the peer b uilds its routing table, called a ﬁnger table. The ﬁnger table is used by the peer to forward a message toward its ev entual destination. The ﬁnger 1 s f 1 f 2 f 3 f 4 f 5 p q the route from p to q g h r Figure 1: The peers labeled f i are in p ’ s ﬁnger table, peer g is in peer f 5 ’ s ﬁnger table, and peer h is in peer g ’ s ﬁnger table. Peers r and s are the predecessor and successor of peer p . table comprises k eys of select peers in the ring. T ypically , the table contains O (log N ) ke ys of peers that are 1 2 i of a ring aw ay , i = 1 . . . log( N ) , where N is the number of peers in the ring (see Figure 1). T o forw ard a message to the peer responsible for key k , the peer with the closest preceding key to k is selected from the ﬁnger table, and the message is forwarded directly to that peer . Thus, the distance to the destination peer is decreased by at least half, and after at most O (log N ) such hops, the message arriv es at the destination. If the closest preceding peer is the current peer , then the message is forw arded directly to the peer’ s successor , its destination. The ﬁnger table is populated by performing additional “ﬁnd successor” queries with key values of the form k + 2 i mo d 2 160 , 0 < i < 160 . Additional ongoing “ﬁnd successor” queries, at regular intervals, are used to update the ﬁnger table as well as the peer’ s successor and predecessor . Also, a simple heart-beat mechanism tracks when peers leav e the ring. Unfortunately , the system as described, is susceptible to many attacks. First, the overlay uses an un- reliable message-based transport protocol, User Datagram Protocol (UDP), that is susceptible to spooﬁng because the source address of a message can easily be forged. Thus, the source of the message can not be (reliably) determined. Second, the system, as described, allows any host to become a peer , which is prob- lematic for a closed overlay and can lead to the admittance of malicious peers. Third, as a result of the ﬁrst two weaknesses, the overlay is susceptible to denial of service attacks because large numbers of messages and requests can be injected into the ov erlay by external hosts. Fourth, the overlay relies on the correct behaviour of all of its constituents. For example, all peers must correctly forward and reply to “ﬁnd successor” requests. Malicious peers can inject fraudulent rout- 2 ing information into the overlay by replying with incorrect “ﬁnd successor” replies, dropping requests, or misdirecting the requests. Consequently , a few collaborating malicious peers could cause segments of the ring to “drop out”. This is a problem even if peers are initially identiﬁed and authenticated prior to joining because peers may be compromised and an initially nonmalicious peer may become malicious. In fact, the only assumption that we can reasonably make, assuming that all peers are identiﬁed and authenticated before joining, is that only a small fraction of peers are malicious. The challenge then, is to limit the ability of the malicious peers to collaborate and disrupt the overlay , to detect malicious peers, and e vict them from the overlay . 3 Design and Implementation Our implementation is an extension of the I3 [SAZ + 04] code-base. Our implementation comprises ﬁ ve parts: (i) a key assignment scheme that links each peer’ s key with its network address 1 while at the same time uniformly distributing the peers’ keys in the ring; (ii) a distributed identiﬁcation, authentication, and (re vocable) authorization mechanism that allows the overlay to control what peers are admitted into the ring; (iii) a message authentication mechanism that links each message to its sender; (i v) a self-policing mechanism based on ov erlapping groups composed of proximate peers; and (v) a simple form of random routing that av oids the possibility of any peer becoming a chok e point between two other peers. 3.1 K ey Assignment As was observed in [SM02] and [CDG + 02], it is harder for malicious peers to collaborate when they are uniformly distrib uted in the ring than when the y are clustered. Consequently , peers should be assigned ke ys from a uniform distribution. Thus, prior to joining, each peer is expected to choose a key from the uniform distribution on the key space. Ho we ver , there is nothing that pre vents malicious peers from choosing keys that facilitate collaboration. Furthermore, a randomly selected key , only encodes the peers position within the ring, not the network, which another peer would need to contact it directly . Lastly , the choice of the peer’ s network address is typically limited and in most cases beyond the control of the peer , malicious or otherwise. W e le verage this restriction to assign keys to peers so that the peers hav e no choice in their key , the key is unique, the key encodes a peer’ s network address, and the key appears to be chosen from the uniform distribution on the ke y space. T o determine its key a peer concatenates its IP address and port number, both in network byte order, to create a 6 byte string. This string is passed through the SHA-1 function, generating a 20 byte hash. The hash is the same length as a ke y , 20 bytes, and appears as if it was chosen from the uniform distribution on the ke y space. 2 Lastly , the IP address and the port number replace the 6 least signiﬁcant bytes of the hash, as suggested in [CDG + 02]. The resulting 20 byte key , can easily be validated by extracting the 6 least signiﬁcant bytes, passing them from the SHA-1 function, and comparing the 14 most signiﬁcant bytes of the resulting hash and the ke y—they should match. The 14 most signiﬁcant bytes of the ke y look as if they were drawn from a uniform distribution, ensuring that the peers are uniformly distrib uted throughout the ring. Lastly , the key uniquely identiﬁes each peer because the IP address of each peer is necessarily unique. Thus, each peer can be uniquely identiﬁed. 1 Both IP address and port number . 2 In reality the hash is uniformly chosen from key subspace of cardinality 2 48 , the size of the input string. 3 3.2 Distributed Identiﬁcation, A uthentication and A uthorization A peer must be identiﬁed, authenticated, and authorized before it can join the overlay . The peer’ s key uniquely identiﬁes the peer , but it does not authenticate the peer , which is a prerequisite for authorization. Since the maliciousness of a peer may be discov ered only after it joins the ring, authorization must be re vocable, in order to facilitate the e xcommunication of such peers. Authentication is accomplished by using a public key signature system—each new peer generates a public-pri vate k ey-pair . A peer authenticates a message by ﬁrst embedding its 20-byte k ey into the message and then signing it. Ho wev er , two problems remain: distribution of the public ke y , and the authorization of the peer . Both of these problems are solved simultaneously by le veraging the Domain Name System (DNS) [Moc87a, Moc87b]. Each ring is identiﬁed by a domain name in the DNS database and each authorized peer in the ring has corresponding a TXT entry within the domain, identiﬁed by the peer’ s key and storing a certiﬁcate that contains the peer’ s public key . The authority responsible for authorizing peers is also responsible for signing the certiﬁcates and for adding or removing the TXT entries. When a peer receiv es a message from another peer , it checks its cache for the sender’ s public key , if present then the sender is authorized to participate in the ring. Otherwise, the receiv er performs a DNS lookup for the sender’ s key in the ring’ s domain. If found, the sender’ s public key is added to the cache and the sender is deemed to be authorized. If not, a negati ve entry is added to the cache, causing the peer to ignore all future messages from the sender until the negati ve entry expires. Authorizations are rev oked by removing the corresponding TXT entry from the DNS database and informing all peers via a broadcast. W e leverage the DNS system because it has proven to be relativ ely robust and fault tolerant. In fact, robustness can be increased by simply adding more name servers. Furthermore, a DNS query is only needed when a new peer joins. In theory , peers could broadcast the certiﬁcates they receive from their DNS queries, informing the ring of the joining peer . Thus, an attack on the DNS system would only prev ent new peers from joining the ring. One problem with our approach is that authenticating each message using a public ke y signature is prohibitiv ely expensi ve. 3.3 Message A uthentication A message is linked to its sender because it contains the sender’ s key and then signed by the sender . Since the keys are unique and contain the sender’ s netw ork address, each message can be traced to its origin. Thus, if fraudulent messages are detected, the sender can be identiﬁed with certainty and excommunicated. Unfortunately , signing and verifying all messages using a public ke y signature system is expensi ve. F or example, to determine the ov erhead of using a public key signature system, we ran a two peer ring on a single 1.60GHz Intel Xeon E5310 (4-core) server with 2 gigabytes of RAM, and had one peer ping the other . This nulliﬁed the any potential netw ork related slowdo wn, and allocated one CPU to each peer , thus a voiding an y issues associated with sharing a CPU. W ithout message authentication, the system performed about 4000 pings per second—approximately 8000 messages per second. W ith message authentication, using public ke y signatures, the number of pings per second dropped to 15—a slowdo wn by a factor of 300! W e solve this problem by using message authentication codes (MA C) as the default authentication mech- anism. The Chord overlay structure exhibits good temporal locality with respect to communication, meaning that if a peer communicates directly with another peer , it will do so repeatedly in the future. The ﬁrst time two peers communicate directly , they exchange shared secret keys (using public key encryption), and use shared ke ys to authenticate all messages to each other . Using HMA C based authentication, the performance of our system went back up to about 3500 pings per second. 4 3.4 Our Brothers’ K eepers Chord overlay structure relies on peers behaving properly: forwarding requests that they cannot satisfy and replying truthfully to requests that they can satisfy . Howe ver , if a malicious peer does not forward requests, or ev en worse, misdirects the requests or sends fraudulent replies, the overlay structure can be subv erted. In particular , maligning the “ﬁnd successor” requests, which are used by peers to ﬁnd their position within the ring and construct ﬁnger tables, can create loops and partitions within the ring, rendering the ov erlay dysfunctional. That is, a few collaborating malicious peers could cause se gments of the ring to “drop out”. Realistically , we can neither ensure that no malicious peer will ev er join, nor can we ensure that no peer will e ver be compromised. Malicious peers are distinguished by their behaviour that, when detected, can be quashed by excommunicating the peer . Thus, by increasing the system’ s ability to detect malicious behaviour , the amount of damage caused by a malicious peer can be limited. Since our key assignment scheme ensures that with high probability two malicious peers will not be near each other in the ring, we use a peer group approach to improve detection of malicious behaviour , i.e., the peer’ s proximate peers keep it honest. Each peer in the ring, is associated with a peer group of size g , where g is a small odd number , such as 5 , 7 , 9 , 11 , etc. The group comprises the peer itself—the group leader—and g − 1 of its closest peers: g − 1 2 closest preceding peers and g − 1 2 closest succeeding peers. Thus, each peer belongs to g overlapping groups of size g . Furthermore, given our assumption about the uniform distribution of malicious peers, the chance of a group having multiple malicious peers is small. When a ne w peer joins the ring, it queries its predecessor and successor for their group memberships, constructs its own group membership list from the responses, and then queries the other peers in its group to conﬁrm their membership. On an ongoing basis, the peers in a group query each other’ s membership lists, updating them as peers join or lea ve. In closed overlays, particularly in the case of Trinity , we assume that the rate at which peers join and leave the ring is relativ ely low . Hence, a peer’ s group membership list will not change often. In fact, a peer is only added to a group only after it has been veriﬁed by the group’ s leader , ensuring that group lists only contain valid peers. These group lists also provide a fast mechanism for ﬁnding a new successor or predecessors if the current one leav es (or fails) the ring. A peer’ s group membership list, should be consistent with those of the group’ s members, e.g., if the group of peer p is ( n, o, p, q , r ) , then peer q ’ s group should be ( o, p, q , r , s ) . Thus, if a peer sends a group list that is inconsistent with the lists of other group members, it is considered malicious, or at least untrustworthy . Consequently , malicious peers cannot easily send fraudulent “ﬁnd successor” responses about their group members, because similar queries to their neighbours would unmask them. The result is that peers cannot send out false “ﬁnd successor” replies to an y of its neighbouring peers without being excommunicated. Ho wev er , it is also necessary to ensure that remote peers are also honest, i.e., those peers that are not within a peer’ s group. This is accomplished by le veraging the group structure. Speciﬁcally , a peer’ s “ﬁnd successor” response is be veriﬁed by querying a member of its peer group, and is based on the fact that peers in the same group will hav e similar ﬁnger tables. Recall, that a peer’ s i th ﬁnger table entry contains the successor to key k + 2 i mo d 2 160 , where k is the peer’ s ke y . Assuming that peers are uniformly distributed in the ring, if peers with ke ys k and k 0 are adjacent, then the successors to k + 2 i mo d 2 160 and k 0 + 2 i mo d 2 160 will likely be close to each other in the ring, if not the same peer . Thus, there will be considerable ov erlap between the groups associated with the i th ﬁnger table entries of the two peers. Consequently , a “ﬁnd successor” response can be veriﬁed by resending the query to a member of the responder’ s group. T o facilitate this approach, and to verify the consistency of the groups associated with the ﬁnger table 5 entries, our implementation uses an expanded ﬁnger table that stores the keys of the peer’ s entire group rather than just the peer’ s key—the ﬁnger table stores g ke ys per entry . Furthermore, a peer’ s “ﬁnd successor” response includes the ke ys of the peer’ s entire group. Since “ﬁnd successor” queries are sent on an ongoing basis, the ﬁnger table entries are updated and check ed on a re gular basis. Lastly , storing entire groups in the ﬁnger table, instead of single peers, facilitates the implementation of a simple randomized routing scheme, mitigating the problem of packet dropping by malicious peers. 3.5 Randomized Routing Even if a malicious peer does not send fraudulent routing responses, it can still cause problem by sim- ply dropping all messages. If a malicious peer is a choke-point between two other peers—all messages from one peer to the other are routed through it—then none of the messages may get through. Detecting this behaviour is problematic because the I3 Chord implementation and many other ov erlay systems use lightweight connectionless unreliable transport protocols, such as UDP . Consequently , it is impossible to distinguish between poor network connectivity and a misbehaving peer . Fortunately , our scheme can mit- igate both problems. W e note that we cannot ensure that no messages will be lost; only that with high probability , not all the messages will pass through the same peer , while in transit. W e use a variant of randomized routing [LMRR94]. T raditional randomized routing forwards the mes- sage to a randomly chosen peer in the system, and then from that peer to the destination. This can dramati- cally increase the latency , particularly if the destination peer is close to the sender but the randomly chosen peer is far a way . Instead, in our scheme, multiple messages between two peers take different but comparable length paths, ensuring that a choke-point can not form. When a message arriv es at a peer , the peer classiﬁes the message’ s destination as either local, near , or far . If the destination is local, then the message has arriv ed at its destination. If the destination is near, then the message is destined to a neighbour of the peer and is forwarded directly to its destination. Otherwise, a peer is selected and the message is forwarded to it. According to the traditional deterministic forwarding protocol, the peer whose ke y most closely precedes the message destination is chosen from the ﬁnger table, and the message is forwarded to this peer . In our implementation, a group is chosen from the ﬁnger table such that the group leader’ s most closely precedes the message destination. Then, a peer is randomly chosen from this group and the message is forwarded to it. Since the ﬁnger tables of the peers in a group are similar , the route taken between two peers will differ in the peers that the messages transit. Howe ver , as discussed in the preceding section, these peers are near each other within the ring, implying that the total number of hops will not v ary greatly . The correctness of the protocol does not change as long as the key of the peer selected from the ﬁnger table precedes the message destination, and since all peers in a group are, by deﬁnition, near each other , the size of each hop is will dif fer by an additiv e constant, resulting in a small variance in the number of hops that a message takes. 4 Evaluation T o ev aluate the performance of our implementation we used a 255 peer ring running on a 26 machine cluster running OpenBSD 4.3 and 4.2. One of these machines was an Intel Xeon X3210 2.13GHz Quad-core based server with 4GB of RAM, which ran 5 peers on it and serv ed as the name server for the cluster . Each of the remaining 25 machines was an Intel Pentium 4 2.80 GHz based desktop with 1 GB of RAM. Each of these desktops ran 10 peers each and all the machines were interconnected via a Cisco WS-C2924–XL-EN and 6 a Cisco WS-C3548-XL-EN managed switches that were locked at 10 Mb/s half-duplex—the mean latency between any two machines in the cluster was 0.5 milliseconds, with a negligible v ariance. W e performed se veral dif ferent tests to measure the latenc y , throughput, and capacity of our implementation in both secure and insecure modes, in order to compare the ov erhead associated with secure mode. 4.1 Latency and Throughput W e ﬁrst compared the latency and throughput ov erhead of secure versus insecure operation. Since peers regenerate and exchange their shared keys at regular intervals, different parts of ring had different loads at dif ferent times. T o compensate for this, a series of test runs were performed, spanning a sufﬁciently large time interv al, and the minimums over these test runs were used. Each test comprises two communicating peers: the initiator, which conducts and times the test, and the responder , which serves as the other end-point of the communication. The latency test measures the round trip time of a ping and its echo. The initiator pings the responder , which echos the ping—both the ping and the echo are routed through the ov erlay . The test is repeated sequentially a set number of times and the count is divided by the total time, yielding the round trip time per ping. The throughput test measures ho w fast packets (or messages) can be sent through the ov erlay . The initiator sends a throughput request to the responder , indicating the number of packets the responder should send back. The responder sends the requested number of pack ets (through the ov erlay) as quickly as possible, and the initiator measures the time dif ference between the arriv al of the ﬁrst and last packets—the number of packets di vided by the dif ference is the throughput. Our e valuation ﬁx ed one of the ﬁv e peers on the 4-core server to be the initiator , and used the 250 peers running on the 25 desktops as responders. For both latency and throughput measurements, the initiator performed 12 test series consisting of 10 test runs that consisted of 250 tests, once for each peer . Each latency test performed 10 pings at a time and each throughput test had the responder send back 1000 packets. Each series takes the minimum measurement for each peer ov er the 10 runs. The minimums for each peer from the 12 series are av eraged to yield the latency or throughput measurement. T able 1 displays the mean, median, maximum, minimum, and standard de viation round trip times and throughput measured for all 250 peers. The table sho ws the measurements for both insecure mode operation and secure mode operation, and the ov erhead of the secure mode. Latency Throughput Insecure Op. Secure Op. Relati ve Insecure Op. Secure Op. Relati ve R TT (sec) R TT (sec) Dif ference Pkts / sec Pkts / sec Dif ference Mean 0.002874 0.003457 20.2% 6148 4946 19.4% Median 0.002897 0.003483 20.2% 6389 5087 20.4% Maximum 0.003542 0.004282 20.9% 7794 6566 15.8% Minimum 0.000759 0.000880 15.9% 3107 2643 14.9% Std. Dev . 0.000335 0.000411 N/A 1164 930 N/A T able 1: Summary statistics of round trip times to peers and packets per second from peers. The measured latency in secure mode is 20% greater than the latency in insecure mode. Although, this seems high, it is important to remember that there were 10 peers running on each host, making the system CPU bound and that the time difference, 0.6 milliseconds, is negligible compared to the typical latency between two hosts in the Internet. 7 The throughput in secure mode is also on av erage 20% lower . This is due to the cost of authenticating messages: the sender has to sign each message and the recei ver has to verify the message. Since message authentication is a CPU bounded task, its ef fect will be less when only one peer is running on each server . It is more instructi ve to vie w the round trip times for each peer and throughput from each peer in a sorted order . The ﬁrst graph in Figure 2 shows the round trip times to all the peers for both insecure and secure operation modes, in ascending order of times measured in insecure mode. The second graph in Figure 2 sho ws the throughput from all the peers for both insecure and secure operation modes, in descending order of times measured in insecure mode. Figure 2: Round trip times to peers. Se veral artifacts are immediately visible in the ﬁrst graph: First, four peers have much lower round trip times. These peers are the successors and predecessors of the peer performing the ping, and hence both the ping and the response only take one hop. Second, there is large jump in round trip times for both insecure and secures modes; approximately , 0.0025 and 0.003 seconds respectively . Since the minimum latency between tw o peers in the cluster is 0.0005 seconds, this means that pings to and from all the other peers take between 6 and 9 hops, which makes sense for a ring of 255 peers. Lastly , and most importantly , the relativ e dif ference in latency between insecure and secure operation remains ﬁxed, at 0.06 milliseconds per hop. The second graph also exhibits a couple important features. First, the graph has a step feature, corre- sponding to the distances between the initiator and the responders. The closer a responder is to the initiator , the higher the measured throughput. Second, the relativ e decrease in throughput between insecure and se- cure operation remains relati vely constant. As before the primary reason for the reduction is the cost of message authentication and is noticeable because 10 peers were running on each singe-core machine. 4.2 Capacity The capacity of an ov erlay is the measure of the number of messages that the system can deliv er per unit time. T o measure the system’ s capacity we implemented a game of hot-potato over the overlay: A set number of messages (potatoes) are injected into the system. The potatoes are randomly passed from peer to peer , and counter in each potato tracks the number of times the potato is passed. By varying the number of concurrent potatoes in the system, we control the system’ s load. When a peer receives a potato, it increments the potato’ s counter , generates a random key , and sends the potato to the peer responsible for the random key . T o ensure that no potato is dropped, the recei ving peer acknowledges the potato, and the sender acknowledges the acknowledgment. Only after receiving the second ackno wledgment does the receiv er commence the next potato pass. If potato’ s originator recei ves it, 8 and the potato has been in the system for a minimum amount of time, e.g., 60 seconds, the number of passes per second for the potato is computed, by dividing the v alue of the potato’ s pass counter by the number of seconds that the potato was in the system. The potato’ s time to liv e counter is then decremented, and if nonzero, the potato’ s pass counter is reset and the potato is injected into the system again. This ensures a period of consistent load. In each of the runs, the ﬁrst measurement from the ﬁrst 75 ejected potatoes was used. T able 2 exhibits the mean, median, standard deviation, maximum, and minimum number of passes per second that a potato achie ved under different system loads: 10, 20, 30, 40, 50, 60, 70, 80, and 160 potatoes in the system. Note: a pass consists of a 3-message exchange between two peers in the system and message deliv ery may take multiple hops within the ov erlay . # of msgs 10 20 30 40 50 60 70 80 160 Insecure Mode Operation Mean 163 134 107 86 72 60 51 45 23 Median 163 134 106 86 72 60 51 45 22 Std. Dev . 3.3 2.8 2.1 2.3 2.0 1.6 1.4 1.8 2.4 Maximum 172 141 113 92 77 65 54 50 33 Minimum 156 127 103 81 67 56 48 42 19 Secure Mode Operation Mean 138 115 93 76 62 53 45 40 20 Median 137 115 94 76 62 53 45 39 19 Std. Dev . 3.4 2.0 2.1 1.6 1.4 1.3 1.6 1.8 2.6 Maximum 147 120 98 79 68 55 49 46 29 Minimum 131 109 89 71 58 47 42 36 15 T able 2: Number of passes per second that a message takes. Figure 3: Capacity of overlay . As the load increases, the number of passes per second of a potato decreases because the likelihood that a peer may need to process multiple potatoes at once increases. Ho wev er , passes per second of a potato does not yield a measure of the capacity of the system as a whole. The capacity of the system is the number of passes per second that the system performs over all. This is equal to the av erage number of passes per 9 second multiplied by the number potatoes in the system. Figure 3 e xhibits the capacity of the system for both insecure and secure operation modes. The capacity of the system is 3600 and 3150 passes per second in insecure and secure operation modes, respecti vely . In both cases the system becomes saturated at 50 potatoes, but capacity does not degrade as the number of potatoes increases. The relati ve difference in capacity is 12.5%, and is predominately affected by the CPU bounded task of message authentication. 5 Related W ork The challenge of securing peer-to-peer systems has been around since their advent. Sit and Morris [SM02] ﬁrst identiﬁed a set of design principles for securing peer-to-peer systems and described a taxonomy of v arious attacks against them. This work was extended by W allach [W al02] who inv estigated the security aspects of systems such as CAN [RFH + 01], Chord [SMK + 01], Pastry [RD01], and T apestry [ZKJ01], and discussed issues such as ke y assignment, routing, and excommunication of malicious peers. Castro et al. [CDG + 02] proposed sev eral approaches to securing peer-to-peer overlays. They proposed to dele gate assignment of ke ys to trusted certiﬁcation authorities, that would ensure that the k eys are chosen at random, and that each peer is bound to a unique ke y , with the peer’ s IP embedded in the key . T o securely route messages, they proposed to use constrained routing tables, which contain ke ys from speciﬁc locations in the overlay . In our case Chord already constrains a ke y’ s location within the overlay , obviating the need for constrained routing tables. In fact, our self-policing and random routing mechanisms lev erage this constraint. Castro et al. [CDG + 02] also proposed a routing failure test that tries to determine what nodes are mali- cious. Their approach also sends multiple copies of the message through div erse routes to ensure message deli very . Our approach is similar but less resource intensiv e. Our system uses the peer groups to detect faulty routing information, and to ensure that no peer is a choke-point between two other peers. Our system does not attempt to ensure the delivery of all messages, but instead attempts to ensure that some messages will be deli vered. Lastly , there are many ways to secure a peer-to-peer system, for example LOCKSS [MRR + 03] uses majority voting replicas and computationally rate-limiting cryptographic puzzles [DN93]. Unfortunately , these approaches sev erely impact system performance and are not practical in the context where good per - formance is a necessity . 6 Conclusion and Futur e W ork W e hav e designed and implemented a secure and efﬁcient extension to the I3 [SAZ + 04] implementation of the Chord structured ov erlay [SMK + 01]. Our e xtension is aimed at closed o verlays in which membership is tightly controlled. This context requires mechanisms for peer identiﬁcation, authentication, and authoriza- tion, mechanisms for message authentication, and mechanisms to mitigate the beha viour of malicious peers in the ov erlay , which are unavoidable. Our implementation uses a simple hashing scheme to generate keys that are linked to peer’ s network address, and are uniformly distrib uted in the ke y space. The ke ys are embedded into messages, linking each message to its sender via an ef ﬁcient two-part authentication mechanism, combining public k ey and HMA C message authentication. Secure routing is implemented via self-policing peer groups that force malicious peers to either behave properly or face detection. Lastly , these groups are leveraged for a simple random routing scheme that pre vents choke-points within the o verlay . 10 Our ev aluation, which was performed on a local cluster , has demonstrated that our implementation’ s ov erhead, of about 20%, is primarily due to CPU bounded operations. W e believ e that this ef fect will signiﬁcantly decrease under normal conditions in the larger Internet context where latency will dominate, and where multiple peers are not running on the same host. T o validate this hypothesis, we intend to perform a more realistic e valuation using the Planet-Lab plat- form, which spans the world and will allo w us to test much larger ov erlays. W e are in the process of im- plementing the T rinity [BB07] e-mail classiﬁcation system on top of our secure ov erlay . This will provide additional opportunities to identify and solve performance bottlenecks in our implementation. Refer ences [BB07] A. Brodsky and D. Brodsky . A distributed content independent method for spam detection. In Pr oc. of the 1st USENIX W orkshop on Hot T opics in Understanding Botnet , 2007. [CDG + 02] M. Castro, P . Druschel, A. Ganesh, A. Rowstron, and D. W allach. Secure routing for structured peer-to-peer o verlay netw orks. In Pr oc. of the 5th A CM Symposium on Operating System Design and Implementation , 2002. [DN93] C. Dwork and M. Naor . Pricing via processing or combatting junk mail. In E. F . Brickell, editor , Advances in Cryptology – CRYPT O ’ 92 , v olume 740 of Lectur e Notes in Computer Science , pages 139–147. International Association for Cryptologic Research, Springer-V erlag, Berlin Germany , 1993. [LMRR94] T . Leighton, B. Maggs, A. Ranade, and S. Rao. Randomized routing and sorting on ﬁxed- connection networks. J. Algorithms , 17(1):157–205, 1994. [Moc87a] P . Mockapetris. RFC 1034 – Domain Names - Concepts and F acilities . Internet Engineering T ask Force, 1987. [Moc87b] P . Mockapetris. RFC 1035 – Domain Names - Implementation and Speciﬁcation . Internet Engineering T ask Force, 1987. [MRR + 03] P . Maniatis, D. Rosenthal, M. Roussopoulos, M. Baker , T . Giuli, and Y . Muliadi. Preserving peer replicas by rate-limited sampled voting. In Pr oc. of the 19th A CM Symposium on Operating Systems Principles , 2003. [RD01] A.. Rowstron and P .. Druschel. Pastry: Scalable, decentralized object location, and routing for large-scale peer -to-peer systems. Lectur e Notes in Computer Science , 2218:329–??, 2001. [RFH + 01] S. Ratnasamy , P . Francis, M. Handley , R. Karp, and S. Shenker . A scalable content-addressable network. In Proc. of the A CM SIGCOMM 2001 Conference , pages 161–172, 2001. [SAZ + 04] I. Stoica, D. Adkins, S. Zhuang, S. Shenker , and S. Surana. Internet indirection infrastructure. IEEE/A CM T ransactions on Networks , 12(2):205–218, 2004. [SM02] E. Sit and R. Morris. Security considerations for peer-to-peer distrib uted hash tables. In Int. W orkshop on P eer-to-P eer Systems , volume 2429 of Lectur e Notes in Computer Science , 2002. [SMK + 01] I. Stoica, R. Morris, D. Karger , M. F . Kaashoek, and H. Balakrishnan. Chord: A scalable peer-to-peer lookup service for internet applications. In Pr oc. of the ACM SIGCOMM 2001 Confer ence , 2001. [W al02] D. W allach. A survey of peer-to-peer security issues. In M. Okada, B. Pierce, A. Scedrov , H. T okuda, and A. Y onezaw a, editors, ISSS , volume 2609 of Lectur e Notes in Computer Sci- ence , 2002. [ZKJ01] B. Zhao, J. Kubiato wicz, and A. Joseph. T apestry: An infrastructure for fault-tolerant wide-area location androuting. T echnical report, April 04 2001. 11

Our Brothers Keepers: Secure Routing with High Performance

Original Paper

Comments & Academic Discussion

Leave a Comment

Original Paper

Related Papers

Comments & Academic Discussion

Leave a Comment