30ee67debe7fc51026b80082b23c6e5d2945837e
Andrew Lewman create the research page.

Andrew Lewman authored 13 years ago

1) ## translation metadata
Roger Dingledine looks like we never set the...

Roger Dingledine authored 13 years ago

2) # Revision: $Revision$
Andrew Lewman create the research page.

Andrew Lewman authored 13 years ago

3) # Translation-Priority: 4-optional
4) 
5) #include "head.wmi" TITLE="Tor: Research" CHARSET="UTF-8"
6) <div id="content" class="clearfix">
7)   <div id="breadcrumbs">
8)     <a href="<page index>">Home &raquo; </a>
9)     <a href="<page getinvolved/research>">Research</a>
10)   </div>
11)   <div id="maincol"> 
12)     <!-- PUT CONTENT AFTER THIS TAG -->
13) <h2>Tor: Research</h2>
14) <hr />
15) 
16) <p>
17) Many people around the world are doing research on how to improve the Tor
18) design, what's going on in the Tor network, and more generally on attacks
19) and defenses for anonymous communication systems. This page summarizes
20) the resources we provide to help make your Tor research more effective.
21) The best way to reach us about research is through the <a href="<page
Andrew Lewman fix two page links I missed.

Andrew Lewman authored 13 years ago

22) about/contact>">tor-assistants</a> list.
Andrew Lewman create the research page.

Andrew Lewman authored 13 years ago

23) </p>
24) 
25) <ul>
26) 
27) <li>
28) <b>Data.</b>
29) We've been <a href="https://metrics.torproject.org/data.html">collecting
30) data to learn more about the Tor network</a>: how many relays and
31) clients there are in the network, what capabilities they have, how
32) fast the network is, how many clients are connecting via bridges,
33) what traffic exits the network, etc. We are also developing
34) tools to process these huge data archives and come up with
35) <a href="https://metrics.torproject.org/graphs.html">useful
36) statistics</a>.  For example, we provide a <a
37) href="https://gitweb.torproject.org//ernie.git?a=blob_plain;f=doc/manual.pdf">tool
38) called Ernie</a> that can import relay descriptors into a local database
39) to perform analyses. Let us know what other information you'd like to
40) see, and we can work with you to help make sure it gets collected
41) <a href="https://metrics.torproject.org/papers/wecsr10.pdf">safely</a>
42) and robustly.
43) </li>
44) 
45) <li>
46) <b>Analysis.</b>
47) If you're investigating Tor, or solving a Tor-related problem,
48) <i>_please_</i> talk to us somewhere along the way &mdash; the earlier
49) the better. These days we review too many conference paper submissions
50) that make bad assumptions and end up solving the wrong problem. Since
51) the Tor protocol and the Tor network are both moving targets, measuring
52) things without understanding what's going on behind the scenes is going
53) to result in bad conclusions. In particular, different groups often
54) unwittingly run a variety of experiments in parallel, and at the same
55) time we're constantly modifying the design to try new approaches. If
56) you let us know what you're doing and what you're trying to learn,
57) we can help you understand what other variables to expect and how to
58) interpret your results.
59) </li>
60) 
61) <li>
62) <b>Measurement and attack tools.</b>
63) We're building a <a
64) href="https://metrics.torproject.org/tools.html">repository</a> of tools
65) that can be used to measure, analyze, or perform attacks on Tor. Many
66) research groups end up needing to do similar measurements (for example,
67) change the Tor design in some way and then see if latency improves),
68) and we hope to help everybody standardize on a few tools and then make
69) them really good. Also, while there are some really neat Tor attacks
70) that people have published about, it's hard to track down a copy of
71) the code they used. Let us know if you have new tools we should list,
72) or improvements to the existing ones. The more the better, at this stage.
73) </li>
74) 
75) <li>
76) <b>We need defenses too &mdash; not just attacks.</b>
77) Most researchers find it easy and fun to come up with novel attacks on
78) anonymity systems. We've seen this result lately in terms of improved
79) congestion attacks, attacks based on remotely measuring latency or
80) throughput, and so on. Knowing how things can go wrong is important,
81) and we recognize that the incentives in academia aren't aligned with
82) spending energy on designing defenses, but it sure would be great to
83) get more attention to how to address the attacks. We'd love to help
84) brainstorm about how to make Tor better. As a bonus, your paper might
85) even end up with a stronger "countermeasures" section.
86) </li>
87) 
88) <li>
89) <b>In-person help.</b>
90) If you're doing interesting and important Tor research and need help
91) understanding how the Tor network or design works, interpreting your
92) data, crafting your experiments, etc, we can send a Tor researcher to
93) your doorstep. As you might expect, we don't have a lot of free time;
94) but making sure that research is done in a way that's useful to us is
95) really important. So let us know, and we'll work something out.
96) </li>
97) 
98) </ul>
99) 
100) <a id="Groups"></a>
101) <h2><a class="anchor" href="#Groups">Research Groups</a></h2>
102) 
103) <p>Interested to find other anonymity researchers? Here are some
104) research groups you should take a look at.</p>
105) 
106) <ul>
107) <li>Ian Goldberg's <a href="http://crysp.uwaterloo.ca/">CrySP</a> group
108) at Waterloo.
109) </li>
110) <li><a href="http://www-users.cs.umn.edu/~hopper/">Nick Hopper</a>'s
111) group at UMN.
112) </li>
113) <li><a href="http://www.hatswitch.org/~nikita/">Nikita Borisov</a>'s
114) group at Illinois.
115) </li>
116) <li>Matt Wright's <a href="http://isec.uta.edu/">iSec</a> group at
117) UTA.
118) </li>
119) </ul>
120) 
121) <a id="Ideas"></a>
122) <h2><a class="anchor" href="#Ideas">Research Ideas</a></h2>
123) 
124) <p>
125) If you're interested in anonymity research, you must make it to the
126) <a href="http://petsymposium.org/">Privacy Enhancing Technologies
127) Symposium</a>. Everybody who's anybody in the anonymity research world
Andrew Lewman remove the reference to 201...

Andrew Lewman authored 13 years ago

128) will be there. Stipends are generally available for people whose presence
129) will benefit the community.
Andrew Lewman create the research page.

Andrew Lewman authored 13 years ago

130) </p>
131) 
132) <p>To get up to speed on anonymity research, read <a
133) href="http://freehaven.net/anonbib/">these papers</a> (especially the
134) ones in boxes).</p>
135) 
136) <p>We need people to attack the system, quantify defenses,
137) etc. Here are some example projects:</p>
138) 
139) <ul>
140) 
Roger Dingledine add the newest research blo...

Roger Dingledine authored 13 years ago

141) <li>For various diversity metrics, how has the diversity of
142) the Tor network changed over time? How robust is it to change or
143) attack? These results can help us make better design decisions. See the <a
144) href="<blog>/research-problem-measuring-safety-tor-network">blog post</a>
145) for details.
146) </li>
147) 
Andrew Lewman create the research page.

Andrew Lewman authored 13 years ago

148) <li>If we prevent the really loud users from using too much of the Tor
149) network, how much can it help? We've instrumented Tor's entry relays
150) so they can rate-limit connections from users, and we've instrumented
151) the directory authorities so they can change the rate-limiting
152) parameters globally across the network. Which parameter values improve
153) performance for the Tor network as a whole? How should relays adapt
154) their rate-limiting parameters based on their capacity and based on
155) the network load they see, and what rate-limiting algorithms will work
156) best? See the <a
157) href="<blog>/research-problem-adaptive-throttling-tor-clients-entry-guards">blog
158) post</a> for details.
159) </li>
160) 
161) <li>Right now Tor clients are willing to reuse a given circuit for ten
162) minutes after it's first used. The goal is to avoid loading down the
163) network with too many circuit creations, yet to also avoid having
164) clients use the same circuit for so long that the exit node can build a
165) useful pseudonymous profile of them. Alas, ten minutes is probably way
166) too long, especially if connections from multiple protocols (e.g. IM and
167) web browsing) are put on the same circuit. If we keep fixed the overall
168) number of circuit extends that the network needs to do, are there more
169) efficient and/or safer ways for clients to allocate streams to circuits,
170) or for clients to build preemptive circuits? Perhaps this research item
171) needs to start with gathering some traces of what requests typical
172) clients try to launch, so you have something realistic to try to optimize.
173) </li>
174) 
175) <li>The "website fingerprinting attack": make a list of a few
176) hundred popular websites, download their pages, and make a set of
177) "signatures" for each site. Then observe a Tor client's traffic. As
178) you watch him receive data, you quickly approach a guess about which
179) (if any) of those sites he is visiting. First, how effective is
180) this attack on the deployed Tor design? The problem with all the
181) previous attack papers is that they look at timing and counting of
182) IP packets on the wire. But OpenSSL's TLS records, plus Tor's use of
183) TCP pushback to do rate limiting, means that tracing by IP packets
184) produces very poor results. The right approach is to realize that
185) Tor uses OpenSSL, look inside the TLS record at the TLS headers, and
186) figure out how many 512-byte cells are being sent or received. Then
187) start exploring defenses: for example, we could change Tor's cell
188) size from 512 bytes to 1024 bytes, we could employ padding techniques
189) like <a href="http://freehaven.net/anonbib/#timing-fc2004">defensive
190) dropping</a>, or we could add traffic delays. How much of an impact do
191) these have, and how much usability impact (using some suitable metric)
192) is there from a successful defense in each case?</li>
193) 
194) <!--
195) <li>
196) Path selection algorithms, directory fetching schedules for Tor-on-mobile
197) that are compatible anonymity-wise with our current approaches.
198) </li>
199) 
200) -->
201) 
Roger Dingledine add the newest research blo...

Roger Dingledine authored 13 years ago

202) <li>More coming soon. See also the "Research" section of the <a
203) href="<page getinvolved/volunteer>#Research">volunteer</a> page for
204) other topics.
Andrew Lewman create the research page.

Andrew Lewman authored 13 years ago

205) </li>
206) 
207) </ul>
208) 
209)   </div>
210)   <!-- END MAINCOL -->
211)   <div id = "sidecol">
212) #include "side.wmi"
213) #include "info.wmi"
214)   </div>
215)   <!-- END SIDECOL -->
216) </div>
217) <!-- END CONTENT -->