Discussion:
[tor-dev] Starting with contributing to Anonymous Local Count Statistics.
Aruna Maurya
2018-01-31 11:42:14 UTC
Permalink
Hey!

I was going through the Tor Volunteer page and came across the Anonymous
local count statistics project. As a student it would be a great starting
point and an even bigger opportunity to get a chance to collaborate and
learn in the process.

I would like to contribute to it, and would love to start as soon as
possible. It would be great if someone could guide me through.

Thanks for your time!!
--
Regards,
Aruna Maurya,
CSE,B.tech,
Blog <https://themindreserves.wordpress.com/> | Medium
<https://medium.com/@arunamaurya>
George Kadianakis
2018-01-31 13:02:22 UTC
Permalink
[ text/plain ]
Hey!
I was going through the Tor Volunteer page and came across the Anonymous
local count statistics project. As a student it would be a great starting
point and an even bigger opportunity to get a chance to collaborate and
learn in the process.
I would like to contribute to it, and would love to start as soon as
possible. It would be great if someone could guide me through.
Hello Aruna,

thanks for reaching out.

I also find this project interesting. I'd like to help you but my time
is quite limited lately.

What would you like guidance with?

With regards to design, I suggest you take a look at the last comments
of this trac ticket: https://trac.torproject.org/projects/tor/ticket/7532#comment:22
Particularly it seems like the PCSA algorithm might be a reasonable way
forward.

With regards to coding, I suggest you familiarize yourself with the Tor
codebase. Some specific places to look at would be the way that Tor
currently counts users. For example, see geoip_note_client_seen() and
its callers, for when bridges register new clients to their stats
subsystem. Also check geoip_format_bridge_stats() for when bridges
finally report those stats.

Let us know if you have any specific questions!

Cheers!
Aruna Maurya
2018-02-02 10:00:42 UTC
Permalink
Hey!

What is the current status of the project, how much work has been done and
where can I pick up from?
---------- Forwarded message ----------
Date: Wed, Jan 31, 2018 at 6:32 PM
Subject: Re: [tor-dev] Starting with contributing to Anonymous Local Count
Statistics.
[ text/plain ]
Hey!
I was going through the Tor Volunteer page and came across the Anonymous
local count statistics project. As a student it would be a great starting
point and an even bigger opportunity to get a chance to collaborate and
learn in the process.
I would like to contribute to it, and would love to start as soon as
possible. It would be great if someone could guide me through.
Hello Aruna,
thanks for reaching out.
I also find this project interesting. I'd like to help you but my time
is quite limited lately.
What would you like guidance with?
With regards to design, I suggest you take a look at the last comments
of this trac ticket: https://trac.torproject.org/pr
ojects/tor/ticket/7532#comment:22
Particularly it seems like the PCSA algorithm might be a reasonable way
forward.
With regards to coding, I suggest you familiarize yourself with the Tor
codebase. Some specific places to look at would be the way that Tor
currently counts users. For example, see geoip_note_client_seen() and
its callers, for when bridges register new clients to their stats
subsystem. Also check geoip_format_bridge_stats() for when bridges
finally report those stats.
Let us know if you have any specific questions!
Cheers!
--
Regards,
Aruna Maurya,
CSE,B.tech,
Blog <https://themindreserves.wordpress.com/> | Medium
--
Regards,
Aruna Maurya,
CSE,B.tech,
Blog <https://themindreserves.wordpress.com/> | Medium
<https://medium.com/@arunamaurya>
George Kadianakis
2018-02-02 10:18:49 UTC
Permalink
[ text/plain ]
Hey!
What is the current status of the project, how much work has been done and
where can I pick up from?
Hi!

The project is currently not being worked on.

Mainly design work has been done so far; no code has been written.
See: https://lists.torproject.org/pipermail/tor-dev/2017-March/012001.html
https://lists.torproject.org/pipermail/tor-dev/2017-March/012073.html

I suggest you pick it up by fleshing out the design work and seeing if
it works for you, and then checking out the code to see where you need
to inject the code. Perhaps you can also get in touch with Jaskaran
Singh (***@gmail.com) who did all the previous design work to see
if he is interested in collaborating!

Cheers!
---------- Forwarded message ----------
Date: Wed, Jan 31, 2018 at 6:32 PM
Subject: Re: [tor-dev] Starting with contributing to Anonymous Local Count
Statistics.
[ text/plain ]
Hey!
I was going through the Tor Volunteer page and came across the Anonymous
local count statistics project. As a student it would be a great starting
point and an even bigger opportunity to get a chance to collaborate and
learn in the process.
I would like to contribute to it, and would love to start as soon as
possible. It would be great if someone could guide me through.
Hello Aruna,
thanks for reaching out.
I also find this project interesting. I'd like to help you but my time
is quite limited lately.
What would you like guidance with?
With regards to design, I suggest you take a look at the last comments
of this trac ticket: https://trac.torproject.org/pr
ojects/tor/ticket/7532#comment:22
Particularly it seems like the PCSA algorithm might be a reasonable way
forward.
With regards to coding, I suggest you familiarize yourself with the Tor
codebase. Some specific places to look at would be the way that Tor
currently counts users. For example, see geoip_note_client_seen() and
its callers, for when bridges register new clients to their stats
subsystem. Also check geoip_format_bridge_stats() for when bridges
finally report those stats.
Let us know if you have any specific questions!
Cheers!
--
Regards,
Aruna Maurya,
CSE,B.tech,
Blog <https://themindreserves.wordpress.com/> | Medium
--
Regards,
Aruna Maurya,
CSE,B.tech,
Blog <https://themindreserves.wordpress.com/> | Medium
[ text/plain ]
_______________________________________________
tor-dev mailing list
https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev
Jaskaran Singh
2018-02-02 10:41:15 UTC
Permalink
Hi,

I thought the project idea had already been depreciated in favor of
counting unique users by directory fetches. No?

Regards,
Jaskaran
Post by George Kadianakis
[ text/plain ]
Hey!
What is the current status of the project, how much work has been done
and
where can I pick up from?
Hi!
The project is currently not being worked on.
Mainly design work has been done so far; no code has been written.
See: https://lists.torproject.org/pipermail/tor-dev/2017-March/
012001.html
https://lists.torproject.org/pipermail/tor-dev/2017-March/
012073.html
I suggest you pick it up by fleshing out the design work and seeing if
it works for you, and then checking out the code to see where you need
to inject the code. Perhaps you can also get in touch with Jaskaran
if he is interested in collaborating!
Cheers!
---------- Forwarded message ----------
Date: Wed, Jan 31, 2018 at 6:32 PM
Subject: Re: [tor-dev] Starting with contributing to Anonymous Local
Count
Statistics.
[ text/plain ]
Hey!
I was going through the Tor Volunteer page and came across the
Anonymous
local count statistics project. As a student it would be a great
starting
point and an even bigger opportunity to get a chance to collaborate
and
learn in the process.
I would like to contribute to it, and would love to start as soon as
possible. It would be great if someone could guide me through.
Hello Aruna,
thanks for reaching out.
I also find this project interesting. I'd like to help you but my time
is quite limited lately.
What would you like guidance with?
With regards to design, I suggest you take a look at the last comments
of this trac ticket: https://trac.torproject.org/pr
ojects/tor/ticket/7532#comment:22
Particularly it seems like the PCSA algorithm might be a reasonable way
forward.
With regards to coding, I suggest you familiarize yourself with the Tor
codebase. Some specific places to look at would be the way that Tor
currently counts users. For example, see geoip_note_client_seen() and
its callers, for when bridges register new clients to their stats
subsystem. Also check geoip_format_bridge_stats() for when bridges
finally report those stats.
Let us know if you have any specific questions!
Cheers!
--
Regards,
Aruna Maurya,
CSE,B.tech,
Blog <https://themindreserves.wordpress.com/> | Medium
--
Regards,
Aruna Maurya,
CSE,B.tech,
Blog <https://themindreserves.wordpress.com/> | Medium
[ text/plain ]
_______________________________________________
tor-dev mailing list
https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev
_______________________________________________
tor-dev mailing list
https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev
George Kadianakis
2018-02-02 11:03:47 UTC
Permalink
[ text/plain ]
Hi,
I thought the project idea had already been depreciated in favor of
counting unique users by directory fetches. No?
Yes, we do count unique users by directory fetches for the "active Tor
users" metric: https://metrics.torproject.org/userstats-relay-country.html

But we also use in-memory data structures tracking IP addresses to count
unique users per-country: https://metrics.torproject.org/userstats-bridge-combined.html?start=2017-11-04&end=2018-02-02&country=dz

I was not aware that we are planning to deprecate the latter in favor of
counting directory fetches. Did you get that from somewhere? Perhaps it
could make sense, not sure.

Cheers!
Jaskaran Singh
2018-02-02 12:08:25 UTC
Permalink
Hi,

Oops. I meant "counting unique users just like we do with directory
fetches". The argument given was that most users are already behind NAT and
hence counting unique IP addrs would not be accurate anyway. It was
suggested that we count per connection country statistics (that is, total
number of connections coming from a country) and divide that by average
number of connections a user makes, to arrive at estimated number of unique
users. I cannot find where I got to know this. Maybe on IRC, but I don't
have logs of an year ago.

Also, the Metrics team has(?) to come up with a proposal on this IIRC.
Until then it would not be considered a valid project?

Karsten would have something to say on this.

cc: karsten
Post by George Kadianakis
[ text/plain ]
Hi,
I thought the project idea had already been depreciated in favor of
counting unique users by directory fetches. No?
Yes, we do count unique users by directory fetches for the "active Tor
users" metric: https://metrics.torproject.org/userstats-relay-country.html
But we also use in-memory data structures tracking IP addresses to count
unique users per-country: https://metrics.torproject.
org/userstats-bridge-combined.html?start=2017-11-04&end=
2018-02-02&country=dz
I was not aware that we are planning to deprecate the latter in favor of
counting directory fetches. Did you get that from somewhere? Perhaps it
could make sense, not sure.
Cheers!
Iain Learmonth
2018-02-03 10:49:25 UTC
Permalink
Hi,
Post by Jaskaran Singh
Also, the Metrics team has(?) to come up with a proposal on this IIRC.
Until then it would not be considered a valid project?
Considered a valid project by whom?

Metrics is currently very busy and may not have time to assist with the
project, but it doesn't mean that the work shouldn't be done or that it
isn't "valid".

It's more important that the project is safe than that Metrics has put
time into it. If the work shows promise then maybe that would be a
reason for Metrics to reprioritise it later.

Until an analysis of the safety is done (which may have already happened
from the design, I've not followed this) just use a test network for the
development. In this way, you can only be attacking yourself.

Thanks,
Iain.
Aruna Maurya
2018-02-09 17:22:49 UTC
Permalink
Hey!

Is anybody willing to mentor me on this project?
Post by Iain Learmonth
Hi,
Post by Jaskaran Singh
Also, the Metrics team has(?) to come up with a proposal on this IIRC.
Until then it would not be considered a valid project?
Considered a valid project by whom?
Metrics is currently very busy and may not have time to assist with the
project, but it doesn't mean that the work shouldn't be done or that it
isn't "valid".
It's more important that the project is safe than that Metrics has put
time into it. If the work shows promise then maybe that would be a
reason for Metrics to reprioritise it later.
Until an analysis of the safety is done (which may have already happened
from the design, I've not followed this) just use a test network for the
development. In this way, you can only be attacking yourself.
Thanks,
Iain.
_______________________________________________
tor-dev mailing list
https://lists.torproject.org/cgi-bin/mailman/listinfo/tor-dev
--
Regards,
Aruna Maurya,
CSE,B.tech,
Blog <https://themindreserves.wordpress.com/> | Medium
<https://medium.com/@arunamaurya>
Loading...