Skip to content

Study suggestion: Survey influence of tails and CG-Nat #16

@gfr10598

Description

@gfr10598

We may be able to get some insights into distributions within IP addresses, and their influence on aggregate distributions.

  1. Compute CDFs on 5 or 10 bins per decade. Compute at a modest aggregation level, e.g. state, county, or possibly city. Use a fairly large time interval, like 3 months or a year.
    a. Include all tests from all clients.
    a. Use 1st, 5th, 10th, 25th, 50th, 75th, 90th, 95th, 99th percentile per IP.
    a. Repeat excluding hottest IPs, with > 2 tests/day.
    a. Repeat with only IPs that have very few tests, less than 3 per week.
    a. Repeat with only IPs that have frequent tests - more than 3 per week.
    a. Repeat with only hot IPs - those with more than 2 tests/day.

Repeat using WScale or CWnd to distinguish clients within an IP address?

Repeat for individual ASN or ISP. Some will have higher rates of CG-Nat than others.

Compare US vs EU vs later internet adopters.

With cold IPs, there should be little spread between percentiles, because most IPs with have only 1 or 2 tests.

With warm and hot IPs, the spread will be greater if there are multiple clients per IP, less if there is very little CG_NAT influence.

Possibly repeat, but use ratios, e.g. of 5th percentile and median.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions