Index of /datasets/supplement/2021-aintec-hoiho

Name                                Last modified     Size
Parent Directory                                         -
201007-midar-iff-asnames.dict       2021-12-14 11:46  4.9K
201007-midar-iff-asnames.json.bz2   2021-12-14 11:46  2.3M
201007-midar-iff-asnames.re         2021-12-14 11:46   48K
201007-midar-iff.routers.bz2        2021-12-14 11:46   13M
201104-midar-iff-asnames.dict       2021-12-14 11:49  6.2K
201104-midar-iff-asnames.json.bz2   2021-12-14 11:46  2.9M
201104-midar-iff-asnames.re         2021-12-14 11:49   58K
201104-midar-iff.routers.bz2        2021-12-14 11:46   17M
201110-midar-iff-asnames.dict       2021-12-14 11:46  6.7K
201110-midar-iff-asnames.json.bz2   2021-12-14 11:49  3.1M
201110-midar-iff-asnames.re         2021-12-14 11:49   59K
201110-midar-iff.routers.bz2        2021-12-14 11:46   17M
201207-midar-iff-asnames.dict       2021-12-14 11:46  7.2K
201207-midar-iff-asnames.json.bz2   2021-12-14 11:46  2.9M
201207-midar-iff-asnames.re         2021-12-14 11:46   57K
201207-midar-iff.routers.bz2        2021-12-14 11:49   18M
201304-midar-iff-asnames.dict       2021-12-14 11:49  7.4K
201304-midar-iff-asnames.json.bz2   2021-12-14 11:49  3.0M
201304-midar-iff-asnames.re         2021-12-14 11:49   59K
201304-midar-iff.routers.bz2        2021-12-14 11:49   20M
201307-midar-iff-asnames.dict       2021-12-14 11:46  7.4K
201307-midar-iff-asnames.json.bz2   2021-12-14 11:46  3.7M
201307-midar-iff-asnames.re         2021-12-14 11:46   57K
201307-midar-iff.routers.bz2        2021-12-14 11:49   20M
201404-midar-iff-asnames.dict       2021-12-14 11:46  8.4K
201404-midar-iff-asnames.json.bz2   2021-12-14 11:46  3.8M
201404-midar-iff-asnames.re         2021-12-14 11:46   67K
201404-midar-iff.routers.bz2        2021-12-14 11:49   21M
201412-midar-iff-asnames.dict       2021-12-14 11:49  9.2K
201412-midar-iff-asnames.json.bz2   2021-12-14 11:49  2.9M
201412-midar-iff-asnames.re         2021-12-14 11:46   70K
201412-midar-iff.routers.bz2        2021-12-14 11:46   21M
201508-midar-iff-asnames.dict       2021-12-14 11:46   10K
201508-midar-iff-asnames.json.bz2   2021-12-14 11:46  2.9M
201508-midar-iff-asnames.re         2021-12-14 11:46   65K
201508-midar-iff.routers.bz2        2021-12-14 11:46   21M
201603-midar-iff-asnames.dict       2021-12-14 11:46   11K
201603-midar-iff-asnames.json.bz2   2021-12-14 11:46  2.4M
201603-midar-iff-asnames.re         2021-12-14 11:46   63K
201603-midar-iff.routers.bz2        2021-12-14 11:46   22M
201609-midar-iff-asnames.dict       2021-12-14 11:46   12K
201609-midar-iff-asnames.json.bz2   2021-12-14 11:46  2.7M
201609-midar-iff-asnames.re         2021-12-14 11:46   69K
201609-midar-iff.routers.bz2        2021-12-14 11:49   24M
201702-midar-iff-asnames.dict       2021-12-14 11:49   12K
201702-midar-iff-asnames.json.bz2   2021-12-14 11:46  2.3M
201702-midar-iff-asnames.re         2021-12-14 11:49   75K
201702-midar-iff.routers.bz2        2021-12-14 11:46   23M
201708-midar-iff-asnames.dict       2021-12-14 11:46   15K
201708-midar-iff-asnames.json.bz2   2021-12-14 11:46  2.9M
201708-midar-iff-asnames.re         2021-12-14 11:46   77K
201708-midar-iff-rtaa-asnames.dict  2021-12-14 11:49   13K
201708-midar-iff-rtaa-asnames.re    2021-12-14 11:46   82K
201708-midar-iff.routers.bz2        2021-12-14 11:46   24M
201708-speedtrap-asnames.dict       2021-12-14 11:46  2.6K
201708-speedtrap-asnames.json.bz2   2021-12-14 11:49  209K
201708-speedtrap-asnames.re         2021-12-14 11:46   10K
201708-speedtrap.routers.bz2        2021-12-14 11:46   30M
201803-midar-iff-asnames.dict       2021-12-14 11:46   16K
201803-midar-iff-asnames.json.bz2   2021-12-14 11:46  2.9M
201803-midar-iff-asnames.re         2021-12-14 11:46   72K
201803-midar-iff-rtaa-asnames.dict  2021-12-14 11:46   13K
201803-midar-iff-rtaa-asnames.re    2021-12-14 11:46   71K
201803-midar-iff.routers.bz2        2021-12-14 11:46   25M
201901-midar-iff-asnames.dict       2021-12-14 11:46   13K
201901-midar-iff-asnames.json.bz2   2021-12-14 11:46  2.0M
201901-midar-iff-asnames.re         2021-12-14 11:46   48K
201901-midar-iff-rtaa-asnames.dict  2021-12-14 11:46   10K
201901-midar-iff-rtaa-asnames.re    2021-12-14 11:46   49K
201901-midar-iff.routers.bz2        2021-12-14 11:46   21M
201901-speedtrap-asnames.dict       2021-12-14 11:46  3.9K
201901-speedtrap-asnames.json.bz2   2021-12-14 11:49  234K
201901-speedtrap-asnames.re         2021-12-14 11:46   12K
201901-speedtrap.routers.bz2        2021-12-14 11:46  5.7M
201904-midar-iff-asnames.dict       2021-12-14 11:46   16K
201904-midar-iff-asnames.json.bz2   2021-12-14 11:46  2.2M
201904-midar-iff-asnames.re         2021-12-14 11:46   62K
201904-midar-iff-rtaa-asnames.dict  2021-12-14 11:49   14K
201904-midar-iff-rtaa-asnames.re    2021-12-14 11:46   68K
201904-midar-iff.routers.bz2        2021-12-14 11:46   24M
202001-midar-iff-asnames.dict       2021-12-14 11:46   18K
202001-midar-iff-asnames.json.bz2   2021-12-14 11:46  3.3M
202001-midar-iff-asnames.re         2021-12-14 11:46   67K
202001-midar-iff-rtaa-asnames.dict  2021-12-14 11:46   14K
202001-midar-iff-rtaa-asnames.re    2021-12-14 11:46   73K
202001-midar-iff.routers.bz2        2021-12-14 11:46   25M
202008-midar-iff-asnames.dict       2021-12-14 11:46   17K
202008-midar-iff-asnames.json.bz2   2021-12-14 11:49  4.0M
202008-midar-iff-asnames.re         2021-12-14 11:46   71K
202008-midar-iff-rtaa-asnames.dict  2021-12-14 11:46   13K
202008-midar-iff-rtaa-asnames.re    2021-12-14 11:49   71K
202008-midar-iff.routers.bz2        2021-12-14 11:46   25M
202011-speedtrap-asnames.dict       2021-12-14 11:49  4.7K
202011-speedtrap-asnames.json.bz2   2021-12-14 11:46  244K
202011-speedtrap-asnames.re         2021-12-14 11:46   13K
202011-speedtrap.routers.bz2        2021-12-14 11:49  7.2M
202103-midar-iff-asnames.dict       2021-12-14 11:46   16K
202103-midar-iff-asnames.json.bz2   2021-12-14 11:46  2.4M
202103-midar-iff-asnames.re         2021-12-14 11:46   64K
202103-midar-iff-rtaa-asnames.dict  2021-12-14 11:46   12K
202103-midar-iff-rtaa-asnames.re    2021-12-14 11:49   60K
202103-midar-iff.routers.bz2        2021-12-14 11:46   24M
202103-speedtrap-asnames.dict       2021-12-14 11:46  4.6K
202103-speedtrap-asnames.json.bz2   2021-12-14 11:46  233K
202103-speedtrap-asnames.re         2021-12-14 11:46   13K
202103-speedtrap.routers.bz2        2021-12-14 11:46  6.8M
README.txt                          2021-12-14 11:46  2.0K
md5.md5                             2022-07-12 13:51  6.6K
web/                                2021-12-14 11:47     -

This public dataset contains the data used to train our system to
learn regular expressions that extract AS names from router hostnames,
as well as the product of using our system.

 + *-midar-iff.routers files contain IPv4 routers inferred using
   MIDAR and Mercator, annotated with node IDs, ASNs, and hostnames.
 + *-speedtrap.routers files contain IPv6 routers inferred using
   Speedtrap, annotated with node IDs, ASNs, and hostnames.
 + *-asnames.re files contain the naming conventions inferred for each
   suffix that seems to embed an AS name, derived from the routers
   file identified in the filename.
 + *-asnames.dict files contain mappings from names to ASNs for
   each set of training data, derived from the routers file
   identified in the filename.
 + *-asnames.json files contain JSON-formatted output obtained from
   the rules.

The routers and JSON files are distributed bzip2-compressed, with a
.bz2 extension.
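
The naming patterns above can be matched mechanically.  The following
is a minimal sketch, not part of the dataset tooling: the function
name classify is hypothetical, and the trailing wildcards only allow
for the .bz2 extension used in this directory.

```shell
# Hypothetical helper: describe a dataset file by its name, following
# the patterns listed in this README.
classify() {
    case "$1" in
        *-midar-iff.routers*) echo "IPv4 routers (MIDAR and Mercator)" ;;
        *-speedtrap.routers*) echo "IPv6 routers (Speedtrap)" ;;
        *-asnames.re)         echo "inferred naming conventions" ;;
        *-asnames.dict)       echo "name-to-ASN mappings" ;;
        *-asnames.json*)      echo "JSON-formatted output" ;;
        *)                    echo "other" ;;
    esac
}

# Example: label a few of the files from the listing.
for f in 202103-midar-iff.routers.bz2 202103-speedtrap-asnames.re md5.md5; do
    printf '%s: %s\n' "$f" "$(classify "$f")"
done
```

Files that share a date and method prefix, such as 202103-midar-iff,
belong to the same training set.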

If you use this data supplement, you are required to cite:

 M. Luckie, A. Marder, B. Huffaker, and k. claffy.
 Learning Regexes to Extract Network Names from Hostnames.
 Proc. Asian Internet Engineering Conference (AINTEC).
 December 2021.

You are also required to cite the ITDK, from which this data is
derived.  The instructions for citing the ITDK are included at:

 http://data.caida.org/datasets/topology/ark/ipv4/

The data is designed to be used with sc_hoiho, which is included
as part of scamper:

 https://www.caida.org/tools/measurement/scamper/

To reproduce the inferred regular expressions included in this
dataset release, build sc_hoiho by passing --with-sc_hoiho and either
--with-pcre or --with-pcre2 to configure.  When building sc_hoiho,
ensure the pcre (or pcre2) headers and libraries are in the paths
your compiler searches.  For example:

CFLAGS='-I/usr/local/include' LDFLAGS='-L/usr/local/lib' ./configure \
 --with-sc_hoiho --with-pcre2

and then run:

sc_hoiho -O learnasnames -d best-regex public_suffix_list.dat <training-set>.routers
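
The routers files in this directory are bzip2-compressed, while the
command above names a plain .routers file.  The sketch below only
prints the steps for one training set rather than running them.  It
assumes the file is decompressed first (this README does not say
whether that is required) and that public_suffix_list.dat is the
Public Suffix List file from publicsuffix.org.  The function name
hoiho_steps is hypothetical.

```shell
# Hypothetical dry-run helper: print, but do not execute, the commands
# for learning regexes from one compressed training set.
hoiho_steps() {
    archive="$1"                # e.g. 202103-midar-iff.routers.bz2
    routers="${archive%.bz2}"   # name after decompression
    # bunzip2 -k keeps the original .bz2 alongside the output
    printf 'bunzip2 -k %s\n' "$archive"
    printf 'sc_hoiho -O learnasnames -d best-regex public_suffix_list.dat %s\n' "$routers"
}

hoiho_steps 202103-midar-iff.routers.bz2
```

Printing the commands first makes it easy to check the file names
before starting a run.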

Other sc_hoiho options are documented in its manual page:

 https://www.caida.org/tools/measurement/scamper/man/sc_hoiho.1.pdf