Index of /datasets/supplement/2021-aintec-hoiho
Name Last modified Size Description
Parent Directory -
201007-midar-iff-asnames.dict 2021-12-14 11:46 4.9K
201007-midar-iff-asnames.json.bz2 2021-12-14 11:46 2.3M
201007-midar-iff-asnames.re 2021-12-14 11:46 48K
201007-midar-iff.routers.bz2 2021-12-14 11:46 13M
201104-midar-iff-asnames.dict 2021-12-14 11:49 6.2K
201104-midar-iff-asnames.json.bz2 2021-12-14 11:46 2.9M
201104-midar-iff-asnames.re 2021-12-14 11:49 58K
201104-midar-iff.routers.bz2 2021-12-14 11:46 17M
201110-midar-iff-asnames.dict 2021-12-14 11:46 6.7K
201110-midar-iff-asnames.json.bz2 2021-12-14 11:49 3.1M
201110-midar-iff-asnames.re 2021-12-14 11:49 59K
201110-midar-iff.routers.bz2 2021-12-14 11:46 17M
201207-midar-iff-asnames.dict 2021-12-14 11:46 7.2K
201207-midar-iff-asnames.json.bz2 2021-12-14 11:46 2.9M
201207-midar-iff-asnames.re 2021-12-14 11:46 57K
201207-midar-iff.routers.bz2 2021-12-14 11:49 18M
201304-midar-iff-asnames.dict 2021-12-14 11:49 7.4K
201304-midar-iff-asnames.json.bz2 2021-12-14 11:49 3.0M
201304-midar-iff-asnames.re 2021-12-14 11:49 59K
201304-midar-iff.routers.bz2 2021-12-14 11:49 20M
201307-midar-iff-asnames.dict 2021-12-14 11:46 7.4K
201307-midar-iff-asnames.json.bz2 2021-12-14 11:46 3.7M
201307-midar-iff-asnames.re 2021-12-14 11:46 57K
201307-midar-iff.routers.bz2 2021-12-14 11:49 20M
201404-midar-iff-asnames.dict 2021-12-14 11:46 8.4K
201404-midar-iff-asnames.json.bz2 2021-12-14 11:46 3.8M
201404-midar-iff-asnames.re 2021-12-14 11:46 67K
201404-midar-iff.routers.bz2 2021-12-14 11:49 21M
201412-midar-iff-asnames.dict 2021-12-14 11:49 9.2K
201412-midar-iff-asnames.json.bz2 2021-12-14 11:49 2.9M
201412-midar-iff-asnames.re 2021-12-14 11:46 70K
201412-midar-iff.routers.bz2 2021-12-14 11:46 21M
201508-midar-iff-asnames.dict 2021-12-14 11:46 10K
201508-midar-iff-asnames.json.bz2 2021-12-14 11:46 2.9M
201508-midar-iff-asnames.re 2021-12-14 11:46 65K
201508-midar-iff.routers.bz2 2021-12-14 11:46 21M
201603-midar-iff-asnames.dict 2021-12-14 11:46 11K
201603-midar-iff-asnames.json.bz2 2021-12-14 11:46 2.4M
201603-midar-iff-asnames.re 2021-12-14 11:46 63K
201603-midar-iff.routers.bz2 2021-12-14 11:46 22M
201609-midar-iff-asnames.dict 2021-12-14 11:46 12K
201609-midar-iff-asnames.json.bz2 2021-12-14 11:46 2.7M
201609-midar-iff-asnames.re 2021-12-14 11:46 69K
201609-midar-iff.routers.bz2 2021-12-14 11:49 24M
201702-midar-iff-asnames.dict 2021-12-14 11:49 12K
201702-midar-iff-asnames.json.bz2 2021-12-14 11:46 2.3M
201702-midar-iff-asnames.re 2021-12-14 11:49 75K
201702-midar-iff.routers.bz2 2021-12-14 11:46 23M
201708-midar-iff-asnames.dict 2021-12-14 11:46 15K
201708-midar-iff-asnames.json.bz2 2021-12-14 11:46 2.9M
201708-midar-iff-asnames.re 2021-12-14 11:46 77K
201708-midar-iff-rtaa-asnames.dict 2021-12-14 11:49 13K
201708-midar-iff-rtaa-asnames.re 2021-12-14 11:46 82K
201708-midar-iff.routers.bz2 2021-12-14 11:46 24M
201708-speedtrap-asnames.dict 2021-12-14 11:46 2.6K
201708-speedtrap-asnames.json.bz2 2021-12-14 11:49 209K
201708-speedtrap-asnames.re 2021-12-14 11:46 10K
201708-speedtrap.routers.bz2 2021-12-14 11:46 30M
201803-midar-iff-asnames.dict 2021-12-14 11:46 16K
201803-midar-iff-asnames.json.bz2 2021-12-14 11:46 2.9M
201803-midar-iff-asnames.re 2021-12-14 11:46 72K
201803-midar-iff-rtaa-asnames.dict 2021-12-14 11:46 13K
201803-midar-iff-rtaa-asnames.re 2021-12-14 11:46 71K
201803-midar-iff.routers.bz2 2021-12-14 11:46 25M
201901-midar-iff-asnames.dict 2021-12-14 11:46 13K
201901-midar-iff-asnames.json.bz2 2021-12-14 11:46 2.0M
201901-midar-iff-asnames.re 2021-12-14 11:46 48K
201901-midar-iff-rtaa-asnames.dict 2021-12-14 11:46 10K
201901-midar-iff-rtaa-asnames.re 2021-12-14 11:46 49K
201901-midar-iff.routers.bz2 2021-12-14 11:46 21M
201901-speedtrap-asnames.dict 2021-12-14 11:46 3.9K
201901-speedtrap-asnames.json.bz2 2021-12-14 11:49 234K
201901-speedtrap-asnames.re 2021-12-14 11:46 12K
201901-speedtrap.routers.bz2 2021-12-14 11:46 5.7M
201904-midar-iff-asnames.dict 2021-12-14 11:46 16K
201904-midar-iff-asnames.json.bz2 2021-12-14 11:46 2.2M
201904-midar-iff-asnames.re 2021-12-14 11:46 62K
201904-midar-iff-rtaa-asnames.dict 2021-12-14 11:49 14K
201904-midar-iff-rtaa-asnames.re 2021-12-14 11:46 68K
201904-midar-iff.routers.bz2 2021-12-14 11:46 24M
202001-midar-iff-asnames.dict 2021-12-14 11:46 18K
202001-midar-iff-asnames.json.bz2 2021-12-14 11:46 3.3M
202001-midar-iff-asnames.re 2021-12-14 11:46 67K
202001-midar-iff-rtaa-asnames.dict 2021-12-14 11:46 14K
202001-midar-iff-rtaa-asnames.re 2021-12-14 11:46 73K
202001-midar-iff.routers.bz2 2021-12-14 11:46 25M
202008-midar-iff-asnames.dict 2021-12-14 11:46 17K
202008-midar-iff-asnames.json.bz2 2021-12-14 11:49 4.0M
202008-midar-iff-asnames.re 2021-12-14 11:46 71K
202008-midar-iff-rtaa-asnames.dict 2021-12-14 11:46 13K
202008-midar-iff-rtaa-asnames.re 2021-12-14 11:49 71K
202008-midar-iff.routers.bz2 2021-12-14 11:46 25M
202011-speedtrap-asnames.dict 2021-12-14 11:49 4.7K
202011-speedtrap-asnames.json.bz2 2021-12-14 11:46 244K
202011-speedtrap-asnames.re 2021-12-14 11:46 13K
202011-speedtrap.routers.bz2 2021-12-14 11:49 7.2M
202103-midar-iff-asnames.dict 2021-12-14 11:46 16K
202103-midar-iff-asnames.json.bz2 2021-12-14 11:46 2.4M
202103-midar-iff-asnames.re 2021-12-14 11:46 64K
202103-midar-iff-rtaa-asnames.dict 2021-12-14 11:46 12K
202103-midar-iff-rtaa-asnames.re 2021-12-14 11:49 60K
202103-midar-iff.routers.bz2 2021-12-14 11:46 24M
202103-speedtrap-asnames.dict 2021-12-14 11:46 4.6K
202103-speedtrap-asnames.json.bz2 2021-12-14 11:46 233K
202103-speedtrap-asnames.re 2021-12-14 11:46 13K
202103-speedtrap.routers.bz2 2021-12-14 11:46 6.8M
README.txt 2021-12-14 11:46 2.0K
md5.md5 2022-07-12 13:51 6.6K
web/ 2021-12-14 11:47 -
This public dataset contains the data used to train our system to
learn regular expressions that extract AS names from router hostnames,
as well as the product of using our system.
+ *-midar-iff.routers files contain IPv4 routers inferred using
MIDAR and Mercator, annotated with node IDs, ASNs, and hostnames.
+ *-speedtrap.routers files contain IPv6 routers inferred using
Speedtrap, annotated with node IDs, ASNs, and hostnames.
+ *-asnames.re files contain naming conventions inferred for each
suffix that seems to embed an AS name, using one of the routers
files identified in the filename,
+ *-asnames.dict files contain mappings from names to ASNs for
each of the sets of training data, using one of the routers files
identified in the filename,
+ *.asnames.json files contain JSON-formatted output obtained from
the rules.
If you use this data supplement, you are required to cite:
M. Luckie, A. Marder, B. Huffaker, and k. claffy.
Learning Regexes to Extract Network Names from Hostnames.
Proc. Asian Internet Engineering Conference (AINTEC).
December 2021.
You are also required to cite the ITDK, from which this data is
derived. The instructions for citing the ITDK are included at:
http://data.caida.org/datasets/topology/ark/ipv4/
The data is designed to be used with sc_hoiho, which is included
as part of scamper:
https://www.caida.org/tools/measurement/scamper/
To obtain the inferred regular expressions which are included in this
dataset release, you will need to build sc_hoiho by passing
--with-sc_hoiho and either --with-pcre or --with-pcre2 to configure.
When building sc_hoiho, ensure pcre (or pcre2) is in the path where
your compiler looks for header files and libraries. For example:
CFLAGS='-I/usr/local/include' LDFLAGS='-L/usr/local/lib' ./configure \
--with-sc_hoiho --with-pcre2
and then run:
sc_hoiho -O learnasnames -d best-regex public_suffix_list.dat <training-set>.routers
Other options to sc_hoiho are documented in the manual page for
sc_hoiho.
https://www.caida.org/tools/measurement/scamper/man/sc_hoiho.1.pdf