RecordLinkageConfig#

class RecordLinkageConfig#

RecordLinkageConfig class with record field definitions and various tunings of the Record-Linkage algorithm and heuristics.

get_fields_names(self: pyhelayers.RecordLinkageConfig) list#

Get the fields names in order

get_name_field_name(self: pyhelayers.RecordLinkageConfig) str#

Get the name of the name field (used by some specific name related heuristics).

get_num_bands(self: pyhelayers.RecordLinkageConfig) int#

Get the number of bands used by the min-hash algorithm

get_size_bands(self: pyhelayers.RecordLinkageConfig) int#

Get the number of hashes per band used by the min-hash algorithm

get_verbosity(self: pyhelayers.RecordLinkageConfig) pyhelayers.Verbosity#

Get the Verbosity level.

is_fully_initialized(self: pyhelayers.RecordLinkageConfig) bool#

Returns true if all required fields are set

set_num_bands_and_size_bands(self: pyhelayers.RecordLinkageConfig, num_bands: int, size_bands: int) None#

Set the number of bands and the number of hashes per band, used by the min-hash algorithm.

Parameters:
  • num_bands – The number of bands to use.

  • size_bands – The number of hashes per band.

set_records_fields(self: pyhelayers.RecordLinkageConfig, fields_names: list, name_field_name: str) None#

Set the field names.

Parameters:
  • fields_names – Both players CSV files contain these fields by the order given here.

  • name_field_name – The name of the name field used by some specific name related heuristics.

set_verbosity(self: pyhelayers.RecordLinkageConfig, verbosity: pyhelayers.Verbosity) None#

Set the verbosity level

Parameters:

verbosity (Verbosity) – The desired verbosity level