Guides
Guides

Data lifecycle in HERE Anonymizer Self-Hosted

The anonymized data processed by the HERE Anonymizer Self-Hosted is stored in the output queue for the RabbitMQ connector and the HERE platform stream layer for the HERE platform stream layer data connector. You can adjust the data retention for these connectors to best suit your use case.

RabbitMQ

RabbitMQ allows you to set TTL (time to live) for both messages and queues. Messages and queues that older than the set TTL are considered expired and deleted. By default, the messages don't expire.

To set a TTL using a policy, add the message-ttl argument to the policy definition. The TTL value is expressed in milliseconds.

For example, to set the message TTL to 60 seconds, run:

rabbitmqctl set_policy TTL ".*" '{"message-ttl":60000}' --apply-to queues

HERE platform stream layer data connector

The HERE platform stream layer data connector uses Kafka. As such, you can configure the retention policy using the following configuration items:

# Set the retention time in hours
log.retention.hours

# Set the retention time in minutes
log.retention.minutes

# Set the retention time in miliseconds
log.retention.ms

Note that the most granular of the configuration values always takes precedence. For example, if you set log.retention.hours to 12 and log.retention.minutes to 30, the system uses the retention time of 30 minutes.

By default, the retention time is set to seven days (168 hours).

Java Garbage Collector

The default (parallel) Java Garbage Collector runs in the application's environment and removes the in-memory raw data on every run.