Frequently Asked Questions

Known Issues

Why are my Grafana panels returning truncated/partial data?
Why do queries that use math on several selector functions return more than one point?
What should I do if I see the panic: unexpected fault address xxxxxxxxxxxxxx?

Log Errors

Why am I seeing a 503 Service Unavailable error in my meta node logs?
Why am I seeing a 409 error in some of my data node logs?
Why am I seeing hinted handoff queue not empty errors in my data node logs?
Why am I seeing error writing count stats ...: partial write errors in my data node logs?
Why am I seeing queue is full errors in my data node logs?
Why am I seeing unable to determine if "hostname" is a meta node when I try to add a meta node with influxd-ctl join?
Why am I getting a Basic Authentication pop-up window from my InfluxEnterprise Web Console?

Other

How do I make a web console user an admin web console user?

Why are my Grafana panels returning truncated/partial data?

In InfluxEnterprise versions 1.2.0-1.2.2, the system sets the max-row-limit configuration option to 10,000 by default. That option limits the number of rows returned per query to 10,000 rows. If a query in Grafana exceeds that 10,000 row limit, the panel appears to show truncated data. To prevent that issue, set max-row-limit to 0 to allow an unlimited number of returned rows.

This issue is fixed in version 1.2.5. In version 1.2.5, the configuration file sets max-row-limit to 0 by default.

Why do queries that use math on several selector functions return more than one point?

In InfluxEnterprise versions prior to 1.2.0, queries that use math on several selector functions return one point with the epoch 0 (1970-01-01T00:00:00Z) timestamp. In versions 1.2.0-1.2.2, those queries return N points, where N is the number of unique timestamps returned by the individual selector functions. As a workaround, use InfluxQL’s subqueries in versions 1.2.0-1.2.2 to replicate the query behavior in versions prior to 1.2.0.

This issue is fixed in version 1.2.5.

Example

The queries below perform multiplication on two selector functions.

Behavior in versions prior to 1.2.0:

> SELECT MIN("avocado")*2,MAX("avocado")*2 FROM "mycart"

name: mycart
time  min  max
----  ---  ---
0     24   46

The query returns a single point with the epoch 0 timestamp.

Behavior in versions 1.2.0-1.2.2:

> SELECT MIN("avocado")*2,MAX("avocado")*2 FROM "mycart"

name: mycart
time                 min  max
----                 ---  ---
1490113486589201368  24
1490113497387418180       46

The query returns two points; one point for each timestamp returned by the individual selector functions.

Workaround for versions 1.2.0-1.2.2:

> SELECT "min"*2,"max"*2 FROM (SELECT MIN("avocado"),MAX("avocado") FROM "mycart")

name: mycart
time  min  max
----  ---  ---
0     24   46

The workaround uses InfluxQL’s subqueries to replicate the query behavior in versions prior to 1.2.0.

What should I do if I see the panic: `unexpected fault address xxxxxxxxxxxxxx`?

In InfluxEnterprise versions 1.2.0-1.2.2, there is a known issue where the data node process stops and reports the panic unexpected fault address xxxxxxxxxxxxxx in the logs. If you experience this panic please restart the data node process. We are working to address this issue; see GitHub Issue #8022 for additional information.

This issue is fixed in version 1.2.5.

Where can I find InfluxEnterprise logs?

On systemd operating systems service logs can be accessed using the journalctl command.

Meta: journalctl -u influxdb-meta

Data : journalctl -u influxdb

Enterprise console: journalctl -u influx-enterprise

The journalctl output can be redirected to print the logs to a text file. With systemd, log retention depends on the system’s journald settings.

Why am I seeing a `503 Service Unavailable` error in my meta node logs?

This is the expected behavior if you haven’t joined the meta node to the cluster. The 503 errors should stop showing up in the logs once you join the meta node to the cluster.

Why am I seeing a `409` error in some of my data node logs?

When you create a Continuous Query (CQ) on your cluster every data node will ask for the CQ lease. Only one data node can accept the lease. That data node will have a 200 in its logs. All other data nodes will be denied the lease and have a 409 in their logs. This is the expected behavior.

Log output for a data node that is denied the lease:

[meta-http] 2016/09/19 09:08:53 172.31.4.132 - - [19/Sep/2016:09:08:53 +0000] GET /lease?name=continuous_querier&node_id=5 HTTP/1.2 409 105 - InfluxDB Meta Client b00e4943-7e48-11e6-86a6-000000000000 380.542µs

Log output for the data node that accepts the lease:

[meta-http] 2016/09/19 09:08:54 172.31.12.27 - - [19/Sep/2016:09:08:54 +0000] GET /lease?name=continuous_querier&node_id=0 HTTP/1.2 200 105 - InfluxDB Meta Client b05a3861-7e48-11e6-86a7-000000000000 8.87547ms

Why am I seeing `hinted handoff queue not empty` errors in my data node logs?

[write] 2016/10/18 10:35:21 write failed for shard 2382 on node 4: hinted handoff queue not empty

This error is informational only and does not necessarily indicate a problem in the cluster. It indicates that the node handling the write request currently has data in its local hinted handoff queue for the destination node. Coordinating nodes will not attempt direct writes to other nodes until the hinted handoff queue for the destination node has fully drained. New data is instead appended to the hinted handoff queue. This helps data arrive in chronological order for consistency of graphs and alerts and also prevents unnecessary failed connection attempts between the data nodes. Until the hinted handoff queue is empty this message will continue to display in the logs. Monitor the size of the hinted handoff queues with ls -lRh /var/lib/influxdb/hh to ensure that they are decreasing in size.

Note that for some write consistency settings, InfluxDB may return a write error (500) for the write attempt, even if the points are successfully queued in hinted handoff. Some write clients may attempt to resend those points, leading to duplicate points being added to the hinted handoff queue and lengthening the time it takes for the queue to drain. If the queues are not draining, consider temporarily downgrading the write consistency setting, or pause retries on the write clients until the hinted handoff queues fully drain.

Why am I seeing `error writing count stats ...: partial write` errors in my data node logs?

[stats] 2016/10/18 10:35:21 error writing count stats for FOO_grafana: partial write

The _internal database collects per-node and also cluster-wide information about the InfluxEnterprise cluster. The cluster metrics are replicated to other nodes using consistency=all. For a write consistency of all, InfluxDB returns a write error (500) for the write attempt even if the points are successfully queued in hinted handoff. Thus, if there are points still in hinted handoff, the _internal writes will fail the consistency check and log the error, even though the data is in the durable hinted handoff queue and should eventually persist.

Why am I seeing `queue is full` errors in my data node logs?

This error indicates that the coordinating node that received the write cannot add the incoming write to the hinted handoff queue for the destination node because it would exceed the maximum size of the queue. This error typically indicates a catastrophic condition for the cluster - one data node may have been offline or unable to accept writes for an extended duration.

The controlling configuration settings are in the [hinted-handoff] section of the file. max-size is the total size in bytes per hinted handoff queue. When max-size is exceeded, all new writes for that node are rejected until the queue drops below max-size. max-age is the maximum length of time a point will persist in the queue. Once this limit has been reached, points expire from the queue. The age is calculated from the write time of the point, not the timestamp of the point.

Why am I seeing `unable to determine if "hostname" is a meta node` when I try to add a meta node with `influxd-ctl join`?

Meta nodes use the /status endpoint to determine the current state of another metanode. A healthy meta node that is ready to join the cluster will respond with a 200 HTTP response code and a JSON string with the following format (assuming the default ports):

"nodeType":"meta","leader":"","httpAddr":"<hostname>:8091","raftAddr":"<hostname>:8089","peers":null}

If you are getting an error message while attempting to influxd-ctl join a new meta node, it means that the JSON string returned from the /status endpoint is incorrect. This generally indicates that the meta node configuration file is incomplete or incorrect. Inspect the HTTP response with curl -v "http://<hostname>:8091/status" and make sure that the hostname, the bind-address, and the http-bind-address are correctly populated. Also check the license-key or license-path in the configuration file of the meta nodes. Finally, make sure that you specify the http-bind-address port in the join command, e.g. influxd-ctl join hostname:8091.

Why am I getting a Basic Authentication pop-up window from my InfluxEnterprise Web Console?

The InfluxEnterprise Web Console will create a popup requesting Authentication credentials when the shared-secret configured under the [influxdb] section in the influx-enterprise.conf Web Console configuration file does not match with the shared-secret configured under the [http] section in all data node influxdb.conf configuration files. All data nodes and the InfluxEnteprise Web Console must share the same passphrase.

How do I make a web console user an admin web console user?

Web console users can be admin users or non-admin users. In addition to having access to the web console, admin users are able to invite users, manage web console users, manage cluster accounts, and edit cluster names.

By default, new web console users are non-admin users. To make a web console user an admin user, visit the Users page located in the WEB ADMIN section in the sidebar and click on the name of the relevant user. In the Account Details section, select the checkbox next to Admin and click Update User.

Web Console Admin User