所有集成 / Kafka 和 PostgreSQL 集成

Kafka 和 PostgreSQL 集成

强大的性能和简单的集成，由 InfluxData 构建的开源数据连接器 Telegraf 提供支持。

免费获取此集成免费获取此集成

50 亿+

Telegraf 下载量

时间序列数据库
来源：DB Engines

10 亿+

InfluxDB 下载量

2,800+

贡献者

实用链接

文档

Telegraf 快速入门培训

使用 Telegraf 进行基础设施监控

强大的性能，无限的扩展能力

收集、组织和处理海量高速数据。当您将任何数据视为时间序列数据时，它都更有价值。使用 InfluxDB，排名第一的时间序列平台，旨在与 Telegraf 一起扩展。

查看入门方法

输入和输出集成概述

此插件允许您从 Kafka 主题实时收集指标，从而增强 Telegraf 设置中的数据监控和收集功能。

Telegraf PostgreSQL 插件允许您高效地将指标写入 PostgreSQL 数据库，同时自动管理数据库模式。

集成详情

Kafka

Kafka Telegraf 插件旨在从 Kafka 主题读取数据，并使用支持的输入数据格式创建指标。作为服务输入插件，它持续监听传入的指标和事件，这与以固定间隔运行的标准输入插件不同。此特定插件可以使用各种 Kafka 版本的功能，并且能够从指定主题消费消息，应用 SASL 等安全凭证配置，以及使用消息偏移量和消费者组选项管理消息处理。此插件的灵活性使其能够处理各种消息格式和用例，使其成为依赖 Kafka 进行数据摄取的应用程序的宝贵资产。

PostgreSQL

PostgreSQL 插件使用户能够将指标写入 PostgreSQL 数据库或兼容数据库，通过自动更新缺失的列，为模式管理提供强大的支持。该插件旨在促进与监控解决方案的集成，允许用户高效地存储和管理时间序列数据。它为连接设置、并发性和错误处理提供可配置选项，并支持高级功能，例如用于标签和字段的 JSONB 存储、外键标记、模板化模式修改以及通过 pguint 扩展支持无符号整数数据类型。

配置

Kafka


[[inputs.kafka_consumer]]
              ## Kafka brokers.
              brokers = ["localhost:9092"]

              ## Set the minimal supported Kafka version. Should be a string contains
              ## 4 digits in case if it is 0 version and 3 digits for versions starting
              ## from 1.0.0 separated by dot. This setting enables the use of new
              ## Kafka features and APIs.  Must be 0.10.2.0(used as default) or greater.
              ## Please, check the list of supported versions at
              ## https://pkg.go.dev/github.com/Shopify/sarama#SupportedVersions
              ##   ex: kafka_version = "2.6.0"
              ##   ex: kafka_version = "0.10.2.0"
              # kafka_version = "0.10.2.0"

              ## Topics to consume.
              topics = ["telegraf"]

              ## Topic regular expressions to consume.  Matches will be added to topics.
              ## Example: topic_regexps = [ "*test", "metric[0-9A-z]*" ]
              # topic_regexps = [ ]

              ## When set this tag will be added to all metrics with the topic as the value.
              # topic_tag = ""

              ## The list of Kafka message headers that should be pass as metric tags
              ## works only for Kafka version 0.11+, on lower versions the message headers
              ## are not available
              # msg_headers_as_tags = []

              ## The name of kafka message header which value should override the metric name.
              ## In case when the same header specified in current option and in msg_headers_as_tags
              ## option, it will be excluded from the msg_headers_as_tags list.
              # msg_header_as_metric_name = ""

              ## Set metric(s) timestamp using the given source.
              ## Available options are:
              ##   metric -- do not modify the metric timestamp
              ##   inner  -- use the inner message timestamp (Kafka v0.10+)
              ##   outer  -- use the outer (compressed) block timestamp (Kafka v0.10+)
              # timestamp_source = "metric"

              ## Optional Client id
              # client_id = "Telegraf"

              ## Optional TLS Config
              # enable_tls = false
              # tls_ca = "/etc/telegraf/ca.pem"
              # tls_cert = "/etc/telegraf/cert.pem"
              # tls_key = "/etc/telegraf/key.pem"
              ## Use TLS but skip chain & host verification
              # insecure_skip_verify = false

              ## Period between keep alive probes.
              ## Defaults to the OS configuration if not specified or zero.
              # keep_alive_period = "15s"

              ## SASL authentication credentials.  These settings should typically be used
              ## with TLS encryption enabled
              # sasl_username = "kafka"
              # sasl_password = "secret"

              ## Optional SASL:
              ## one of: OAUTHBEARER, PLAIN, SCRAM-SHA-256, SCRAM-SHA-512, GSSAPI
              ## (defaults to PLAIN)
              # sasl_mechanism = ""

              ## used if sasl_mechanism is GSSAPI
              # sasl_gssapi_service_name = ""
              # ## One of: KRB5_USER_AUTH and KRB5_KEYTAB_AUTH
              # sasl_gssapi_auth_type = "KRB5_USER_AUTH"
              # sasl_gssapi_kerberos_config_path = "/"
              # sasl_gssapi_realm = "realm"
              # sasl_gssapi_key_tab_path = ""
              # sasl_gssapi_disable_pafxfast = false

              ## used if sasl_mechanism is OAUTHBEARER
              # sasl_access_token = ""

              ## SASL protocol version.  When connecting to Azure EventHub set to 0.
              # sasl_version = 1

              # Disable Kafka metadata full fetch
              # metadata_full = false

              ## Name of the consumer group.
              # consumer_group = "telegraf_metrics_consumers"

              ## Compression codec represents the various compression codecs recognized by
              ## Kafka in messages.
              ##  0 : None
              ##  1 : Gzip
              ##  2 : Snappy
              ##  3 : LZ4
              ##  4 : ZSTD
              # compression_codec = 0
              ## Initial offset position; one of "oldest" or "newest".
              # offset = "oldest"

              ## Consumer group partition assignment strategy; one of "range", "roundrobin" or "sticky".
              # balance_strategy = "range"

              ## Maximum number of retries for metadata operations including
              ## connecting. Sets Sarama library's Metadata.Retry.Max config value. If 0 or
              ## unset, use the Sarama default of 3,
              # metadata_retry_max = 0

              ## Type of retry backoff. Valid options: "constant", "exponential"
              # metadata_retry_type = "constant"

              ## Amount of time to wait before retrying. When metadata_retry_type is
              ## "constant", each retry is delayed this amount. When "exponential", the
              ## first retry is delayed this amount, and subsequent delays are doubled. If 0
              ## or unset, use the Sarama default of 250 ms
              # metadata_retry_backoff = 0

              ## Maximum amount of time to wait before retrying when metadata_retry_type is
              ## "exponential". Ignored for other retry types. If 0, there is no backoff
              ## limit.
              # metadata_retry_max_duration = 0

              ## When set to true, this turns each bootstrap broker address into a set of
              ## IPs, then does a reverse lookup on each one to get its canonical hostname.
              ## This list of hostnames then replaces the original address list.
              ## resolve_canonical_bootstrap_servers_only = false

              ## Strategy for making connection to kafka brokers. Valid options: "startup",
              ## "defer". If set to "defer" the plugin is allowed to start before making a
              ## connection. This is useful if the broker may be down when telegraf is
              ## started, but if there are any typos in the broker setting, they will cause
              ## connection failures without warning at startup
              # connection_strategy = "startup"

              ## Maximum length of a message to consume, in bytes (default 0/unlimited);
              ## larger messages are dropped
              max_message_len = 1000000

              ## Max undelivered messages
              ## This plugin uses tracking metrics, which ensure messages are read to
              ## outputs before acknowledging them to the original broker to ensure data
              ## is not lost. This option sets the maximum messages to read from the
              ## broker that have not been written by an output.
              ##
              ## This value needs to be picked with awareness of the agent's
              ## metric_batch_size value as well. Setting max undelivered messages too high
              ## can result in a constant stream of data batches to the output. While
              ## setting it too low may never flush the broker's messages.
              # max_undelivered_messages = 1000

              ## Maximum amount of time the consumer should take to process messages. If
              ## the debug log prints messages from sarama about 'abandoning subscription
              ## to [topic] because consuming was taking too long', increase this value to
              ## longer than the time taken by the output plugin(s).
              ##
              ## Note that the effective timeout could be between 'max_processing_time' and
              ## '2 * max_processing_time'.
              # max_processing_time = "100ms"

              ## The default number of message bytes to fetch from the broker in each
              ## request (default 1MB). This should be larger than the majority of
              ## your messages, or else the consumer will spend a lot of time
              ## negotiating sizes and not actually consuming. Similar to the JVM's
              ## `fetch.message.max.bytes`.
              # consumer_fetch_default = "1MB"

              ## Data format to consume.
              ## Each data format has its own unique set of configuration options, read
              ## more about them here:
              ## https://github.com/influxdata/telegraf/blob/master/docs/DATA_FORMATS_INPUT.md
              data_format = "influx"

PostgreSQL

# Publishes metrics to a postgresql database
[[outputs.postgresql]]
  ## Specify connection address via the standard libpq connection string:
  ##   host=... user=... password=... sslmode=... dbname=...
  ## Or a URL:
  ##   postgres://[user[:password]]@localhost[/dbname]?sslmode=[disable|verify-ca|verify-full]
  ## See https://postgresql.ac.cn/docs/current/libpq-connect.html#LIBPQ-CONNSTRING
  ##
  ## All connection parameters are optional. Environment vars are also supported.
  ## e.g. PGPASSWORD, PGHOST, PGUSER, PGDATABASE
  ## All supported vars can be found here:
  ##  https://postgresql.ac.cn/docs/current/libpq-envars.html
  ##
  ## Non-standard parameters:
  ##   pool_max_conns (default: 1) - Maximum size of connection pool for parallel (per-batch per-table) inserts.
  ##   pool_min_conns (default: 0) - Minimum size of connection pool.
  ##   pool_max_conn_lifetime (default: 0s) - Maximum age of a connection before closing.
  ##   pool_max_conn_idle_time (default: 0s) - Maximum idle time of a connection before closing.
  ##   pool_health_check_period (default: 0s) - Duration between health checks on idle connections.
  # connection = ""

  ## Postgres schema to use.
  # schema = "public"

  ## Store tags as foreign keys in the metrics table. Default is false.
  # tags_as_foreign_keys = false

  ## Suffix to append to table name (measurement name) for the foreign tag table.
  # tag_table_suffix = "_tag"

  ## Deny inserting metrics if the foreign tag can't be inserted.
  # foreign_tag_constraint = false

  ## Store all tags as a JSONB object in a single 'tags' column.
  # tags_as_jsonb = false

  ## Store all fields as a JSONB object in a single 'fields' column.
  # fields_as_jsonb = false

  ## Name of the timestamp column
  ## NOTE: Some tools (e.g. Grafana) require the default name so be careful!
  # timestamp_column_name = "time"

  ## Type of the timestamp column
  ## Currently, "timestamp without time zone" and "timestamp with time zone"
  ## are supported
  # timestamp_column_type = "timestamp without time zone"

  ## Templated statements to execute when creating a new table.
  # create_templates = [
  #   '''CREATE TABLE {{ .table }} ({{ .columns }})''',
  # ]

  ## Templated statements to execute when adding columns to a table.
  ## Set to an empty list to disable. Points containing tags for which there is no column will be skipped. Points
  ## containing fields for which there is no column will have the field omitted.
  # add_column_templates = [
  #   '''ALTER TABLE {{ .table }} ADD COLUMN IF NOT EXISTS {{ .columns|join ", ADD COLUMN IF NOT EXISTS " }}''',
  # ]

  ## Templated statements to execute when creating a new tag table.
  # tag_table_create_templates = [
  #   '''CREATE TABLE {{ .table }} ({{ .columns }}, PRIMARY KEY (tag_id))''',
  # ]

  ## Templated statements to execute when adding columns to a tag table.
  ## Set to an empty list to disable. Points containing tags for which there is no column will be skipped.
  # tag_table_add_column_templates = [
  #   '''ALTER TABLE {{ .table }} ADD COLUMN IF NOT EXISTS {{ .columns|join ", ADD COLUMN IF NOT EXISTS " }}''',
  # ]

  ## The postgres data type to use for storing unsigned 64-bit integer values (Postgres does not have a native
  ## unsigned 64-bit integer type).
  ## The value can be one of:
  ##   numeric - Uses the PostgreSQL "numeric" data type.
  ##   uint8 - Requires pguint extension (https://github.com/petere/pguint)
  # uint64_type = "numeric"

  ## When using pool_max_conns>1, and a temporary error occurs, the query is retried with an incremental backoff. This
  ## controls the maximum backoff duration.
  # retry_max_backoff = "15s"

  ## Approximate number of tag IDs to store in in-memory cache (when using tags_as_foreign_keys).
  ## This is an optimization to skip inserting known tag IDs.
  ## Each entry consumes approximately 34 bytes of memory.
  # tag_cache_size = 100000

  ## Enable & set the log level for the Postgres driver.
  # log_level = "warn" # trace, debug, info, warn, error, none

输入和输出集成示例

Kafka

实时数据处理：使用 Kafka 插件将来自 Kafka 主题的实时数据馈送到监控系统。这对于需要即时反馈性能指标或用户活动的应用尤其有用，使企业能够更快地对其环境中的变化条件做出反应。
动态指标收集：利用此插件根据 Kafka 中发生的事件动态调整正在捕获的指标。例如，通过与其他服务集成，用户可以让插件即时重新配置自身，确保始终根据业务或应用程序的需求收集相关指标。
集中式日志记录和监控：使用 Kafka Consumer Plugin 实施集中式日志记录系统，以将来自多个服务的日志聚合到统一的监控仪表板中。此设置可以帮助识别不同服务之间的问题，并提高整体系统可观察性和故障排除能力。
异常检测系统：将 Kafka 与机器学习算法结合用于实时异常检测。通过不断分析流数据，此设置可以自动识别异常模式，触发警报并更有效地缓解潜在问题。

PostgreSQL

使用复杂查询进行实时分析：利用 PostgreSQL 插件将来自各种来源的指标存储在 PostgreSQL 数据库中，从而使用复杂查询实现实时分析。此设置可以帮助数据科学家和分析师发现模式和趋势，因为他们可以在利用 PostgreSQL 强大的查询优化功能的同时，跨多个表操作关系数据。具体来说，用户可以使用跨不同指标表的 JOIN 操作创建复杂的报告，从而揭示通常隐藏在嵌入式系统中的见解。
与 TimescaleDB 集成以处理时间序列数据：在 TimescaleDB 实例中使用 PostgreSQL 插件，以高效处理和分析时间序列数据。通过实施超表，用户可以在时间维度上实现更高的性能和主题分区。此集成允许用户对大量时间序列数据运行分析查询，同时保留 PostgreSQL SQL 查询的全部功能，从而确保指标分析的可靠性和效率。
数据版本控制和历史分析：实施使用 PostgreSQL 插件的策略，以维护指标在不同时间的不同版本。用户可以设置不可变数据表结构，其中保留旧版本的表，从而轻松实现历史分析。这种方法不仅可以深入了解数据演变，还有助于遵守数据保留策略，确保数据集的历史完整性保持不变。
用于不断发展的指标的动态模式管理：使用插件的模板功能创建动态变化的模式，以响应指标变化。此用例允许组织在指标发展时调整其数据结构，添加必要的字段并确保遵守数据完整性策略。通过利用模板化的 SQL 命令，用户无需手动干预即可扩展其数据库，从而促进敏捷数据管理实践。

反馈

感谢您成为我们社区的一份子！如果您有任何一般性反馈或在这些页面上发现任何错误，我们欢迎并鼓励您提出意见。请在 InfluxDB 社区 Slack 中提交您的反馈。

实用链接

文档

Telegraf 快速入门培训

使用 Telegraf 进行基础设施监控

强大的性能，无限的扩展能力

查看入门方法

Kafka 和 PostgreSQL 集成

目录

实用链接

强大的性能，无限的扩展能力

输入和输出集成概述

集成详情

配置

输入和输出集成示例

反馈

实用链接

强大的性能，无限的扩展能力

相关集成

标题

标题

标题

相关集成

HTTP 和 InfluxDB 集成

Kafka 和 InfluxDB 集成

Kinesis 和 InfluxDB 集成

立即开始构建

产品与解决方案

开发者

公司

Kafka 和 PostgreSQL 集成

目录

实用链接

强大的性能，无限的扩展能力

输入和输出集成概述

集成详情

配置

输入和输出集成示例

反馈

实用链接

强大的性能，无限的扩展能力

相关集成

标题

标题

标题

相关集成

HTTP 和 InfluxDB 集成

Kafka 和 InfluxDB 集成

Kinesis 和 InfluxDB 集成

立即开始构建

产品与解决方案

开发者

公司

注册 InfluxData 新闻简报

关注我们