winton_kafka_streams package¶
Subpackages¶
- winton_kafka_streams.processor package
- Subpackages
- Submodules
- winton_kafka_streams.processor.extract_timestamp module
- winton_kafka_streams.processor.processor module
- winton_kafka_streams.processor.processor_context module
- winton_kafka_streams.processor.topology module
- winton_kafka_streams.processor.wallclock_timestamp module
- Module contents
- winton_kafka_streams.state package
Submodules¶
winton_kafka_streams.kafka_config module¶
Configuration values that may be set to control behaviour of Winton Kafka Streams
Configuration may either be set inline in your application using:
import kafka_config kafka_config.BOOTSTRAP_SERVERS = ‘localhost:9092’
or as a file in java properties format. The property names are identical to those used in the Java implementation for ease of sharing between both.
External files can be loaded using:
import kafka_config kafka_config.read_local_config(‘path/to/kafka.streams.config’)
-
winton_kafka_streams.kafka_config.APPLICATION_ID= 'wkstream.application.id'¶ The replication factor for changelog topics and repartition topics created by the application Default: 1 Importance: Low
-
winton_kafka_streams.kafka_config.APPLICATION_SERVER= ''¶ The maximum number of records to buffer per partition Default: 1000 Importance: Low
-
winton_kafka_streams.kafka_config.AUTO_OFFSET_RESET= 'earliest'¶ Consumer Default: false, see Consumer Auto Commit Importance: low
-
winton_kafka_streams.kafka_config.BOOTSTRAP_SERVERS= 'localhost:9092'¶ An identifier for the stream processing application. Must be unique within the Kafka cluster. It is used as:
- the default client-id prefix
- the group-id for membership management
- the changelog topic prefix.
-
winton_kafka_streams.kafka_config.BUFFERED_RECORDS_PER_PARTITION= 1000¶ An id string to pass to the server when making requests. (This setting is passed to the consumer/producer clients used internally by Winton Kafka Streams.) Default: “” Importance: Low
-
winton_kafka_streams.kafka_config.CACHE_MAX_BYTES_BUFFERING= 10485760¶ The number of standby replicas for each task Default: 0 Importance: Medium
-
winton_kafka_streams.kafka_config.CLIENT_ID= ''¶ The frequency with which to save the position (offsets in source topics) of tasks Default: 30000 (millisecs) Importance: Low
-
winton_kafka_streams.kafka_config.COMMIT_INTERVAL_MS= 30000¶ A list of classes to use as metrics reporters Default: [] Importance: Low
-
winton_kafka_streams.kafka_config.ENABLE_AUTO_COMMIT= 'false'¶ Consumer Default: Integer.MAX_VALUE Importance: low
-
winton_kafka_streams.kafka_config.LINGER_MS= 100¶ Producer Default: 10 Importance: low
-
winton_kafka_streams.kafka_config.MAX_POLL_INTERVAL_MS= 9223372036854775807¶ Consumer Default: 1000 Importance: low
-
winton_kafka_streams.kafka_config.MAX_POLL_RECORDS= 1000¶ Default serializer/deserializer class for record values, implements the Serde interface (see also key.serdes) Default: winton_kafka_streams.processor.serialization.serdes.BytesSerde Importance: Medium
-
winton_kafka_streams.kafka_config.METRICS_NUM_SAMPLES= 2¶ The highest recording level for metrics. Default: info Importance: Low
-
winton_kafka_streams.kafka_config.METRICS_RECORDING_LEVEL= 'info'¶ The window of time a metrics sample is computed over. Default: 30000 (millisecs) Importance: Low
-
winton_kafka_streams.kafka_config.METRICS_SAMPLE_WINDOW_MS= 30000¶ Partition grouper class that implements the PartitionGrouper interface Defatult: see Partition Grouper Importance: Low
-
winton_kafka_streams.kafka_config.METRIC_REPORTERS= []¶ The number of samples maintained to compute metrics. Default: 2 Importance: Low
-
winton_kafka_streams.kafka_config.NUM_STANDBY_REPLICAS= 0¶ The number of threads to execute stream processing Default: 1 Importance: Medium
-
winton_kafka_streams.kafka_config.NUM_STREAM_THREADS= 1¶ Timestamp extractor class that implements the TimestampExtractor interface Default: see Timestamp Extractor Importance: Medium
-
winton_kafka_streams.kafka_config.PARITION_GROUPER= None¶ The amount of time in milliseconds to block waiting for input Default: 100 (millisecs) Importance: Low
-
winton_kafka_streams.kafka_config.POLL_MS= 100¶ The amount of time in milliseconds to wait before deleting state when a partition has migrated Default: 60000 (millisecs) Importance: Low
-
winton_kafka_streams.kafka_config.REPLICATION_FACTOR= 1¶ Directory location for state stores Default: /var/lib/kafka-streams Importance: Low
-
winton_kafka_streams.kafka_config.RETRIES= 10¶ Consumer Default: earliest Importance: low
-
winton_kafka_streams.kafka_config.STATE_CLEANUP_DELAY_MS= 60000¶ Added to a windows maintainMs to ensure data is not deleted from the log prematurely. Allows for clock drift. Default: 86400000 (millisecons) = 1 day Importance: Low
-
winton_kafka_streams.kafka_config.STATE_DIR= '/var/lib/kafka-streams'¶ Maximum number of memory bytes to be used for record caches across all threads Default: 10485760 (bytes) Importance: Medium
-
winton_kafka_streams.kafka_config.TIMESTAMP_EXTRACTOR= None¶ A host:port pair pointing to an embedded user defined endpoint that can be used for discovering the locations of state stores within a single Winton Kafka Streams application. The value of this must be different for each instance of the application. Default “” Importance: Low
-
winton_kafka_streams.kafka_config.VALUE_SERDE= 'winton_kafka_streams.processor.serialization.serdes.bytes_serde.BytesSerde'¶ Default serializer/deserializer class for record keys, implements the Serde interface (see also value.serdes) Default: winton_kafka_streams.processor.serialization.serdes.BytesSerde Importance: Medium
-
winton_kafka_streams.kafka_config.WINDOWSTORE_CHANGELOG_ADDITIONAL_RETENTION_MS= 86000000¶ linger.ms (low) Producer Default: 100 Importance: low
-
winton_kafka_streams.kafka_config.log= <Logger winton_kafka_streams.kafka_config (WARNING)>¶ A list of host/port pairs to use for establishing the initial connection to the Kafka cluster
-
winton_kafka_streams.kafka_config.read_local_config(config_file)¶