haproxy

mirror of http://git.haproxy.org/git/haproxy.git synced 2026-02-26 21:18:04 +02:00

Author	SHA1	Message	Date
Christopher Faulet	783db96ccb	MEDIUM: htx: Refactor transfer of htx blocks to merge DATA blocks if possible When htx_xfer_blks() is called and when HTX DATA blocks are copied, we now merge them in one block. This will help all payload analysis, if any.	2026-02-18 13:26:21 +01:00
Christopher Faulet	a8887e55a0	BUG/MEDIUM: htx: Fix function used to change part of a block value when defrag htx_replace_blk_value() is buggy when a defrag is performed. It only happens on data expension. But in that case, because a defragmentation is performed, the blocks data are moved and old data of the updated block are no longer accessible. To fix the bug, we now use a chunk to temporarily copy the new data of the block. This way we can safely perform the HTX defragmentation and then recopy the data from the chunk to the HTX message. It is theorically possible to hit this bug but concretly it is pretty hard. This patch should be backported to all stable versions.	2026-02-18 13:26:21 +01:00
Christopher Faulet	e62e8de5a7	MEDIUM: stream: Offer buffers of default size only Check the channels buffers size on release before trying to offer it to waiting entities. Only normal buffers must be considered. This will be mandatory when the large buffers support on channels will be added.	2026-02-18 13:26:21 +01:00
Christopher Faulet	bc586b4138	MINOR: h1-htx: Disable 0-copy for buffers of different size When a message payload is parsed, it is possible to swap buffers. We must only take care both buffers have same size. It will be mandatory when the large buffers support on channels will be added.	2026-02-18 13:26:21 +01:00
Christopher Faulet	8b27dfdfb0	MEDIUM: applet: Disable 0-copy for buffers of different size Just like the previous commit, we must take care to never swap buffers of different size when data are exchanged between an applet and a SC. it will be mandatory when the large buffers support on channels will be added.	2026-02-18 13:26:21 +01:00
Christopher Faulet	36282ae348	MEDIUM: mux-h1/mux-h2/mux-fcgi/h3: Disable 0-copy for buffers of different size Today, it is useless to check the buffers size before performing a 0-copy in muxes when data are sent, but it will be mandatory when the large buffers support on channels will be added. Indeed, muxes will still rely on normal buffers, so we must take care to never swap buffers of different size.	2026-02-18 13:26:21 +01:00
Christopher Faulet	f82ace414b	MEDIUM: compression: Be sure to never compress more than a chunk at once When the compression is performed, a trash chunk is used. So be sure to never compression more data than the trash size. Otherwise the commression could fail. Today, this cannot happen. But with the large buffers support on channels, it could be an issue. Note that this part should be reviewed to evaluate if we should use a larger chunk too to perform the compression, maybe via an option.	2026-02-18 13:26:21 +01:00
Christopher Faulet	53b7150357	MEDIUM: stream: Limit number of synchronous send per stream wakeup It is not a bug fix, because there is no way to hit the issue for now. But there is nothing preventing a loop of synchronous sends in process_stream(). Indead, when a synchronous send is successfully performed, we restart the SCs evaluation and at the end another synchronous send is attempted. So with an endpoint consuming data bit by bit or with a filter fowarding few bytes at each call, it is possible to loop for a while in process_stream(). Because it is not expected, we now limit the number of synchronous send per wakeup to two calls. In a nominal case, it should never be more. This commit is mandatory to be able to handle large buffers on channels There is no reason to backport this commit except if the large buffers support on channels are backported.	2026-02-18 13:26:21 +01:00
Christopher Faulet	5965a6e1d2	MEDIUM: cache: Don't rely on a chunk to store messages payload When the response payload is stored in the cache, we can avoid to use a trash chunk as temporary space area before copying everything in the cache in one call. Instead we can directly write each HTX block in the cache, one by one. It should not be an issue because, most of time, there is only one DATA block. This commit depends on "BUG/MEDIUM: shctx: Use the next block when data exactly filled a block".	2026-02-18 13:26:20 +01:00
Christopher Faulet	9ad9def126	BUG/MINOR: config: Check buffer pool creation for failures The call to init_buffer() during the worker startup may fail. In that case, an error message is displayed but the error was not properly handled. So let's add the proper check and exit on error.	2026-02-18 13:26:20 +01:00
Christopher Faulet	fae478dae5	MINOR: buffers: Swap buffers of same size only In b_xfer(), we now take care to swap buffers of the same size only. For now, it is always the case. But that will change.	2026-02-18 13:26:20 +01:00
Christopher Faulet	6bf450b7fe	MINOR: tree-wide: Use the buffer size instead of global setting when possible At many places, we rely on global.tune.bufsize value instead of using the buffer size. For now, it is not a problem. But if we want to be able to deal with buffers of different sizes, it is good to reduce as far as possible dependencies on the global value. most of time, we can use b_size() or c_size() functions. The main change is performed on the error snapshot where the buffer size was added into the error_snapshot structure.	2026-02-18 13:26:20 +01:00
David Carlier	fc89ff76c7	BUG/MEDIUM: jwe: fix timing side-channel and dead code in JWE decryption Fix two issues in JWE token processing: - Replace memcmp() with CRYPTO_memcmp() for authentication tag verification in build_and_check_tag() to prevent timing side-channel attacks. Also add a tag length validation check before the comparison to avoid potential buffer over-read when the decoded tag length doesn't match the expected HMAC half. - Remove unreachable break statement after JWE_ALG_A256GCMKW case in decrypt_cek_aesgcmkw().	2026-02-18 10:46:32 +01:00
Christopher Faulet	0aca25f725	BUG/MINOR: http-ana: Stop to wait for body on client error/abort During the message analysis, we must take care to stop wait for the message body if an error was reported on client side or an abort was detected with abort-on-close configured (by default now). The bug was introduced when the "wait-for-body" action was added. Only the producer state was tested. So, when we were waiting for the request payload, there was no issue. But when we were waiting for the response payload, error or abort on client side was not considered. This patch should be backported to all stable versions.	2026-02-18 09:44:16 +01:00
Christopher Faulet	9e17087aeb	BUG/MEDIUM: shctx: Use the next block when data exactly filled a block When the hot list was removed in 3.0, a regression was introduced. Theorically, it is possible to override data in a block when new data are appended. It happens when data are copied. If the data size is a multiple of the block size, all data are copied and the last used block is full. But instead of saving a reference on the next block as the restart point for the next copies, we keep a reference on the last full one. On the next read, we reuse this block and old data are crushed. To hit the bug, no new blocks should be reserved between the two data copy attempts. Concretely, for now, it seems not possible to hit the bug. But with a block size set to 1024, if more than 1024 bytes are reseved, with a first copy of 1024 bytes and a second one with remaining data, data in the first block will be crushed. So to fix the bug, the reference of the last block used to write data (which is in fact the next one to use to perform the next copy) is only updated when a block is full. In that case the next block is used. This patch should be backported as far as 3.0 after a period of observation.	2026-02-18 09:44:15 +01:00
Christopher Faulet	b248b1c021	CLEANUP: compression: Remove unused static buffers Since the legacy HTTP code was removed, the global and thread-local buffers, tmpbuf and zbuf, are no longer used. So let's removed them. This could be backported, theorically to all supported versions. But at least it could be good to do so as far as 3.2 as it saves 2 buffers per-thread.	2026-02-18 09:44:15 +01:00
Christopher Faulet	829002d459	MINOR: flt-trace: Add an option to limit the amount of data forwarded "max-fwd <max>" option can now be used to limit the maximum amount of data forwarded at a time by the filter. It could be handy to make tests.	2026-02-18 09:44:15 +01:00
Christopher Faulet	92307b5fec	BUG/MINOR: flt-trace: Properly compute length of the first DATA block This bug is quite old. When the length of the first DATA block is computed, the offset is used instead of the block length minus the offset. It is only used with random forwarding and there is a test just after to prevent any issue, so there is no effect. It could be backported to all stable versions.	2026-02-18 09:44:15 +01:00
Christopher Faulet	1b7843f1c1	BUG/MEDIUM: applet: Fix test on shut flags for legacy applets (v2) The previous fix was wrong. When shut flags are tested for legacy applets, to know if the I/O handler can be called or not, we must be sure shut for reads and for writes are both set to skip the applet I/O handler. This bug introduced regression, at least for the peer applet and for the DNS applet. This patch must be backported with `abc1947e1` ("BUG/MEDIUM: applet: Fix test on shut flags for legacy applets"), so as far as 3.0.	2026-02-18 09:44:15 +01:00
Christopher Faulet	8e0c2599b6	BUG/MEDIUM: mux-h1: Stop sending vi fast-forward for unexpected states If a producer tries to send data via the fast-forward mechanism while the message is in an unexpected state from the consumer point of view, the fast-forward is now disabled. Concretely, we now take care that the message is in its data/tunnel stage to proceed in h1_nego_ff(). By disabling fast-forward in that case, we will automatically fall back on the regular sending path and be able to handle the error in h1_snd_buf(). This patch should be backported as far as 3.0	2026-02-18 09:44:15 +01:00
Christopher Faulet	cda056b9f4	BUG/MEDIUM: mux-h2/quic: Stop sending via fast-forward if stream is closed If is illegal to send data if the stream is already closed. The case is properly handled when data are sent via snd_buf(), by draining the data. But it was still possible to process these data via nego_ff(). So, in this patch, both for the H2 and QUIC multiplexers, the fast-forward is disabled if the stream is closed and nothing is performed. Doing so, we will automatically fall back on the regular sending path and be able to drain data in snd_buf(). Thanks to Mike Walker for his investigation on the subject. This patch should be backported as far as 3.0.	2026-02-18 09:44:09 +01:00
Amaury Denoyelle	18a78956cb	MINOR: mux-quic: add BUG_ON_STRESS() when draining data on closed stream Add a BUG_ON_STRESS() to be able to detect if data draining is performed due to early stream closure.	2026-02-17 18:18:44 +01:00
Amaury Denoyelle	4c275c7d17	BUG/MEDIUM: h3: reject frontend CONNECT as currently not implemented HTTP/3 CONNECT transcoding is not properly implemented on the frontend side. Neither tunnel mode of application nor extended connect are currently functional. Clarify this situation by rejecting any CONNETC attempts on the frontend side. The stream is thus now closed via a RESET_STREAM with error code REQUEST_REJECTED. This should be backported to every stable versions.	2026-02-17 18:18:44 +01:00
Amaury Denoyelle	f3003d1508	BUG/MAJOR: Revert "MEDIUM: mux-quic: add BUG_ON if sending on locally closed QCS" This reverts commit `235e8f1afd`. Prior to the above commit, snd_buf callback for QUIC MUX was able to deal with data even after stream closure. The excess was simply discarded, as no STREAM frame can be emitted after FIN/RESET_STREAM. This code was later removed and replaced by a BUG_ON() to ensure snd_buf is never called after stream closure. However, this approach is too strict. Indeed, there is nothing in the haproxy stream architecture which forbids this scheduling, in part because QUIC MUX is the sole responsible of the stream closure. As such, it is preferable to revert to the old code to prevent any triggering of a BUG_ON() failure. Note that nego_ff does not implement data draining if called after stream closure. This will be done in a future patch. Thanks to Mike Walker for his investigation on the subject. This must be backported up to 2.8.	2026-02-17 18:18:44 +01:00
Aurelien DARRAGON	747ff09818	MEDIUM: backend: make "balance random" consider tg local req rate when loads are equal This is a follow up to `b6bdb2553` ("MEDIUM: backend: make "balance random" consider req rate when loads are equal") In the above patch, we used the global sess_per_sec metric to choose which server we should be using. But the original intent was to use the per thread group statistic. No backport needed, the previous patch already improved the situation in 3.3, so let's not take the risk of breaking that.	2026-02-17 09:51:46 +01:00
William Lallemand	1274c21a42	BUG/MINOR: ssl: error with ssl-f-use when no "crt" ssl-f-use lines tries to load a crt file, but the "crt" keyword is not mandatory. That could lead to crtlist_load_crt() being called with a NULL path, and trying to do a stat. In this particular case we don't need to try anything and it's better to leave with an actual error. Must be backported as far as 3.2.	2026-02-16 18:41:40 +01:00
William Lallemand	0016d45a9c	BUG/MINOR: ssl: clarify ssl-f-use errors in post-section parsing crtlist_load_crt() in post_section_frontend_crt_init() won't give details about the line being parsed, this should be done by the caller. Modify post_section_frontend_crt_init() to ouput the right error format. Must be backported to 3.2.	2026-02-16 18:41:08 +01:00
William Lallemand	e0d1cdff6a	BUG/MINOR: ssl: fix leak in ssl-f-use parser upon error cfg_crt_node->filename is leaked on the error path in the ssl-f-use configuration parser. Could be backported as far as 3.2	2026-02-16 16:04:35 +01:00
William Lallemand	86df0e206e	BUG/MINOR: ssl: double-free on error path w/ ssl-f-use parser In post_section_frontend_crt_init(), the crt_entry is populated by the ssl_conf fromt the cfg_crt_node. On error path, the crt_list is completely freed, including the ssl_conf structure. But the ssl_conf structure was already freed when freeing the cfg_crt_node. Fix the issue by doing a crtlist_dup_ssl_conf(n->ssl_conf) in the crtlist_entry instead of an assignation. Fix issue #3268. Need to be backported as far as 3.2. The previous patch which adds the crtlist_dup_ssl_conf() declaration is needed.	2026-02-16 16:04:35 +01:00
Aurelien DARRAGON	d71e2e73ea	MEDIUM: filters: use per-channel filter list when relevant In the historical implementation, all filter related information where stored at the stream level (using struct strm_flt * context), and filters iteration was performed at the stream level also. We identified that this was not ideal and would make the implementation of future filters more complex since filters ordering should be handled in a different order during request and response handling for decompression for instance. To make such thing possible, in this commit we migrate some channel specific filter contexts in the channel directly (request or response), and we implement 2 additional filter lists, one on the request channel and another on the response channel. The historical stream filter list is kept as-is because in some contexts only the stream is available and we have to iterate on all filters. But for functions where we only are interested in request side or response side filters, we now use dedicated channel filters list instead. The only overhead is that the "struct filter" was expanded by two "struct list". For now, no change of behavior is expected.	2026-02-13 12:24:13 +01:00
Aurelien DARRAGON	bb6cfbe754	MINOR: filters: rework filter iteration for channel related callback functions Multiple channel related functions have the same construction: they use list_for_each_entry() to work on a given filter from the stream+channel combination. In future commits we will try to use filter list from dedicated channel list instead of the stream one, thus in this patch we need as a prerequisite to implement and use the flt_list_{start,next} API to iterate over filter list, giving the API the responsibility to iterate over the correct list depending on the context, while the calling function remains free to use the iteration construction it needs. This way we will be able to easily change the way we iterate over filter list without duplicating the code for requests and responses.	2026-02-13 12:24:07 +01:00
Aurelien DARRAGON	e88b219331	MINOR: filters: rework RESUME_FILTER_* macros as inline functions There is no need to have those helpers defined as macro, and since it is not mandatory, code maintenance is much easier using functions, thus let's switch to function definitions. Also, we change the way we iterate over the list so that the calling function now has a pseudo API to get and iterate over filter pointers while keeping control on how they implement the iterating logic. One benefit of this is that we will also be able to switch between lists depending on the channel type, which is a prerequisite for upcoming rework that split the filter list over request and response channels (commit will follow) No change of behavior is expected.	2026-02-13 12:24:00 +01:00
William Lallemand	d13164e105	MINOR: startup: show the list of detected features at runtime with haproxy -vv Features prefixed by "HAVE_WORKING_" in the haproxy -vv feature list, are features that are detected during runtime. This patch splits these features on another line in haproxy -vv. This line is named "Detected feature list".	2026-02-12 18:02:19 +01:00
William Lallemand	b90b312a50	MINOR: startup: sort the feature list in haproxy -vv The feature list in haproxy -vv is partly generated from the Makefile using the USE_* keywords, but it's also possible to add keywords in the feature list using hap_register_feature(), which adds the keyword at the end of list. When doing so, the list is not correctly sorted anymore. This patch fixes the problem by splitting the string using an array of ist and applying a qsort() on it.	2026-02-12 18:02:19 +01:00
William Lallemand	1592ed9854	MINOR: startup: Add HAVE_WORKING_TCP_MD5SIG in haproxy -vv the TCP_MD5SIG ifdef is not enough to check if the feature is usable. The code might compile but the OS could prevent to use it. This patch tries to use the TCP_MD5SIG setsockopt before adding HAVE_WORKING_TCP_MD5SIG in the feature list. so it would prevent to start reg-tests if the OS can't run it.	2026-02-12 18:02:19 +01:00
Remi Tricot-Le Breton	aad212954f	MINOR: jwt: Add new jwt_decrypt_jwk converter This converter takes a private key in the JWK format (RFC7517) that can be provided as a string of via a variable. The only keys managed for now are of type 'RSA' or 'oct'.	2026-02-12 16:31:27 +01:00
Remi Tricot-Le Breton	b26f0cc45a	MINOR: jwt: Convert an RSA JWK into an EVP_PKEY Add helper functions that take a JWK (JSON representation of an RSA private key) into an EVP_PKEY (containing the private key). Those functions are not used yet, they will be used in the upcoming 'jwt_decrypt_jwk' converter.	2026-02-12 16:31:12 +01:00
Remi Tricot-Le Breton	b3a44158fb	MINOR: ssl: Missing '\n' in error message Fix missing '\n' in error message raised when trying to load a password protected private key.	2026-02-12 16:29:01 +01:00
Amaury Denoyelle	8e16fd2cf1	BUG/MAJOR: quic: fix parsing frame type QUIC frame type is encoded as a varint. Initially, haproxy parsed it as a single byte, which was enough to cover frames defined in RFC9000. The code has been extended recently to support multi-bytes encoded value, in anticipation of QUIC frames extension support. However, there was no check on the varint format. This is interpreted erroneously as a PADDING frame as this serves as the initial value. Thus the rest of the packet is incorrectly handled, with various resulting effects, including infinite loops and/or crashes. This patch fixes this by checking the return value of quic_dec_int(). If varint cannot be parsed, the connection is immediately closed. This issue is assigned to CVE-2026-26080 report. This must be backported up to 3.2. Reported-by: Asim Viladi Oglu Manizada <manizada@pm.me>	2026-02-12 09:09:44 +01:00
Amaury Denoyelle	4aa974f949	BUG/MAJOR: quic: reject invalid token Token parsing code on INITIAL packet for the NEW_TOKEN format is not robust enough and may even crash on some rare malformed packets. This patch fixes this by adding a check on the expected length of the received token. The packet is now rejected if the token does not match QUIC_TOKEN_LEN. This check is legitimate as haproxy should only parse tokens emitted by itself. This issue has been introduced with the implementation of NEW_TOKEN tokens parsing required for 0-RTT support. This issue is assigned to CVE-2026-26081 report. This must be backported up to 3.0. Reported-by: Asim Viladi Oglu Manizada <manizada@pm.me>	2026-02-12 09:09:44 +01:00
Amaury Denoyelle	d80f0143c9	BUG/MINOR: quic: ensure handshake speed up is only run once per conn When a duplicated CRYPTO frame is received during handshake, a server may consider that there was a packet loss and immediately retransmit its pending CRYPTO data without having to wait for PTO expiration. However, RFC 9002 indicates that this should only be performed at most once per connection to avoid excessive packet transmission. QUIC connection is flagged with QUIC_FL_CONN_HANDSHAKE_SPEED_UP to mark that a fast retransmit has been performed. However, during the refactoring on CRYPTO handling with the storage conversion from ncbuf to ncbmbuf, the check on the flag was accidentely removed. The faulty patch is the following one : commit `f50425c021` MINOR: quic: remove received CRYPTO temporary tree storage This patch adds again the check on QUIC_FL_CONN_HANDSHAKE_SPEED_UP before initiating fast retransmit. This ensures this is only performed once per connection. This must be backported up to 3.3.	2026-02-12 09:09:44 +01:00
Olivier Houchard	b65df062be	MINOR: servers: Call process_srv_queue() without lock when possible In server_warmup(), call process_srv_queue() only once we released the server lock, as we don't need it.	2026-02-12 02:19:38 +01:00
Olivier Houchard	a8f50cff7e	MINOR: queues: Check minconn first in srv_dynamic_maxconn() In srv_dynamic_maxconn(), we'll decide that the max number of connection is the server's maxconn if 1) the proxy's number of connection is over fullconn, or if minconn was not set. Check if minconn is not set first, as it will be true most of the time, and as the proxy's "beconn" variable is in a busy cache line, it can be costly to access it, while minconn/maxconn is in a cache line that should very rarely change.	2026-02-12 02:18:59 +01:00
William Lallemand	ea92b0ef01	BUG/MINOR: ssl: SSL_CERT_DIR environment variable doesn't affect haproxy The documentation of @system-ca specifies that one can overwrite the value provided by the SSL Library using SSL_CERT_DIR. However it seems like X509_get_default_cert_dir() is not affected by this environment variable, and X509_get_default_cert_dir_env() need to be used in order to get the variable name, and get the value manually. This could be backported in every stable branches. Note that older branches don't have the memprintf in ssl_sock.c.	2026-02-10 21:34:45 +01:00
William Lallemand	2ac0d12790	MINOR: startup: Add the SSL lib verify directory in haproxy -vv SSL libraries built manually might lack the right X509_get_default_cert_dir() value. The common way to fix the problem is to build openssl with ./configure --openssldir=/etc/ssl/ In order to verify this setting, output it with haproxy -vv.	2026-02-10 21:06:38 +01:00
Willy Tarreau	c724693b95	MINOR: activity: allow to switch per-task lock/memory profiling at runtime Given that we already have "set profiling task", it's easy to permit to enable/disable the lock and/or memory profiling at run time. However, the change will only be applied next time the task profiling will be switched from off/auto to on. The patch is very minor and is best viewed with git show -b because it indents a whole block that moves in a "if" clause. This can be backported to 3.3 along with the two previous patches.	2026-02-10 17:53:01 +01:00
Willy Tarreau	e2631ee5f7	MEDIUM: activity: apply and use new finegrained task profiling settings In continuity of previous patch, this one makes use of the new profiling flags. For this, based on the global "profiling" setting, when switching profiling on, we set or clear two flags on the thread context, TH_FL_TASK_PROFILING_L and TH_FL_TASK_PROFILING_M to indicate whether lock profiling and/or malloc profiling are desired when profiling is enabled. These flags are checked along with TH_FL_TASK_PROFILING to decide when to collect time around a lock or a malloc. And by default we're back to the behavior of 3.2 in that neither lock nor malloc times are collected anymore. This is sufficient to see the CPU usage spent in the VDSO to significantly drop from 22% to 2.2% on a highly loaded system. This should be backported to 3.3 along with the previous patch.	2026-02-10 17:52:59 +01:00
Willy Tarreau	a7b2353cb3	MINOR: activity: support setting/clearing lock/memory watching for task profiling Damien Claisse reported in issue #3257 a performance regression between 3.2 and 3.3 when task profiling is enabled, more precisely in relation with the following patches were merged: `98cc815e3e` ("MINOR: activity: collect time spent with a lock held for each task") `503084643f` ("MINOR: activity: collect time spent waiting on a lock for each task") `9d8c2a888b` ("MINOR: activity: collect CPU time spent on memory allocations for each task") The issue mostly comes from the first patches. What happens is that the local time is taken when entering and leaving each lock, which costs a lot on a contended system. The problem here is the lack of finegrained settings for lock and malloc profiling. This patch introduces a better approach. The task profiler goes back to its default behavior in on/auto modes, but the configuration now accepts new extra options "lock", "no-lock", "memory", "no-memory" to precisely indicate other timers to watch for each task when profiling turns on. This is achieved by setting two new flags HA_PROF_TASKS_LOCK and HA_PROF_TASKS_MEM in the global "profiling" variable. This patch only parses the new values and assigns them to the global variable from the config file for now. The doc was updated.	2026-02-10 17:47:02 +01:00
Willy Tarreau	3b45beb465	CLEANUP: haproxy: fix bad line wrapping in run_poll_loop() Commit `3674afe8a0` ("BUG/MEDIUM: threads: Atomically set TH_FL_SLEEPING and clr FL_NOTIFIED") accidentally left a strange-looking line wrapping making one think of an editing mistake, let's fix it and keep it on a single line given that even indented wrapping is almost as large. This can be backported with the fix above till 2.8 to keep the patch context consistent between versions.	2026-02-10 14:11:42 +01:00
Willy Tarreau	64c5d45a26	BUG/MEDIUM: lb-chash: always properly initialize lb_nodes with dynamic servers An issue was introduced in 3.0 with commit `faa8c3e024` ("MEDIUM: lb-chash: Deterministic node hashes based on server address"): the new server_key field and lb_nodes entries initialization were not updated for servers added at run time with "add server": server_key remains zero and the key used in lb_node remains the one depending only on the server's ID. This will cause trouble when adding new servers with consistent hashing, because the hash-key will be ignored until the server's weight changes and the key difference is detected, leading to its recalculation. This is essentially caused by the poorly placed lb_nodes initialization that is specific to lb-chash and had to be replicated in the code dealing with server addition. This commit solves the problem by adding a new ->server_init() function in the lbprm proxy struct, that is called by the server addition code. This also allows to abandon the complex check for LB algos that was placed there for that purpose. For now only lb-chash provides such a function, and calls it as well during initial setup. This way newly added servers always use the correct key now. While it should also theoretically have had an impact on servers added with the "random" algorithm, it's unlikely that the difference between proper server keys and those based on their ID could have had any visible effect. This patch should be backported as far as 3.0. The backport may be eased by a preliminary backport of previous commit "CLEANUP: lb-chash: free lb_nodes from chash's deinit(), not global", though this is not strictly necessary if context is manually adjusted.	2026-02-10 07:22:54 +01:00

1 2 3 4 5 ...

20592 Commits