[BUG](exec) fix coalesce function output null by qzsee · Pull Request #63092 · apache/doris

qzsee · 2026-05-08T13:24:04Z

What problem does this PR solve?

Issue Number: close #xxx

Example: COALESCE(same_department_income_amount, 0) ==> outputs NULL (where same_department_income_amount is of type double).

When assigning the value to the result column in the computation, the assignment is done unconditionally (forced), as in:

result_raw_data[row] +=
                    column_raw_data[row] *
                    typename ColumnType::value_type(!(null_map_data[row] | filled_flag[row]));

If the argument column column_raw_data's null_map[row] is 1, then the value stored in column_raw_data[row] is garbage data. This garbage may contain values such as NaN. If a preceding argument of COALESCE happens to be assigned NaN, then during subsequent assignments we run into cases like:

0 * NaN = NaN
num + NaN = NaN

so the assigned result also becomes NaN, which causes value pollution.

By rights the final output should also be NaN, but what is actually returned is NULL. The reason is that during result serialization/output, NaN values are emitted as NULL.

tatus DataTypeNumberSerDe<T>::_write_column_to_mysql(const IColumn& column,
                                                      MysqlRowBuffer<is_binary_format>& result,
                                                      int row_idx, bool col_const,
                                                      const FormatOptions& options) const {
    //...
    else if constexpr (std::is_same_v<T, float>) {
        if (std::isnan(data[col_index])) {
            // Handle NaN for float, we should push null value
            buf_ret = result.push_null();
        } else {
            buf_ret = result.push_float(data[col_index]);
        }
    } 
  //...
}

Related PR: #xxx

Problem Summary:

Release note

None

Check List (For Author)

Test
- Regression test
- Unit Test
- Manual test (add detailed scripts or steps below)
- No need to test or manual test. Explain why:
  - This is a refactor/code format and no logic has been changed.
  - Previous test can cover this change.
  - No code files have been changed.
  - Other reason
Behavior changed:
- No.
- Yes.
Does this need documentation?
- No.
- Yes.

Check List (For Reviewer who merge this PR)

Confirm the release note
Confirm test cases
Confirm document
Add branch pick label

hello-stephen · 2026-05-08T13:24:09Z

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR.

Please clearly describe your PR:

What problem was fixed (it's best to include specific error reporting information). How it was fixed.
Which behaviors were modified. What was the previous behavior, what is it now, why was it modified, and what possible impacts might there be.
What features were added. Why was this function added?
Which code was refactored and why was this part of the code refactored?
Which functions were optimized and what is the difference before and after the optimization?

zclllyybb · 2026-05-09T06:36:00Z

/review

github-actions

The implementation change is small and directly addresses the reported COALESCE value-pollution path, but this PR currently has no regression or unit test for the bug it fixes. Because this is BE expression correctness code, Doris review standards require a targeted test before merge.

Critical checkpoint conclusions:

Goal/test: The code appears to accomplish the stated goal for scalar non-string COALESCE results by avoiding arithmetic with null-position payloads, but no test proves the reported nullable DOUBLE/NaN case or prevents regression.
Scope: The code change is focused and minimal.
Concurrency/lifecycle: Not applicable; this is per-block expression evaluation with no new shared state or lifecycle management.
Configuration/compatibility/protocol: Not applicable; no config, persisted format, or FE-BE protocol changes.
Parallel paths: String/array/map/struct/JSON and bitmap paths already use guarded insertion/assignment; the changed scalar path is the relevant divergent path.
Conditions/error handling: No new error path; the new guard is the necessary data-correctness condition.
Tests/results: Missing targeted regression/unit coverage and no expected result update was included.
Observability/transactions/data writes: Not applicable.
Performance: The new branch is in a hot scalar loop but is justified by correctness; no obvious blocking performance issue found.

Focus points: No additional user-provided review focus was supplied.

github-actions · 2026-05-09T06:40:57Z

-                    column_raw_data[row] *
-                    typename ColumnType::value_type(!(null_map_data[row] | filled_flag[row]));
+            if (!(null_map_data[row] | filled_flag[row])) {
+                result_raw_data[row] = column_raw_data[row];


This bug fix needs a targeted regression test. The PR description describes a concrete data-correctness failure where COALESCE(nullable_double, 0) can return NULL after a null-position payload contains NaN, but the PR only changes BE code and does not add a case under the existing conditional function/coalesce tests (or a BE unit test) that fails before this change and passes after it. Please add coverage for the nullable floating-point path so this does not regress.

yiguolei · 2026-05-09T08:19:47Z

-            result_raw_data[row] +=
-                    column_raw_data[row] *
-                    typename ColumnType::value_type(!(null_map_data[row] | filled_flag[row]));
+            if (!(null_map_data[row] | filled_flag[row])) {


add test please and also check the document

HappenLee · 2026-05-11T08:16:41Z

run buildall

hello-stephen · 2026-05-11T08:44:58Z

TPC-H: Total hot run time: 29271 ms

machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 1409eb15ef01acad148594f53309b505c1c3be7e, data reload: false

------ Round 1 ----------------------------------
orders	Doris	NULL	NULL	0	0	0	NULL	0	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	17635	3800	3796	3796
q2	q3	10701	861	597	597
q4	4660	463	347	347
q5	7468	1335	1130	1130
q6	185	168	138	138
q7	915	955	743	743
q8	9292	1373	1297	1297
q9	5564	5361	5259	5259
q10	6294	2064	1832	1832
q11	490	261	247	247
q12	673	421	286	286
q13	18161	3675	2713	2713
q14	285	281	262	262
q15	q16	902	885	781	781
q17	997	987	731	731
q18	6455	5656	5487	5487
q19	1180	1183	1041	1041
q20	509	406	256	256
q21	4782	2302	2019	2019
q22	463	385	309	309
Total cold run time: 97611 ms
Total hot run time: 29271 ms

----- Round 2, with runtime_filter_mode=off -----
orders	Doris	NULL	NULL	150000000	42	6422171781	NULL	22778155	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	4720	4499	4716	4499
q2	q3	4716	4832	4296	4296
q4	2181	2200	1453	1453
q5	5018	5020	5242	5020
q6	192	168	137	137
q7	2099	1835	1600	1600
q8	3362	3095	3094	3094
q9	8444	8560	8464	8464
q10	4483	4533	4267	4267
q11	602	410	386	386
q12	701	743	511	511
q13	3220	3594	2912	2912
q14	313	299	277	277
q15	q16	756	768	682	682
q17	1335	1304	1259	1259
q18	7972	7079	7113	7079
q19	1150	1113	1129	1113
q20	2232	2242	1992	1992
q21	6070	5385	4898	4898
q22	559	477	394	394
Total cold run time: 60125 ms
Total hot run time: 54333 ms

hello-stephen · 2026-05-11T08:55:53Z

TPC-DS: Total hot run time: 170437 ms

machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 1409eb15ef01acad148594f53309b505c1c3be7e, data reload: false

query5	4316	665	514	514
query6	344	221	212	212
query7	4271	575	317	317
query8	345	232	215	215
query9	8833	4015	4005	4005
query10	450	346	297	297
query11	5862	2367	2214	2214
query12	188	141	132	132
query13	1277	622	419	419
query14	6821	5347	5058	5058
query14_1	4364	4356	4371	4356
query15	211	205	182	182
query16	1047	456	465	456
query17	1148	768	648	648
query18	2736	479	366	366
query19	230	210	185	185
query20	139	132	133	132
query21	221	142	119	119
query22	13560	13685	13371	13371
query23	17178	16455	16004	16004
query23_1	16175	16170	16258	16170
query24	7434	1765	1369	1369
query24_1	1341	1373	1363	1363
query25	646	524	476	476
query26	1316	332	175	175
query27	2655	576	336	336
query28	4405	1966	1956	1956
query29	1040	674	540	540
query30	305	235	207	207
query31	1124	1126	927	927
query32	90	76	70	70
query33	551	358	295	295
query34	1193	1105	645	645
query35	747	790	691	691
query36	1346	1371	1147	1147
query37	152	102	88	88
query38	3223	3131	3047	3047
query39	930	934	893	893
query39_1	884	905	866	866
query40	241	156	134	134
query41	69	63	62	62
query42	114	107	107	107
query43	331	326	285	285
query44	
query45	213	207	194	194
query46	1127	1166	728	728
query47	2353	2350	2218	2218
query48	407	414	308	308
query49	635	545	467	467
query50	699	287	215	215
query51	4298	4222	4184	4184
query52	105	104	97	97
query53	247	273	205	205
query54	314	270	251	251
query55	94	95	85	85
query56	300	299	307	299
query57	1440	1403	1339	1339
query58	304	279	276	276
query59	1549	1638	1449	1449
query60	349	344	335	335
query61	164	160	189	160
query62	674	614	563	563
query63	236	202	205	202
query64	2292	822	703	703
query65	
query66	1686	502	398	398
query67	29883	29831	29744	29744
query68	
query69	452	338	300	300
query70	1039	982	956	956
query71	315	273	273	273
query72	2954	2698	2517	2517
query73	866	779	438	438
query74	5052	4910	4729	4729
query75	2766	2660	2313	2313
query76	2311	1114	749	749
query77	416	430	349	349
query78	12967	12901	12410	12410
query79	1478	992	730	730
query80	1372	580	497	497
query81	525	280	239	239
query82	940	156	122	122
query83	354	270	250	250
query84	273	142	113	113
query85	907	506	475	475
query86	463	336	335	335
query87	3468	3345	3205	3205
query88	3493	2643	2614	2614
query89	442	386	346	346
query90	1910	179	176	176
query91	178	167	143	143
query92	82	79	78	78
query93	1143	952	563	563
query94	716	336	292	292
query95	656	476	353	353
query96	1045	758	353	353
query97	2708	2712	2544	2544
query98	242	234	235	234
query99	1104	1135	996	996
Total cold run time: 254750 ms
Total hot run time: 170437 ms

dqz123 · 2026-05-12T02:25:49Z

run buildall

hello-stephen · 2026-05-12T02:53:33Z

TPC-H: Total hot run time: 29699 ms

machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit ac1d83dd1e4f3423d0170575b4a7cd906b1456f6, data reload: false

------ Round 1 ----------------------------------
orders	Doris	NULL	NULL	0	0	0	NULL	0	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	17615	3874	3838	3838
q2	q3	10699	876	630	630
q4	4660	465	349	349
q5	7443	1352	1138	1138
q6	194	167	138	138
q7	905	936	753	753
q8	9410	1402	1278	1278
q9	5996	5359	5314	5314
q10	6311	2066	1799	1799
q11	473	264	253	253
q12	687	408	286	286
q13	18198	3285	2752	2752
q14	292	280	259	259
q15	q16	899	876	784	784
q17	1022	947	819	819
q18	6484	5725	5607	5607
q19	1446	1273	1104	1104
q20	517	395	257	257
q21	4684	2393	2024	2024
q22	453	390	317	317
Total cold run time: 98388 ms
Total hot run time: 29699 ms

----- Round 2, with runtime_filter_mode=off -----
orders	Doris	NULL	NULL	150000000	42	6422171781	NULL	22778155	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	4677	4527	4502	4502
q2	q3	4678	4779	4236	4236
q4	2137	2175	1401	1401
q5	5041	5045	5272	5045
q6	197	168	134	134
q7	2056	1790	1606	1606
q8	3362	3131	3108	3108
q9	8475	8523	8491	8491
q10	4449	4525	4266	4266
q11	614	438	415	415
q12	687	744	511	511
q13	3318	3586	2923	2923
q14	295	309	273	273
q15	q16	738	786	693	693
q17	1338	1277	1262	1262
q18	7944	7028	7060	7028
q19	1197	1148	1194	1148
q20	2245	2268	1988	1988
q21	6108	5425	4917	4917
q22	531	475	400	400
Total cold run time: 60087 ms
Total hot run time: 54347 ms

hello-stephen · 2026-05-12T03:04:29Z

TPC-DS: Total hot run time: 170254 ms

machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit ac1d83dd1e4f3423d0170575b4a7cd906b1456f6, data reload: false

query5	4317	639	528	528
query6	323	218	215	215
query7	4294	579	321	321
query8	331	228	214	214
query9	8821	4069	4040	4040
query10	457	340	292	292
query11	5799	2383	2260	2260
query12	184	132	132	132
query13	1302	603	441	441
query14	6723	5380	5058	5058
query14_1	4328	4328	4359	4328
query15	208	208	185	185
query16	1005	455	421	421
query17	1134	740	629	629
query18	2740	479	347	347
query19	216	204	161	161
query20	147	130	132	130
query21	215	142	118	118
query22	13569	13492	13313	13313
query23	17096	16414	15891	15891
query23_1	16045	16126	16067	16067
query24	7440	1777	1358	1358
query24_1	1374	1365	1372	1365
query25	597	539	485	485
query26	1170	321	173	173
query27	2716	602	347	347
query28	4435	1984	1967	1967
query29	1019	677	535	535
query30	309	246	208	208
query31	1118	1069	950	950
query32	92	80	74	74
query33	551	362	305	305
query34	1209	1118	648	648
query35	781	804	694	694
query36	1332	1370	1241	1241
query37	169	106	105	105
query38	3206	3123	3045	3045
query39	936	913	904	904
query39_1	880	886	876	876
query40	257	165	156	156
query41	72	69	68	68
query42	116	112	111	111
query43	325	332	298	298
query44	
query45	218	207	200	200
query46	1051	1226	726	726
query47	2354	2277	2196	2196
query48	405	412	308	308
query49	653	546	458	458
query50	718	284	223	223
query51	4349	4252	4229	4229
query52	109	108	97	97
query53	256	281	217	217
query54	326	288	282	282
query55	95	95	85	85
query56	312	333	326	326
query57	1423	1430	1336	1336
query58	329	291	281	281
query59	1553	1634	1425	1425
query60	352	356	342	342
query61	221	150	157	150
query62	667	653	558	558
query63	245	208	217	208
query64	2251	827	690	690
query65	
query66	1672	522	395	395
query67	30000	30017	29937	29937
query68	
query69	455	361	308	308
query70	1043	998	937	937
query71	320	275	266	266
query72	2938	2699	2425	2425
query73	833	742	437	437
query74	5069	4913	4786	4786
query75	2758	2679	2329	2329
query76	2321	1135	766	766
query77	434	420	356	356
query78	12910	12934	12283	12283
query79	1597	926	752	752
query80	1376	583	490	490
query81	518	286	234	234
query82	916	158	125	125
query83	340	278	250	250
query84	273	144	109	109
query85	898	549	449	449
query86	454	334	327	327
query87	3472	3360	3248	3248
query88	3526	2698	2619	2619
query89	459	382	335	335
query90	1903	182	178	178
query91	182	171	146	146
query92	83	78	79	78
query93	1232	970	548	548
query94	733	338	304	304
query95	687	380	350	350
query96	1099	784	333	333
query97	2715	2672	2581	2581
query98	244	225	241	225
query99	1112	1144	983	983
Total cold run time: 254810 ms
Total hot run time: 170254 ms

hello-stephen · 2026-05-12T06:23:21Z

BE UT Coverage Report

Increment line coverage 100.00% (6/6) 🎉

Increment coverage report
Complete coverage report

Category	Coverage
Function Coverage	53.62% (20650/38515)
Line Coverage	37.23% (195128/524084)
Region Coverage	33.60% (152467/453750)
Branch Coverage	34.61% (66501/192122)

hello-stephen · 2026-05-12T07:41:06Z

BE Regression && UT Coverage Report

Increment line coverage 100.00% (6/6) 🎉

Increment coverage report
Complete coverage report

Category	Coverage
Function Coverage	73.78% (27830/37718)
Line Coverage	57.63% (301250/522710)
Region Coverage	54.85% (251289/458169)
Branch Coverage	56.32% (108617/192850)

HappenLee

LGTM

github-actions · 2026-05-12T09:50:50Z

PR approved by at least one committer and no changes requested.

not need

zclllyybb · 2026-05-15T04:15:02Z

skip buildall

github-actions · 2026-05-15T04:15:12Z

PR approved by anyone and no changes requested.

HappenLee · 2026-05-18T03:20:38Z

run buildall

dqz123 · 2026-05-18T12:33:39Z

run buildall

hello-stephen · 2026-05-18T13:47:29Z

TPC-H: Total hot run time: 31376 ms

machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 6ab74884f1326a36a83574adbfca820c141cf921, data reload: false

------ Round 1 ----------------------------------
orders	Doris	NULL	NULL	0	0	0	NULL	0	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	17652	3878	3887	3878
q2	q3	10772	1362	792	792
q4	4681	472	340	340
q5	7562	2251	2092	2092
q6	235	175	138	138
q7	904	805	629	629
q8	9363	1660	1596	1596
q9	5109	4922	4836	4836
q10	6417	2080	1775	1775
q11	429	282	245	245
q12	628	419	291	291
q13	18186	3408	2775	2775
q14	264	256	231	231
q15	q16	817	768	710	710
q17	948	976	974	974
q18	6821	5821	5653	5653
q19	1367	1405	1067	1067
q20	509	408	316	316
q21	6482	2812	2723	2723
q22	461	371	315	315
Total cold run time: 99607 ms
Total hot run time: 31376 ms

----- Round 2, with runtime_filter_mode=off -----
orders	Doris	NULL	NULL	150000000	42	6422171781	NULL	22778155	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	4713	4573	4516	4516
q2	q3	4913	5217	4581	4581
q4	2132	2183	1417	1417
q5	4889	4746	4601	4601
q6	225	184	130	130
q7	1913	1716	1498	1498
q8	2353	2015	2014	2014
q9	7621	7235	7197	7197
q10	4435	4401	4021	4021
q11	520	386	346	346
q12	707	719	514	514
q13	2981	3363	2819	2819
q14	279	273	254	254
q15	q16	681	698	604	604
q17	1267	1227	1228	1227
q18	7589	6755	6765	6755
q19	1150	1048	1083	1048
q20	2217	2195	1914	1914
q21	5301	4633	4478	4478
q22	522	441	399	399
Total cold run time: 56408 ms
Total hot run time: 50333 ms

hello-stephen · 2026-05-18T13:58:19Z

TPC-DS: Total hot run time: 168320 ms

machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 6ab74884f1326a36a83574adbfca820c141cf921, data reload: false

query5	4305	651	509	509
query6	330	216	200	200
query7	4220	556	307	307
query8	336	233	216	216
query9	8806	3952	3949	3949
query10	458	343	301	301
query11	5725	2398	2167	2167
query12	182	129	126	126
query13	1262	617	439	439
query14	5948	5295	4984	4984
query14_1	4318	4291	4306	4291
query15	214	200	182	182
query16	1024	438	431	431
query17	1134	714	573	573
query18	2435	483	356	356
query19	208	196	155	155
query20	134	133	127	127
query21	205	136	116	116
query22	13541	13501	13387	13387
query23	17203	16367	15969	15969
query23_1	16102	16039	16120	16039
query24	7399	1780	1310	1310
query24_1	1313	1294	1304	1294
query25	583	513	444	444
query26	1329	332	176	176
query27	2655	539	342	342
query28	4529	1951	1920	1920
query29	1018	653	529	529
query30	315	244	203	203
query31	1128	1062	946	946
query32	101	75	73	73
query33	567	342	282	282
query34	1175	1143	644	644
query35	753	788	657	657
query36	1354	1278	1188	1188
query37	151	97	90	90
query38	3215	3131	3028	3028
query39	932	905	901	901
query39_1	876	874	866	866
query40	223	142	123	123
query41	64	62	62	62
query42	110	111	107	107
query43	327	336	282	282
query44	
query45	202	201	195	195
query46	1100	1154	730	730
query47	2354	2338	2199	2199
query48	394	423	286	286
query49	625	487	368	368
query50	944	353	241	241
query51	4420	4291	4178	4178
query52	106	103	93	93
query53	250	267	204	204
query54	307	273	249	249
query55	96	93	88	88
query56	306	308	303	303
query57	1435	1375	1299	1299
query58	297	268	264	264
query59	1548	1631	1432	1432
query60	322	330	308	308
query61	167	154	162	154
query62	666	628	563	563
query63	248	205	209	205
query64	2414	798	647	647
query65	
query66	1727	475	341	341
query67	29987	29392	29795	29392
query68	
query69	460	330	306	306
query70	1001	958	1003	958
query71	308	275	277	275
query72	3209	2870	2636	2636
query73	801	728	419	419
query74	5093	4880	4730	4730
query75	2665	2571	2219	2219
query76	2268	1146	749	749
query77	388	397	319	319
query78	12283	12188	11619	11619
query79	1424	1014	755	755
query80	635	547	441	441
query81	455	283	239	239
query82	1289	155	121	121
query83	356	278	250	250
query84	258	135	108	108
query85	880	541	448	448
query86	393	325	309	309
query87	3417	3344	3244	3244
query88	3519	2662	2620	2620
query89	449	389	333	333
query90	1946	185	173	173
query91	176	174	136	136
query92	80	75	73	73
query93	1436	1429	865	865
query94	529	345	306	306
query95	674	385	342	342
query96	1051	834	324	324
query97	2693	2725	2541	2541
query98	235	231	230	230
query99	1131	1097	991	991
Total cold run time: 252698 ms
Total hot run time: 168320 ms

hello-stephen · 2026-05-18T14:48:07Z

BE UT Coverage Report

Increment line coverage 100.00% (5/5) 🎉

Increment coverage report
Complete coverage report

Category	Coverage
Function Coverage	53.54% (20670/38604)
Line Coverage	37.20% (195358/525121)
Region Coverage	33.55% (152805/455438)
Branch Coverage	34.60% (66645/192628)

hello-stephen · 2026-05-18T15:41:23Z

BE Regression && UT Coverage Report

Increment line coverage 100.00% (5/5) 🎉

Increment coverage report
Complete coverage report

Category	Coverage
Function Coverage	73.52% (27793/37804)
Line Coverage	57.49% (301095/523741)
Region Coverage	54.71% (251573/459851)
Branch Coverage	56.19% (108655/193354)

HappenLee

LGTM

hello-stephen · 2026-05-21T08:30:00Z

/review

github-actions

I found one blocking issue in the new BE unit test: it accesses protected VExpr internals directly, so the test target will not compile.

Critical checkpoint conclusions:

Goal/test: the production change targets COALESCE floating-point NaN/Inf pollution and the added tests cover that behavior, but the tests are currently uncompilable.
Scope: the production code change is small and focused.
Concurrency/lifecycle/config/compatibility/persistence/write paths: not applicable for this expression-local fix.
Parallel paths: Float32 and Float64 are covered; integer path is left unchanged and has a sanity test.
Error handling/memory/observability: no new Status propagation, large allocations, or observability concerns found in the changed production path.
Test coverage/results: coverage was added in response to the earlier review thread, but must be fixed to compile before it can prove the behavior.

User focus: no additional user-provided review focus was specified.

github-actions · 2026-05-21T08:37:19Z

+    auto coalesce_expr = VectorizedCoalesceExpr::create_shared(coalesce_node);
+
+    // _data_type is already set to Nullable(Float64) by the base ctor; reassert it explicitly.
+    coalesce_expr->_data_type =


This test file will not compile because it writes coalesce_expr->_data_type here and coalesce_expr->_open_finished below, but both members are protected in VExpr (be/src/exprs/vexpr.h). The test is neither a derived class nor a friend of VExpr/VectorizedCoalesceExpr, so the BE UT target fails at compile time before exercising the regression. Please avoid direct protected-member access, for example use the public data_type() accessor if the type really needs overriding and remove the _open_finished assignments since these tests call execute_column_impl() directly. The same pattern appears in the other test cases in this file as well.

… into fix-coalesce

dqz123 · 2026-05-22T03:32:35Z

run buildall

hello-stephen · 2026-05-22T04:51:31Z

TPC-H: Total hot run time: 31055 ms

machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit ff603c972df605fb5a426f0481a3e809587e8dbb, data reload: false

------ Round 1 ----------------------------------
orders	Doris	NULL	NULL	0	0	0	NULL	0	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	17718	3968	3875	3875
q2	q3	10835	1392	792	792
q4	4687	465	340	340
q5	7653	2297	2101	2101
q6	236	173	134	134
q7	931	782	637	637
q8	9413	1745	1441	1441
q9	5095	4910	4896	4896
q10	6405	2147	1785	1785
q11	432	269	243	243
q12	631	423	295	295
q13	18137	3348	2738	2738
q14	262	256	228	228
q15	q16	815	797	716	716
q17	968	932	912	912
q18	7002	5817	5529	5529
q19	1313	1169	1114	1114
q20	516	398	353	353
q21	6260	2829	2618	2618
q22	453	363	308	308
Total cold run time: 99762 ms
Total hot run time: 31055 ms

----- Round 2, with runtime_filter_mode=off -----
orders	Doris	NULL	NULL	150000000	42	6422171781	NULL	22778155	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	4854	4572	4531	4531
q2	q3	4994	5211	4631	4631
q4	2143	2172	1372	1372
q5	4743	4650	4576	4576
q6	234	177	131	131
q7	1905	1754	1528	1528
q8	2352	2015	2020	2015
q9	7752	7583	7162	7162
q10	4448	4368	3950	3950
q11	522	372	343	343
q12	702	728	510	510
q13	2999	3412	2845	2845
q14	269	275	252	252
q15	q16	678	700	608	608
q17	1252	1229	1217	1217
q18	7344	6803	6830	6803
q19	1169	1116	1114	1114
q20	2198	2194	1924	1924
q21	5261	4582	4409	4409
q22	513	465	412	412
Total cold run time: 56332 ms
Total hot run time: 50333 ms

hello-stephen · 2026-05-22T04:53:27Z

BE UT Coverage Report

Increment line coverage 100.00% (5/5) 🎉

Increment coverage report
Complete coverage report

Category	Coverage
Function Coverage	53.68% (20763/38677)
Line Coverage	37.28% (196731/527765)
Region Coverage	33.60% (154184/458914)
Branch Coverage	34.59% (67147/194098)

hello-stephen · 2026-05-22T05:02:23Z

TPC-DS: Total hot run time: 168962 ms

machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit ff603c972df605fb5a426f0481a3e809587e8dbb, data reload: false

query5	4317	638	509	509
query6	340	232	196	196
query7	4226	572	302	302
query8	327	240	223	223
query9	8821	3996	3993	3993
query10	443	345	292	292
query11	5770	2430	2208	2208
query12	181	126	124	124
query13	1276	600	434	434
query14	5960	5347	5014	5014
query14_1	4312	4317	4263	4263
query15	209	205	175	175
query16	997	450	398	398
query17	946	702	574	574
query18	2433	487	349	349
query19	221	201	157	157
query20	136	135	127	127
query21	219	142	119	119
query22	13628	13482	13340	13340
query23	17182	16382	15977	15977
query23_1	16157	16162	16075	16075
query24	7505	1746	1333	1333
query24_1	1305	1300	1292	1292
query25	583	494	440	440
query26	1325	330	172	172
query27	2680	553	347	347
query28	4496	1954	1978	1954
query29	1049	622	518	518
query30	313	237	201	201
query31	1109	1074	948	948
query32	97	78	75	75
query33	558	366	306	306
query34	1177	1160	651	651
query35	783	798	677	677
query36	1393	1349	1183	1183
query37	154	101	98	98
query38	3210	3140	3038	3038
query39	925	923	898	898
query39_1	873	892	875	875
query40	241	150	127	127
query41	71	70	71	70
query42	115	113	123	113
query43	322	331	286	286
query44	
query45	216	205	200	200
query46	1081	1174	733	733
query47	2323	2375	2204	2204
query48	414	409	301	301
query49	659	502	398	398
query50	975	353	261	261
query51	4310	4260	4178	4178
query52	110	108	101	101
query53	253	286	208	208
query54	318	283	269	269
query55	99	92	86	86
query56	301	313	310	310
query57	1421	1414	1304	1304
query58	305	281	279	279
query59	1606	1617	1408	1408
query60	371	308	300	300
query61	155	158	159	158
query62	665	625	556	556
query63	244	194	212	194
query64	2444	805	639	639
query65	
query66	1738	476	349	349
query67	29986	29941	29900	29900
query68	
query69	471	345	298	298
query70	1009	990	981	981
query71	296	268	265	265
query72	3080	2800	2306	2306
query73	846	759	427	427
query74	5059	4899	4722	4722
query75	2662	2573	2278	2278
query76	2296	1119	732	732
query77	394	400	335	335
query78	12035	12155	11565	11565
query79	1441	1053	777	777
query80	700	544	451	451
query81	467	279	242	242
query82	1362	155	118	118
query83	337	276	243	243
query84	310	142	109	109
query85	893	524	463	463
query86	449	355	312	312
query87	3408	3394	3261	3261
query88	3523	2653	2658	2653
query89	466	380	336	336
query90	1877	177	181	177
query91	177	164	138	138
query92	81	73	73	73
query93	1481	1391	937	937
query94	563	372	317	317
query95	678	471	356	356
query96	1016	742	313	313
query97	2695	2695	2558	2558
query98	236	227	223	223
query99	1297	1108	965	965
Total cold run time: 252977 ms
Total hot run time: 168962 ms

hello-stephen · 2026-05-22T06:28:33Z

BE Regression && UT Coverage Report

Increment line coverage 100.00% (5/5) 🎉

Increment coverage report
Complete coverage report

Category	Coverage
Function Coverage	73.64% (27898/37883)
Line Coverage	57.58% (303138/526419)
Region Coverage	54.70% (253463/463343)
Branch Coverage	56.28% (109644/194825)

already

HappenLee

LGTM

hello-stephen · 2026-05-28T07:17:14Z

/review

github-actions

Automated review summary:

I reviewed the full PR diff and existing inline review context. I did not find any additional blocking issues distinct from the already-known review threads.

Critical checkpoint conclusions:

Goal and tests: The PR targets COALESCE result pollution from NaN/Inf payloads in skipped nullable floating-point rows. The added BE unit tests cover Float64 NaN, Float64 Inf, selected NaN preservation, Float32 NaN, and an integer control path.
Scope and clarity: The production change is small and localized to the coalesce scalar fill helper; the test is focused on the regression scenario.
Concurrency and lifecycle: No new shared state, threads, locks, static initialization, or lifecycle-sensitive ownership were introduced.
Compatibility and persistence: No storage format, wire protocol, EditLog, or rolling-upgrade compatibility impact found.
Parallel code paths: The regular coalesce numeric fill path is the affected path; the short-circuit coalesce path fills selected rows directly and does not appear to share this arithmetic pollution issue.
Error handling and observability: No new Status-discarding path or user-visible error path was introduced; no extra observability appears necessary for this local arithmetic fix.
Memory and performance: No significant new allocation or memory-tracking concern in production code. The per-row finite check is only on Float32/Float64 coalesce fill and is bounded by the existing scan.
Test result files: No regression .out files are involved because coverage is added as BE UTs.

User focus: No additional user-provided review focus was specified.

### What problem does this PR solve? Issue Number: close #xxx Example: COALESCE(same_department_income_amount, 0) ==> outputs NULL (where same_department_income_amount is of type double). When assigning the value to the result column in the computation, the assignment is done unconditionally (forced), as in: ```cpp result_raw_data[row] += column_raw_data[row] * typename ColumnType::value_type(!(null_map_data[row] | filled_flag[row])); ``` If the argument column column_raw_data's null_map[row] is 1, then the value stored in column_raw_data[row] is garbage data. This garbage may contain values such as NaN. If a preceding argument of COALESCE happens to be assigned NaN, then during subsequent assignments we run into cases like: 0 * NaN = NaN num + NaN = NaN so the assigned result also becomes NaN, which causes value pollution. By rights the final output should also be NaN, but what is actually returned is NULL. The reason is that during result serialization/output, NaN values are emitted as NULL. ```cpp tatus DataTypeNumberSerDe<T>::_write_column_to_mysql(const IColumn& column, MysqlRowBuffer<is_binary_format>& result, int row_idx, bool col_const, const FormatOptions& options) const { //... else if constexpr (std::is_same_v<T, float>) { if (std::isnan(data[col_index])) { // Handle NaN for float, we should push null value buf_ret = result.push_null(); } else { buf_ret = result.push_float(data[col_index]); } } //... } ``` Co-authored-by: garenshi <garenshi@tencent.com>

[BUG] fix coalesce function output null

3c1584a

HappenLee changed the title ~~[BUG] fix coalesce function output null~~ [BUG](exec) fix coalesce function output null May 8, 2026

github-actions Bot previously requested changes May 9, 2026

View reviewed changes

yiguolei reviewed May 9, 2026

View reviewed changes

add ut

1409eb1

1

ac1d83d

HappenLee previously approved these changes May 12, 2026

View reviewed changes

github-actions Bot added the approved Indicates a PR has been approved by one committer. label May 12, 2026

github-actions Bot added the reviewed label May 15, 2026

zclllyybb closed this May 15, 2026

zclllyybb reopened this May 15, 2026

HappenLee added dev/4.0.x dev/4.1.x labels May 15, 2026

1

9d6411e

qzsee dismissed HappenLee’s stale review via 9d6411e May 18, 2026 03:00

HappenLee previously approved these changes May 21, 2026

View reviewed changes

hello-stephen closed this May 21, 2026

hello-stephen reopened this May 21, 2026

github-actions Bot previously requested changes May 21, 2026

View reviewed changes

garenshi added 2 commits May 22, 2026 11:30

fix ut

e37b546

Merge branch 'fix-coalesce' of https://github.com/qzsee/incubator-doris…

db740db

… into fix-coalesce

qzsee dismissed HappenLee’s stale review via db740db May 22, 2026 03:30

Merge branch 'master' into fix-coalesce

ff603c9

HappenLee approved these changes May 27, 2026

View reviewed changes

hello-stephen closed this May 28, 2026

hello-stephen reopened this May 28, 2026

github-actions Bot reviewed May 28, 2026

View reviewed changes

hello-stephen merged commit a95974c into apache:master May 28, 2026
32 of 33 checks passed

This was referenced May 28, 2026

branch-4.0: [BUG](exec) fix coalesce function output null #63092 #63807

Open

branch-4.1: [BUG](exec) fix coalesce function output null #63092 #63808

Open

Conversation

qzsee commented May 8, 2026

What problem does this PR solve?

Release note

Check List (For Author)

Check List (For Reviewer who merge this PR)

Uh oh!

hello-stephen commented May 8, 2026

Uh oh!

zclllyybb commented May 9, 2026

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot May 9, 2026

Choose a reason for hiding this comment

Uh oh!

yiguolei May 9, 2026

Choose a reason for hiding this comment

Uh oh!

HappenLee commented May 11, 2026

Uh oh!

hello-stephen commented May 11, 2026

Uh oh!

hello-stephen commented May 11, 2026

Uh oh!

dqz123 commented May 12, 2026

Uh oh!

hello-stephen commented May 12, 2026

Uh oh!

hello-stephen commented May 12, 2026

Uh oh!

hello-stephen commented May 12, 2026

BE UT Coverage Report

Uh oh!

hello-stephen commented May 12, 2026

BE Regression && UT Coverage Report

Uh oh!

HappenLee left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented May 12, 2026

Uh oh!

zclllyybb commented May 15, 2026

Uh oh!

github-actions Bot commented May 15, 2026

Uh oh!

HappenLee commented May 18, 2026

Uh oh!

dqz123 commented May 18, 2026

Uh oh!

hello-stephen commented May 18, 2026

Uh oh!

hello-stephen commented May 18, 2026

Uh oh!

hello-stephen commented May 18, 2026

BE UT Coverage Report

Uh oh!

hello-stephen commented May 18, 2026

BE Regression && UT Coverage Report

Uh oh!

HappenLee left a comment

Choose a reason for hiding this comment

Uh oh!

hello-stephen commented May 21, 2026

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot May 21, 2026

Choose a reason for hiding this comment

Uh oh!

dqz123 commented May 22, 2026

Uh oh!

hello-stephen commented May 22, 2026

Uh oh!

hello-stephen commented May 22, 2026

BE UT Coverage Report

Uh oh!

hello-stephen commented May 22, 2026