sql使用加和进行合并去重并提升速率

sql使用加和进行合并去重并提升速率

码农世界 2024-05-23 前端 77 次浏览 0个评论

背景

  • 有三张表ltd1 、ltd0051和、ltd0011
  • ltd1作为主表,左关联 ltd0051和ltd0011
  • 如果ltd0051有两条重复数据、td0011有两条重复数据,左关联之后就会得到4条,同时ltd0051和ltd0011这两条数据都是正确,基于主键我们需要将两个相同主键的数据合并成一条

ltd0051和ltd0011 中是存在重复数据的,

ltd0051重复数据的判定规则:planid和batchno,MatCode相同的判定为重复数据

ltd0011重复数据的判定规则:planid和batchno,MatCode相同的判定为重复数据

第一步:实现需求

使用了两个子查询,对重复数据进行加和,得到两个新表

SELECT
    ltd1.plan_id_out,
    ltd1.batchno_out,
    ltd1.lot_id_out,
    ltd1.material_code_out,
    ltd1.material_name_out,
    ltd1.equip_id_out,
    ltd1.pro_date_out,
    ltd1.shift_id_out,
    ltd1.weight_out,
    ltd1.state_out,
    CASE WHEN t11.MATCODE IS NOT NULL THEN t11.MATCODE ELSE t51.MATCODE END AS material_code_in,
    CASE WHEN t11.MATNAME IS NOT NULL THEN t11.MATNAME ELSE t51.MATNAME END AS material_name_in,
    CASE WHEN t11.SWeight IS NOT NULL THEN t11.SWeight ELSE t51.ActWT END AS weight_in,
    t11.lotid AS lot_id_in,
    ltin.orderno AS plan_id_in,
    ltin.JDAT AS pro_date_in,
    ltin.JSHT AS shift_id_in,
    ltin.JMCH AS equip_id_in
FROM
    (
        SELECT
            ORDERNO AS plan_id_out,
            LOTID AS lot_id_out,
            JDAT AS pro_date_out,
            CAST(JSHT AS INT) AS shift_id_out,
            JMCH AS equip_id_out,
            ITNBR AS material_code_out,
            ITDSC AS material_name_out,
            '' AS unit_cost_out,
            jwt AS weight_out,
            '' AS cost_out,
            CAST(VALUE AS INT) AS batchno_out,
            CASE WHEN STATE IN (4, 5) THEN '空走' ELSE '非空走' END AS state_out
        FROM
            PLMES.dbo.LTD0001
        CROSS APPLY STRING_SPLIT(CNUMNEW, ',')
        WHERE
            DIV <> 'XL'
    ) LTD1
    LEFT JOIN (
        SELECT
            planid,
            batchno,
            MATCODE,
            SUM(ActWT) AS ActWT,
            MAX(MatName) AS MatName
        FROM
            ltd0051
        GROUP BY
            planid, batchno, MATCODE
    ) t51 ON LTD1.plan_id_out = t51.planid AND ltd1.batchno_out = t51.batchno
    LEFT JOIN (
        SELECT
            planid,
            batchno,
            MATCODE,
            SUM(CAST(SWeight AS DECIMAL(20, 10))) AS SWeight,
            MAX(MatName) AS MatName,
            lotid
        FROM
            ltd0011
        GROUP BY
            planid, batchno, MATCODE, lotid
    ) t11 ON LTD1.plan_id_out = t11.planid AND ltd1.batchno_out = t11.batchno AND t51.MATCODE = t11.MATCODE
    LEFT JOIN ltd0001 ltin ON t11.LOTID = ltin.lotid
WHERE
    LTD1.pro_date_out = '2024-05-20'
    AND LTD1.shift_id_out = 1

优化查询效率

  • 查询出来了,但是很慢
  1. 索引优化:确保在 ltd0051 和 ltd0011 表的 planid、batchno 和 MATCODE 列上有适当的索引。但是这不是我们自己的表,无法实现。
  1. 减少数据量:在子查询中添加过滤条件,减少需要处理的数据量。但是,我们是根据主表ltd1作为筛选条件的,无法对子表进行条件查询
  1. CTE (Common Table Expressions):使用 WITH 语句创建两个 CTE (t51_agg 和 t11_agg) 来存储聚合后的数据。
-- 取 产出数据
WITH LTD1 AS (
	SELECT
		ORDERNO AS plan_id_out,
		LOTID AS lot_id_out,
		JDAT AS pro_date_out,
		CAST ( JSHT AS INT ) AS shift_id_out,
		JMCH AS equip_id_out,
		ITNBR AS material_code_out,
		ITDSC AS material_name_out,
		'' AS unit_cost_out,
		jwt AS weight_out,
		'' AS cost_out,
		CAST ( VALUE AS INT ) AS batchno_out,
	CASE
			
			WHEN STATE IN ( 4, 5 ) THEN
			'空走' ELSE '非空走' 
		END AS state_out 
	FROM
		PLMES.dbo.LTD0001 CROSS APPLY STRING_SPLIT ( CNUMNEW, ',' ) 
	WHERE
		DIV <> 'XL' 
		AND jdat = '2024-05-20' 
		AND jsht = 1 
	),
-- 关联得到 称重数据
	t51_agg AS (
	SELECT
		planid,
		batchno,
		MATCODE,
		SUM ( ActWT ) AS ActWT,
		MAX ( MatName ) AS MatName 
	FROM
		ltd0051 
	GROUP BY
		planid,
		batchno,
		MATCODE 
	),
-- 关联得到 追溯数据
	t11_agg AS (
	SELECT
		planid,
		batchno,
		MATCODE,
		SUM ( CAST ( SWeight AS DECIMAL ( 20, 10 ) ) ) AS SWeight,
		MAX ( MatName ) AS MatName,
		lotid 
	FROM
		ltd0011 
	GROUP BY
		planid,
		batchno,
		MATCODE,
		lotid 
	) SELECT
	ltd1.plan_id_out,
	ltd1.batchno_out,
	ltd1.lot_id_out,
	ltd1.material_code_out,
	ltd1.material_name_out,
	ltd1.equip_id_out,
	ltd1.pro_date_out,
	ltd1.shift_id_out,
	ltd1.weight_out,
	ltd1.state_out,
CASE
		
		WHEN t11.MATCODE IS NOT NULL THEN
		t11.MATCODE ELSE t51.MATCODE 
	END AS material_code_in,
CASE
		
		WHEN t11.MATNAME IS NOT NULL THEN
		t11.MATNAME ELSE t51.MATNAME 
	END AS material_name_in,
CASE
		
		WHEN t11.SWeight IS NOT NULL THEN
		t11.SWeight ELSE t51.ActWT 
	END AS weight_in,
	t11.lotid AS lot_id_in,
	ltin.orderno AS plan_id_in,
	ltin.JDAT AS pro_date_in,
	ltin.JSHT AS shift_id_in,
	ltin.JMCH AS equip_id_in 
FROM
	LTD1
	LEFT JOIN t51_agg t51 ON LTD1.plan_id_out = t51.planid 
	AND ltd1.batchno_out = t51.batchno
	LEFT JOIN t11_agg t11 ON LTD1.plan_id_out = t11.planid 
	AND ltd1.batchno_out = t11.batchno 
	AND t51.MATCODE = t11.MATCODE
	LEFT JOIN ltd0001 ltin ON t11.LOTID = ltin.lotid;

转载请注明来自码农世界,本文标题:《sql使用加和进行合并去重并提升速率》

百度分享代码,如果开启HTTPS请参考李洋个人博客
每一天,每一秒,你所做的决定都会改变你的人生!

发表评论

快捷回复:

评论列表 (暂无评论,77人围观)参与讨论

还没有评论,来说两句吧...

Top