在多个列上计数DISTINCT

是否有更好的方法来执行这样的查询:

SELECT COUNT(*) 
FROM (SELECT DISTINCT DocumentId, DocumentSessionId
      FROM DocumentOutputItems) AS internalQuery

我需要数一下这个表中不同项的数量，但不同项超过两列。

我的查询工作得很好，但我想知道我是否可以只使用一个查询(不使用子查询)得到最终结果

当前回答

希望这能起作用，我正在prima vista上写

SELECT COUNT(*) 
FROM DocumentOutputItems 
GROUP BY DocumentId, DocumentSessionId

2009-09-24 12:10:44

其他回答

这段代码使用distinct on 2参数，并提供特定于这些不同值的行数计数。它在MySQL中为我工作，就像一个魅力。

select DISTINCT DocumentId as i,  DocumentSessionId as s , count(*) 
from DocumentOutputItems   
group by i ,s;

2018-11-16 07:17:32

这个怎么样，

Select DocumentId, DocumentSessionId, count(*) as c 
from DocumentOutputItems 
group by DocumentId, DocumentSessionId;

这将得到documententid和DocumentSessionId的所有可能组合的计数

2018-05-01 10:45:09

编辑:从不太可靠的仅校验和查询更改我发现了一种方法来做到这一点(在SQL Server 2005中)，这对我来说很好，我可以使用尽可能多的列，因为我需要(通过将它们添加到CHECKSUM()函数)。REVERSE()函数将int类型转换为varchars类型，以使distinct类型更加可靠

SELECT COUNT(DISTINCT (CHECKSUM(DocumentId,DocumentSessionId)) + CHECKSUM(REVERSE(DocumentId),REVERSE(DocumentSessionId)) )
FROM DocumentOutPutItems

2012-07-06 23:01:04

这对我很管用。在oracle中:

SELECT SUM(DECODE(COUNT(*),1,1,1))
FROM DocumentOutputItems GROUP BY DocumentId, DocumentSessionId;

在jpql:

SELECT SUM(CASE WHEN COUNT(i)=1 THEN 1 ELSE 1 END)
FROM DocumentOutputItems i GROUP BY i.DocumentId, i.DocumentSessionId;

2018-03-29 07:59:14

如果您使用的是固定长度的数据类型，则可以将其转换为二进制，从而非常容易和快速地完成此操作。假设documententid和DocumentSessionId都是int，因此都是4字节长…

SELECT COUNT(DISTINCT CAST(DocumentId as binary(4)) + CAST(DocumentSessionId as binary(4)))
FROM DocumentOutputItems

My specific problem required me to divide a SUM by the COUNT of the distinct combination of various foreign keys and a date field, grouping by another foreign key and occasionally filtering by certain values or keys. The table is very large, and using a sub-query dramatically increased the query time. And due to the complexity, statistics simply wasn't a viable option. The CHECKSUM solution was also far too slow in its conversion, particularly as a result of the various data types, and I couldn't risk its unreliability.

然而，使用上述解决方案几乎没有增加查询时间(与简单使用SUM相比)，并且应该是完全可靠的!它应该能够帮助其他处于类似情况的人，所以我把它贴在这里。

2019-09-18 00:27:22

在多个列上计数DISTINCT

推荐文章

最新文章

标签