%中的%的反义词:排除在vector中指定值的行

数据帧D1中的分类变量V1可以有从A到z的字母表示的值。我想创建一个子集D2，其中不包括一些值，比如B、N和t。基本上，我想要一个与%中的%相反的命令

D2 = subset(D1, V1 %in% c("B", "N", "T"))

当前回答

require(TSDT)

c(1,3,11) %nin% 1:10
# [1] FALSE FALSE  TRUE

更多信息，请访问:https://cran.r-project.org/web/packages/TSDT/TSDT.pdf

其他回答

你可以使用!运算符基本上使任何TRUE为FALSE，每个FALSE为TRUE。所以:

D2 = subset(D1, !(V1 %in% c('B','N','T')))

编辑: 你也可以自己创建一个操作符:

'%!in%' <- function(x,y)!('%in%'(x,y))

c(1,3,11)%!in%1:10
[1] FALSE FALSE  TRUE

如何:

`%ni%` <- Negate(`%in%`)
c(1,3,11) %ni% 1:10
# [1] FALSE FALSE  TRUE

如果你看%in%的代码

 function (x, table) match(x, table, nomatch = 0L) > 0L

然后你应该能够写出你的相反版本。我使用

`%not in%` <- function (x, table) is.na(match(x, table, nomatch=NA_integer_))

另一种方法是:

function (x, table) match(x, table, nomatch = 0L) == 0L

另一个解决方案是使用setdiff

D1 = c("A",..., "Z") ; D0 = c("B","N","T")

D2 = setdiff(D1, D0)

D2是你想要的子集。

使用negate from purrr也可以快速而整洁地达到目的:

`%not_in%` <- purrr::negate(`%in%`)

例如，用法是，

c("cat", "dog") %not_in% c("dog", "mouse")

推荐文章