ggplot2条形图中的条形排序

我要做一个柱状图，其中最大的柱状图离y轴最近，最短的柱状图离y轴最远。这有点像我的表格

    Name   Position
1   James  Goalkeeper
2   Frank  Goalkeeper
3   Jean   Defense
4   Steve  Defense
5   John   Defense
6   Tim    Striker

所以我试图建立一个条形图，根据位置显示球员的数量

p <- ggplot(theTable, aes(x = Position)) + geom_bar(binwidth = 1)

但是图表显示的是门将栏，然后是防守栏，最后是前锋栏。我希望图表的顺序是，防守条最靠近y轴，守门员条，最后是前锋条。谢谢

当前回答

我同意zach的观点，在dplyr内计数是最好的解决方案。我发现这是最短的版本:

dplyr::count(theTable, Position) %>%
          arrange(-n) %>%
          mutate(Position = factor(Position, Position)) %>%
          ggplot(aes(x=Position, y=n)) + geom_bar(stat="identity")

这也将比事先重新排序因子级别快得多，因为计数是在dplyr中完成的，而不是在ggplot或使用table中完成的。

2016-07-31 19:11:08

其他回答

排序的关键是按照您想要的顺序设置因子的级别。不需要有序因子;一个有序因子中的额外信息是不必要的，如果这些数据被用于任何统计模型中，可能会导致错误的参数化——多项式对比不适用于这样的名义数据。

## set the levels in order we want
theTable <- within(theTable, 
                   Position <- factor(Position, 
                                      levels=names(sort(table(Position), 
                                                        decreasing=TRUE))))
## plot
ggplot(theTable,aes(x=Position))+geom_bar(binwidth=1)

在最一般的意义上，我们只需要将因子级别设置为所需的顺序。如果不指定，因子的级别将按字母顺序排序。您还可以如上所述在因子调用中指定级别顺序，也可以采用其他方法。

theTable$Position <- factor(theTable$Position, levels = c(...))

2011-03-06 13:42:41

由于我们只关注单个变量(“位置”)的分布，而不是两个变量之间的关系，那么直方图可能是更合适的图形。Ggplot有geom_histogram()，这使得它很容易:

ggplot(theTable, aes(x = Position)) + geom_histogram(stat="count")

使用geom_histogram ():

我认为geom_histogram()有点古怪，因为它对待连续数据和离散数据是不同的。

对于连续数据，可以只使用不带参数的geom_histogram()。例如，如果我们添加一个数字向量“Score”……

    Name   Position   Score  
1   James  Goalkeeper 10
2   Frank  Goalkeeper 20
3   Jean   Defense    10
4   Steve  Defense    10
5   John   Defense    20
6   Tim    Striker    50

然后在“Score”变量上使用geom_histogram()…

ggplot(theTable, aes(x = Score)) + geom_histogram()

对于像“Position”这样的离散数据，我们必须指定一个由美学计算出来的统计数据，使用stat = "count"来给出条形高度的y值:

 ggplot(theTable, aes(x = Position)) + geom_histogram(stat = "count")

注意:奇怪且令人困惑的是，你也可以使用stat = "count"来表示连续的数据，我认为它提供了一个更美观的图形。

ggplot(theTable, aes(x = Score)) + geom_histogram(stat = "count")

编辑:对DebanjanB的有用建议的扩展回答。

2019-02-14 11:41:42

@GavinSimpson:重新排序是一个强大而有效的解决方案:

ggplot(theTable,
       aes(x=reorder(Position,Position,
                     function(x)-length(x)))) +
       geom_bar()

2012-02-10 17:13:16

如果不想使用ggplot2，还有一个ggpubr，它为ggbarplot函数提供了一个非常有用的参数。你可以对条形图进行排序。Val在“desc”和“asc”中是这样的:

library(dplyr)
library(ggpubr)
# desc
df %>%
  count(Position) %>%
  ggbarplot(x = "Position", 
            y = "n",
            sort.val = "desc")

# asc
df %>%
  count(Position) %>%
  ggbarplot(x = "Position", 
            y = "n",
            sort.val = "asc")

由reprex包于2022-08-14创建(v2.0.1)

如您所见，对条形进行排序非常简单。如果条形图是分组的，也可以这样做。查看上面的链接，获取一些有用的例子。

2022-08-14 16:43:33

另一种方法是使用重新排序来排列因子的级别。根据计数的升序(n)或降序(-n)。非常类似于使用forcats包中的fct_reorder:

降序排列

df %>%
  count(Position) %>%
  ggplot(aes(x = reorder(Position, -n), y = n)) +
  geom_bar(stat = 'identity') +
  xlab("Position")

升序排序 df % > % 数(位置)% > % ggplot(aes(x = reorder(Position, n)， y = n)) + Geom_bar (stat = 'identity') + xlab(“位置”) 数据帧: df <- structure(list(Position = structure(c(3L, 3L, 1L, 1L, 1L, 2L)， .Label = c("防御"， "前锋"，"Zoalkeeper")， class = "factor")， Name =结构(c(2L， 1 l, 3 l 5 l 4 l, 6 l), .Label = c(“弗兰克”,“詹姆斯”,“琼”,“约翰”, "Steve"， "Tim")， class = "factor"))， class = "data.frame"， row.names = c(NA， 6 l))

2019-02-03 15:27:03

ggplot2条形图中的条形排序

推荐文章

最新文章

标签