从路径中提取文件名，不管os/path格式是什么

我可以使用哪个Python库从路径中提取文件名，而不管操作系统或路径格式是什么?

例如，我希望所有这些路径都返回c:

a/b/c/
a/b/c
\a\b\c
\a\b\c\
a\b\c
a/b/../../a/b/c/
a/b/../../a/b/c

当前回答

在你的例子中，你还需要从右边去掉斜杠来返回c:

>>> import os
>>> path = 'a/b/c/'
>>> path = path.rstrip(os.sep) # strip the slash from the right side
>>> os.path.basename(path)
'c'

第二个层面:

>>> os.path.filename(os.path.dirname(path))
'b'

更新:我认为lazyr提供了正确的答案。我的代码将无法在unix系统上使用类似windows的路径，而在windows系统上与类似unix的路径相反。

2011-12-05 11:51:21

其他回答

import os
head, tail = os.path.split('path/to/file.exe')

尾巴是你想要的，文件名。

详见python os模块文档

2011-12-05 11:45:21

也许只是我的all in one解决方案，没有重要的一些新的(考虑tempfile创建临时文件:D)

import tempfile
abc = tempfile.NamedTemporaryFile(dir='/tmp/')
abc.name
abc.name.replace("/", " ").split()[-1]

获取abc.name的值将是这样的字符串:'/tmp/tmpks5oksk7' 所以我可以用空格.replace("/"， " ")替换/，然后调用split()。它会返回一个列表，我得到列表中最后一个带有[-1]的元素

不需要导入任何模块。

2014-08-21 15:23:01

在你的例子中，你还需要从右边去掉斜杠来返回c:

>>> import os
>>> path = 'a/b/c/'
>>> path = path.rstrip(os.sep) # strip the slash from the right side
>>> os.path.basename(path)
'c'

第二个层面:

>>> os.path.filename(os.path.dirname(path))
'b'

更新:我认为lazyr提供了正确的答案。我的代码将无法在unix系统上使用类似windows的路径，而在windows系统上与类似unix的路径相反。

2011-12-05 11:51:21

我从来没有见过双开的路，它们存在吗?python模块os的内置特性在这些方面失败了。所有其他工作，还有你用os.path.normpath()给出的警告:

paths = ['a/b/c/', 'a/b/c', '\\a\\b\\c', '\\a\\b\\c\\', 'a\\b\\c', 
...     'a/b/../../a/b/c/', 'a/b/../../a/b/c', 'a/./b/c', 'a\b/c']
for path in paths:
    os.path.basename(os.path.normpath(path))

2015-03-19 21:18:33

像其他人建议的那样使用os.path.split或os.path.basename并不能在所有情况下工作:如果您在Linux上运行脚本并试图处理经典的windows样式的路径，它将失败。

Windows路径可以使用反斜杠或正斜杠作为路径分隔符。因此，ntpath模块(相当于os. path)在windows上运行时的路径)将适用于所有平台上的所有(1)路径。

import ntpath
ntpath.basename("a/b/c")

当然，如果文件以斜杠结束，basename将为空，所以创建自己的函数来处理它:

def path_leaf(path):
    head, tail = ntpath.split(path)
    return tail or ntpath.basename(head)

验证:

>>> paths = ['a/b/c/', 'a/b/c', '\\a\\b\\c', '\\a\\b\\c\\', 'a\\b\\c', 
...     'a/b/../../a/b/c/', 'a/b/../../a/b/c']
>>> [path_leaf(path) for path in paths]
['c', 'c', 'c', 'c', 'c', 'c', 'c']

(1) There's one caveat: Linux filenames may contain backslashes. So on linux, r'a/b\c' always refers to the file b\c in the a folder, while on Windows, it always refers to the c file in the b subfolder of the a folder. So when both forward and backward slashes are used in a path, you need to know the associated platform to be able to interpret it correctly. In practice it's usually safe to assume it's a windows path since backslashes are seldom used in Linux filenames, but keep this in mind when you code so you don't create accidental security holes.

2011-12-05 11:45:25

从路径中提取文件名，不管os/path格式是什么

推荐文章

最新文章

标签