如何从文件内容创建Java字符串？

我已经用下面这个成语有一段时间了。这似乎是传播最广的，至少在我访问过的网站上。

在Java中有没有更好/不同的方法将文件读入字符串？

private String readFile(String file) throws IOException {
    BufferedReader reader = new BufferedReader(new FileReader (file));
    String         line = null;
    StringBuilder  stringBuilder = new StringBuilder();
    String         ls = System.getProperty("line.separator");

    try {
        while((line = reader.readLine()) != null) {
            stringBuilder.append(line);
            stringBuilder.append(ls);
        }

        return stringBuilder.toString();
    } finally {
        reader.close();
    }
}

如果您愿意使用外部库，请查看ApacheCommonsIO（200KBJAR）。它包含一个org.apache.commons.io.FileUtils.readFileToString（）方法，该方法允许您用一行代码将整个文件读取为字符串。

例子：

import java.io.*;
import java.nio.charset.*;
import org.apache.commons.io.*;

public String readFile() throws IOException {
    File file = new File("data.txt");
    return FileUtils.readFileToString(file, StandardCharsets.UTF_8);
}

2008-11-28 18:44:36

Java试图在所有方面都非常通用和灵活。因此，脚本语言中相对简单的东西（python中的代码将被替换为“open（file）.read（）”）要复杂得多。除了使用外部库（如Willi aus Rohr提到的），似乎没有任何更短的方法。您的选项：

使用外部库。将此代码复制到所有项目中。创建自己的迷你库，其中包含您经常使用的函数。

你最好的选择可能是第二个，因为它的依赖性最小。

2008-11-28 18:52:37

读取文件中的所有文本

Java 11添加了readString（）方法，将小文件作为字符串读取，保留了行终止符：

String content = Files.readString(path, encoding);

对于Java 7和11之间的版本，这里有一个紧凑、健壮的习惯用法，用实用方法概括：

static String readFile(String path, Charset encoding)
  throws IOException
{
  byte[] encoded = Files.readAllBytes(Paths.get(path));
  return new String(encoded, encoding);
}

从文件中读取文本行

Java7添加了一种方便的方法，以文本行形式读取文件，表示为List<String>。这种方法是“有损的”，因为行分隔符是从每行的末尾剥离的。

List<String> lines = Files.readAllLines(Paths.get(path), encoding);

Java8添加了Files.line（）方法来生成Stream<String>。同样，这种方法是有损耗的，因为行分隔符被剥离了。如果在读取文件时遇到IOException，它将被包装在UncheckedIOException中，因为Stream不接受引发选中异常的lambda。

try (Stream<String> lines = Files.lines(path, encoding)) {
  lines.forEach(System.out::println);
}

此流确实需要close（）调用；这在API中记录得很差，我怀疑很多人甚至没有注意到Stream有一个close（）方法。确保使用如图所示的ARM块。

如果使用的是文件以外的源，则可以改用BufferedReader中的lines（）方法。

内存利用率

如果文件相对于可用内存足够小，那么一次读取整个文件可能会很好。但是，如果文件太大，一次读取一行，处理它，然后在继续下一行之前丢弃它可能是更好的方法。以这种方式进行流处理可以消除作为内存需求因素的总文件大小。

字符编码

原始文章中的示例缺少的一点是字符编码。这种编码通常无法从文件本身确定，并且需要元数据（如HTTP头）来传递这一重要信息。

StandardCharsets类为所有Java运行时所需的编码定义了一些常量：

String content = readFile("test.txt", StandardCharsets.UTF_8);

平台默认值可从Charset类本身获得：

String content = readFile("test.txt", Charset.defaultCharset());

在某些特殊情况下，平台默认值是您想要的，但这种情况很少见。您应该能够证明您的选择是合理的，因为平台默认值是不可移植的。一个可能正确的例子是读取标准输入或写入标准输出时。

注意：这个答案很大程度上取代了我的Java6版本。Java7的实用程序安全地简化了代码，旧的答案使用了映射字节缓冲区，在映射缓冲区被垃圾收集之前，可以防止读取的文件被删除。您可以通过此答案上的“编辑”链接查看旧版本。

2008-11-28 18:56:13

如果您正在寻找不涉及第三方库（例如Commons I/O）的替代方案，可以使用Scanner类：

private String readFile(String pathname) throws IOException {

    File file = new File(pathname);
    StringBuilder fileContents = new StringBuilder((int)file.length());        

    try (Scanner scanner = new Scanner(file)) {
        while(scanner.hasNextLine()) {
            fileContents.append(scanner.nextLine() + System.lineSeparator());
        }
        return fileContents.toString();
    }
}

2008-11-28 19:00:35

该代码将规范换行符，这可能是您真正想要做的，也可能不是。

这里有一个替代方案，它没有做到这一点，而且比NIO代码更容易理解（IMO）（尽管它仍然使用java.NIO.charset.charset）：

public static String readFile(String file, String csName)
            throws IOException {
    Charset cs = Charset.forName(csName);
    return readFile(file, cs);
}

public static String readFile(String file, Charset cs)
            throws IOException {
    // No real need to close the BufferedReader/InputStreamReader
    // as they're only wrapping the stream
    FileInputStream stream = new FileInputStream(file);
    try {
        Reader reader = new BufferedReader(new InputStreamReader(stream, cs));
        StringBuilder builder = new StringBuilder();
        char[] buffer = new char[8192];
        int read;
        while ((read = reader.read(buffer, 0, buffer.length)) > 0) {
            builder.append(buffer, 0, read);
        }
        return builder.toString();
    } finally {
        // Potential issue here: if this throws an IOException,
        // it will mask any others. Normally I'd use a utility
        // method which would log exceptions and swallow them
        stream.close();
    }        
}

2008-11-28 19:56:35

同一主题上有一个变体，它使用for循环而不是while循环来限制行变量的范围。是否“更好”取决于个人品味。

for(String line = reader.readLine(); line != null; line = reader.readLine()) {
    stringBuilder.append(line);
    stringBuilder.append(ls);
}

2008-11-28 20:33:13

public static String slurp (final File file)
throws IOException {
    StringBuilder result = new StringBuilder();

    BufferedReader reader = new BufferedReader(new FileReader(file));

    try {
        char[] buf = new char[1024];

        int r = 0;

        while ((r = reader.read(buf)) != -1) {
            result.append(buf, 0, r);
        }
    }
    finally {
        reader.close();
    }

    return result.toString();
}

2010-02-08 19:51:12

Guava有一种类似于Willi aus Rohr提到的Commons IOUtils的方法：

import com.google.common.base.Charsets;
import com.google.common.io.Files;

// ...

String text = Files.toString(new File(path), Charsets.UTF_8);

由PiggyPiglet编辑文件#toString已弃用，将于2019年10月删除。而是使用Files.asCharSource（新文件（路径），StandardCharsets.UTF_8）.read（）；

奥斯卡·雷耶斯编辑

这是引用库中的（简化）基础代码：

InputStream in = new FileInputStream(file);
byte[] b  = new byte[file.length()];
int len = b.length;
int total = 0;

while (total < len) {
  int result = in.read(b, total, len - total);
  if (result == -1) {
    break;
  }
  total += result;
}

return new String( b , Charsets.UTF_8 );

编辑（Jonik）：以上内容与最近的Guava版本的源代码不匹配。有关当前源代码，请参阅com.google.common.io包中的类Files、CharStreams、ByteSource和CharSource。

2010-02-08 20:08:37

将文件读取为二进制文件并在末尾转换

public static String readFileAsString(String filePath) throws IOException {
    DataInputStream dis = new DataInputStream(new FileInputStream(filePath));
    try {
        long len = new File(filePath).length();
        if (len > Integer.MAX_VALUE) throw new IOException("File "+filePath+" too large, was "+len+" bytes.");
        byte[] bytes = new byte[(int) len];
        dis.readFully(bytes);
        return new String(bytes, "UTF-8");
    } finally {
        dis.close();
    }
}

2010-04-18 07:34:29

基于扫描仪的非常精简的解决方案：

Scanner scanner = new Scanner( new File("poem.txt") );
String text = scanner.useDelimiter("\\A").next();
scanner.close(); // Put this call in a finally block

或者，如果要设置字符集：

Scanner scanner = new Scanner( new File("poem.txt"), "UTF-8" );
String text = scanner.useDelimiter("\\A").next();
scanner.close(); // Put this call in a finally block

或者，使用trywithresources块，它将为您调用scanner.close（）：

try (Scanner scanner = new Scanner( new File("poem.txt"), "UTF-8" )) {
    String text = scanner.useDelimiter("\\A").next();
}

请记住，Scanner构造函数可以引发IOException。不要忘记导入java.io和java.util。

来源：Pat Niemeyer的博客

2011-09-16 20:02:17

如果是文本文件，为什么不使用apachecommons io？

它有以下方法

public static String readFileToString(File file) throws IOException

如果要将行作为列表，请使用

public static List<String> readLines(File file) throws IOException

2011-10-17 15:34:03

这一个使用RandomAccessFile.readFully方法，它似乎可以从JDK 1.0中获得！

public static String readFileContent(String filename, Charset charset) throws IOException {
    RandomAccessFile raf = null;
    try {
        raf = new RandomAccessFile(filename, "r");
        byte[] buffer = new byte[(int)raf.length()];
        raf.readFully(buffer);
        return new String(buffer, charset);
    } finally {
        closeStream(raf);
    }
} 


private static void closeStream(Closeable c) {
    if (c != null) {
        try {
            c.close();
        } catch (IOException ex) {
            // do nothing
        }
    }
}

2011-10-23 07:43:29

一个灵活的解决方案，使用Apache commons io中的IOUItils和StringWriter：

Reader input = new FileReader();
StringWriter output = new StringWriter();
try {
  IOUtils.copy(input, output);
} finally {
  input.close();
}
String fileContents = output.toString();

它适用于任何读取器或输入流（不仅仅是文件），例如从URL读取时。

2012-02-15 10:47:35

import java.nio.file.Files;

.......

 String readFile(String filename) {
            File f = new File(filename);
            try {
                byte[] bytes = Files.readAllBytes(f.toPath());
                return new String(bytes,"UTF-8");
            } catch (FileNotFoundException e) {
                e.printStackTrace();
            } catch (IOException e) {
                e.printStackTrace();
            }
            return "";
    }

2012-04-16 14:33:59

请注意，当使用fileInputStream.available（）时，返回的整数不必表示实际的文件大小，而是系统应该能够在不阻塞IO的情况下从流中读取的猜测字节数

public String readStringFromInputStream(FileInputStream fileInputStream) {
    StringBuffer stringBuffer = new StringBuffer();
    try {
        byte[] buffer;
        while (fileInputStream.available() > 0) {
            buffer = new byte[fileInputStream.available()];
            fileInputStream.read(buffer);
            stringBuffer.append(new String(buffer, "ISO-8859-1"));
        }
    } catch (FileNotFoundException e) {
    } catch (IOException e) { }
    return stringBuffer.toString();
}

应该考虑的是，这种方法不适用于UTF-8等多字节字符编码。

2013-03-15 09:09:56

我还不能评论其他条目，所以我就把它留在这里。

这里最好的答案之一(https://stackoverflow.com/a/326448/1521167):

private String readFile(String pathname) throws IOException {

File file = new File(pathname);
StringBuilder fileContents = new StringBuilder((int)file.length());
Scanner scanner = new Scanner(file);
String lineSeparator = System.getProperty("line.separator");

try {
    while(scanner.hasNextLine()) {        
        fileContents.append(scanner.nextLine() + lineSeparator);
    }
    return fileContents.toString();
} finally {
    scanner.close();
}
}

仍然有一个缺陷。它总是在字符串末尾添加换行符，这可能会导致一些奇怪的错误。我的建议是将其更改为：

    private String readFile(String pathname) throws IOException {
    File file = new File(pathname);
    StringBuilder fileContents = new StringBuilder((int) file.length());
    Scanner scanner = new Scanner(new BufferedReader(new FileReader(file)));
    String lineSeparator = System.getProperty("line.separator");

    try {
        if (scanner.hasNextLine()) {
            fileContents.append(scanner.nextLine());
        }
        while (scanner.hasNextLine()) {
            fileContents.append(lineSeparator + scanner.nextLine());
        }
        return fileContents.toString();
    } finally {
        scanner.close();
    }
}

2013-12-13 09:33:32

如果您需要字符串处理（并行处理），Java8有很棒的StreamAPI。

String result = Files.lines(Paths.get("file.txt"))
                    .parallel() // for parallel processing 
                    .map(String::trim) // to change line   
                    .filter(line -> line.length() > 2) // to filter some lines by a predicate                        
                    .collect(Collectors.joining()); // to join lines

JDK示例samples/lambda/BulkDataOperations中提供了更多示例，可以从Oracle Java SE 8下载页面下载

另一个单线示例

String out = String.join("\n", Files.readAllLines(Paths.get("file.txt")));

2014-10-29 08:51:58

如果您无权访问Files类，则可以使用本机解决方案。

static String readFile(File file, String charset)
        throws IOException
{
    FileInputStream fileInputStream = new FileInputStream(file);
    byte[] buffer = new byte[fileInputStream.available()];
    int length = fileInputStream.read(buffer);
    fileInputStream.close();
    return new String(buffer, 0, length, charset);
}

2015-01-06 18:52:59

在扫描程序后按Ctrl+F键，我认为也应该列出扫描程序解决方案。最容易阅读的方式如下：

public String fileToString(File file, Charset charset) {
  Scanner fileReader = new Scanner(file, charset);
  fileReader.useDelimiter("\\Z"); // \Z means EOF.
  String out = fileReader.next();
  fileReader.close();
  return out;
}

如果您使用Java 7或更高版本（您确实应该），请考虑使用try with资源，以使代码更易于阅读。不要再把任何东西乱丢了。但我认为这主要是一种文体选择。

我发布这篇文章主要是为了完成主义，因为如果您需要经常这样做，java.nio.file.file中应该有一些东西可以更好地完成这项工作。

我的建议是使用File#readAllBytes（Path）获取所有字节，并将其输入到新的String（byte[]字符集）中，以从中获取一个您可以信任的String。在你的一生中，Charset会对你很刻薄，所以现在就要小心这些东西。

其他人已经给出了代码和东西，我不想抢走他们的荣耀

2015-11-29 14:09:52

对于Java 7，这是我读取UTF-8文件的首选选项：

String content = new String(Files.readAllBytes(Paths.get(filename)), "UTF-8");

自Java7以来，JDK有了新的Java.nio.file API，它提供了许多快捷方式，因此简单的文件操作并不总是需要第三方库。

由于人们仍然对这个答案投赞成票，这里有一个在Java 11中引入的更好的解决方案：

String content = Files.readString(path);

2016-04-19 16:39:06

import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

Java 7

String content = new String(Files.readAllBytes(Paths.get("readMe.txt")), StandardCharsets.UTF_8);

Java 11

String content = Files.readString(Paths.get("readMe.txt"));

2016-10-28 07:04:17

在java8中，有一个新类

java.util.stream.stream流

流表示一系列元素，并支持对这些元素执行计算的不同类型的操作

阅读更多信息：

Oracle文档

这里有一个例子：

import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.stream.Stream;

public Class ReadFile{
  public  static String readFile(String filePath) {
 StringBuilder  stringBuilder = new StringBuilder();
    String ls = System.getProperty("line.separator");
        try {

            try (Stream<String> lines = Files.lines(Paths.get(filePath), StandardCharsets.UTF_8)) {
                for (String line : (Iterable<String>) lines::iterator) {


                      stringBuilder.append(line);
                      stringBuilder.append(ls);


                }
            }

        } catch (Exception e) {
            e.printStackTrace();
        }

      return stringBuilder.toString(); 


}

}

2016-12-06 12:00:40

使用此库时，只有一行：

String data = IO.from(new File("data.txt")).toString();

2016-12-10 15:55:40

使用代码：

File file = new File("input.txt");
BufferedInputStream bin = new BufferedInputStream(new FileInputStream(
                file));
byte[] buffer = new byte[(int) file.length()];
bin.read(buffer);
String fileStr = new String(buffer);

fileStr包含字符串形式的输出。

2017-01-16 06:53:39

您可以尝试扫描程序和文件类，几行解决方案

 try
{
  String content = new Scanner(new File("file.txt")).useDelimiter("\\Z").next();
  System.out.println(content);
}
catch(FileNotFoundException e)
{
  System.out.println("not found!");
}

2017-02-07 06:10:25

此外，如果您的文件恰好位于jar中，您也可以使用此选项：

public String fromFileInJar(String path) {
    try ( Scanner scanner 
            = new Scanner(getClass().getResourceAsStream(path))) {
        return scanner.useDelimiter("\\A").next();
    }
}

例如，如果您的jar是

my.jar/com/some/thing/a.txt

然后你想这样调用它：

String myTxt = fromFileInJar("/com/com/thing/a.txt");

2017-02-15 21:15:44

在一行（Java 8）中，假设您有一个Reader：

String sMessage = String.join("\n", reader.lines().collect(Collectors.toList()));

2017-03-15 02:39:05

根据@erickson的回答，您可以使用：

public String readAll(String fileName) throws IOException {
    List<String> lines = Files.readAllLines(new File(fileName).toPath());
    return String.join("\n", lines.toArray(new String[lines.size()]));
}

2018-04-01 13:34:54

收集了从磁盘或网络中以字符串形式读取文件的所有可能方法。

Guava:Google使用类资源，文件静态字符集字符集=com.google.common.base.Charsets.UTF_8；公共静态字符串guava_ServerFile（URL URL）引发IOException{return Resources.toString（url，charset）；}公共静态字符串guava_DiskFile（文件文件）引发IOException{return Files\toString（文件，字符集）；}

APACHE-COMMONS IO使用IOUItils、FileUtils类静态字符集编码=org.apache.mons.io.Charsets.UTF_8；公共静态字符串commons_IOUtils（URL URL）引发IOException{java.io.InputStream in=url.openStream（）；尝试{return IOUtils.toString（in，编码）；}最后{IOUItils.close安静（in）；}}公共静态字符串commons_FileUtils（文件文件）引发IOException{return FileUtils.readFileToString（文件，编码）；/*List<String>lines=FileUtils.readLines（文件名，编码）；return lines.stream（）.collector（Collectors.joining（“\n”））*/}

使用流API的Java 8 BufferReader公共静态字符串streamURL_Buffer（URL URL）引发IOException{java.io.InputStream source=url.openStream（）；BufferedReader读取器=新的BufferedReader（新的InputStreamReader（源））；//List<String>lines=reader.lines（）.collector（Collectors.toList（））；return reader.line（）.collector（Collectors.joining（System.lineSeparator（）））；}公共静态字符串streamFile_Buffer（文件文件）引发IOException{BufferedReader读取器=新的BufferedReader（新的FileReader（文件））；return reader.line（）.collector（Collectors.joining（System.lineSeparator（）））；}

带有正则表达式\A的扫描程序类。其匹配输入的开始。静态字符串字符集名称=java.nio.charset.StandardCharsets.UTF_8.toString（）；公共静态字符串streamURL_Scanner（URL URL）引发IOException{java.io.InputStream source=url.openStream（）；Scanner Scanner=新扫描仪（源，charsetName）。使用分隔符（“\\A”）；return scanner.hasNext（）？scanner.next（）：“”；}公共静态字符串streamFile_Scanner（文件文件）引发IOException{Scanner Scanner=新扫描仪（文件，charsetName）。使用分隔符（“\\A”）；return scanner.hasNext（）？scanner.next（）：“”；}

Java 7（Java.nio.file.Files.readAllBytes）公共静态字符串getDiskFile_Java7（文件文件）引发IOException{byte[]readAllBytes=java.nio.file.Files.readAllBytes（Paths.get（file.getAbsolutePath（）））；返回新字符串（readAllBytes）；}

BufferedReader使用InputStreamReader。公共静态字符串getDiskFile_Lines（文件文件）引发IOException{StringBuffer text=新StringBuffer（）；FileInputStream fileStream=新的FileInputStream（文件）；BufferedReader br=新的BufferedReader（新的InputStreamReader（fileStream））；for（字符串行；（行=br.readLine（））！=null；）text.append（line+System.lineSeparator（））；return text.toString（）；}

使用main方法访问上述方法的示例。

public static void main(String[] args) throws IOException {
    String fileName = "E:/parametarisation.csv";
    File file = new File( fileName );

    String fileStream = commons_FileUtils( file );
            // guava_DiskFile( file );
            // streamFile_Buffer( file );
            // getDiskFile_Java7( file );
            // getDiskFile_Lines( file );
    System.out.println( " File Over Disk : \n"+ fileStream );


    try {
        String src = "https://code.jquery.com/jquery-3.2.1.js";
        URL url = new URL( src );

        String urlStream = commons_IOUtils( url );
                // guava_ServerFile( url );
                // streamURL_Scanner( url );
                // streamURL_Buffer( url );
        System.out.println( " File Over Network : \n"+ urlStream );
    } catch (MalformedURLException e) {
        e.printStackTrace();
    }
}

@see

将InputStream转换为字符串的方法

2018-05-29 10:09:02

使用JDK 8或更高版本：

未使用外部库

您可以从文件内容创建一个新的String对象（使用java.nio.file包中的类）：

public String readStringFromFile(String filePath) throws IOException {
    String fileContent = new String(Files.readAllBytes(Paths.get(filePath)));
    return fileContent;
}

2018-06-21 05:46:40

自JDK 11以来：

String file = ...
Path path = Paths.get(file);
String content = Files.readString(path);
// Or readString(path, someCharset), if you need a Charset different from UTF-8

2018-06-26 14:26:44

用户java.nio.Files读取文件的所有行。

public String readFile() throws IOException {
        File fileToRead = new File("file path");
        List<String> fileLines = Files.readAllLines(fileToRead.toPath());
        return StringUtils.join(fileLines, StringUtils.EMPTY);
}

2018-10-26 06:38:33

Scanner sc = new Scanner(new File("yourFile.txt"));
sc.useDelimiter("\\Z");

String s = sc.next();

2022-02-04 18:39:10

纯kotlin代码，无依赖关系

适用于所有android版本

val fileAsString = file.bufferedReader().use { it.readText() }

2022-11-17 18:55:14

使用前面评论中提到的BufferedReader，但这种方式更可读：

String FILE_PATH = "filepath.txt";

try (FileReader fileReader = new FileReader(FILE_PATH)) {
    BufferedReader fileBufferReader = new BufferedReader(fileReader);
    String text = fileBufferReader.lines()
        .collect(Collectors.joining(System.lineSeparator()));
    System.out.println(text);
} catch (IOException e) {
    // exception handling
}

2023-01-24 02:38:59

如何从文件内容创建Java字符串？

推荐文章

最新文章

标签