<![CDATA[]]>在XML中的含义是什么?

我经常在XML文件中发现这个奇怪的CDATA标签:

<![CDATA[some stuff]]>

我观察到这个CDATA标记总是出现在开头，然后跟着一些东西。

但有时用，有时不用。我假设这是为了标记一些东西是“数据”，将被插入之后。但是什么是数据呢?我用XML标记写的东西不是某种数据吗?

其中包含的数据不会被解析为XML，因此不需要是有效的XML，或者可以包含看似XML但实际上不是的元素。

2010-05-06 20:23:19

CDATA代表字符数据，这意味着这些字符串之间的数据包括可以解释为XML标记的数据，但不应该这样做。

CDATA和注释之间的主要区别是:

正如Richard指出的，CDATA仍然是文档的一部分，而注释不是。在CDATA中不能包含字符串]]> (cend)，而在注释中—是无效的。在注释中不能识别参数实体引用。

这意味着从一个格式良好的文档中给出以下四个XML片段:

<!ENTITY MyParamEntity "Has been expanded">

<!--
Within this comment I can use ]]>
and other reserved characters like <
&, ', and ", but %MyParamEntity; will not be expanded
(if I retrieve the text of this node it will contain
%MyParamEntity; and not "Has been expanded")
and I can't place two dashes next to each other.
-->

<![CDATA[
Within this Character Data block I can
use double dashes as much as I want (along with <, &, ', and ")
*and* %MyParamEntity; will be expanded to the text
"Has been expanded" ... however, I can't use
the CEND sequence. If I need to use CEND I must escape one of the
brackets or the greater-than sign using concatenated CDATA sections.
]]>

<description>An example of escaped CENDs</description>
<!-- This text contains a CEND ]]> -->
<!-- In this first case we put the ]] at the end of the first CDATA block
     and the > in the second CDATA block -->
<data><![CDATA[This text contains a CEND ]]]]><![CDATA[>]]></data>
<!-- In this second case we put a ] at the end of the first CDATA block
     and the ]> in the second CDATA block -->
<alternative><![CDATA[This text contains a CEND ]]]><![CDATA[]>]]></alternative>

2010-05-06 20:23:37

它用于包含可能被视为xml的数据，因为它包含某些字符。

这样，里面的数据将被显示，但不会被解释。

2010-05-06 20:23:52

通常用于在XML文档中嵌入自定义数据，如图片或声音数据。

2010-05-06 20:26:29

CDATA节是“元素内容的一段，它被标记为仅供解析器解释为字符数据，而不是标记”。

在语法上，它的行为类似于注释:

<exampleOfAComment>
<!--
    Since this is a comment
    I can use all sorts of reserved characters
    like > < " and &
    or write things like
    <foo></bar>
    but my document is still well-formed!
-->
</exampleOfAComment>

．.．但它仍然是文件的一部分:

<exampleOfACDATA>
<![CDATA[
    Since this is a CDATA section
    I can use all sorts of reserved characters
    like > < " and &
    or write things like
    <foo></bar>
    but my document is still well formed!
]]>
</exampleOfACDATA>

尝试将以下内容保存为.xhtml文件(不是.html)，并使用FireFox(不是Internet Explorer)打开它，以查看注释和CDATA部分之间的差异;当你在浏览器中查看文档时，注释不会出现，而CDATA部分将:

<?xml version="1.0" encoding="UTF-8" standalone="no" ?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en" >
<head>
<title>CDATA Example</title>
</head>
<body>

<h2>Using a Comment</h2>
<div id="commentExample">
<!--
You won't see this in the document
and can use reserved characters like
< > & "
-->
</div>

<h2>Using a CDATA Section</h2>
<div id="cdataExample">
<![CDATA[
You will see this in the document
and can use reserved characters like
< > & "
]]>
</div>

</body>
</html>

使用CDATA节需要注意的一点是，它们没有编码，因此无法在其中包含字符串]]>。任何包含]]>的字符数据都必须-据我所知-是一个文本节点。同样地，从DOM操作的角度来看，你不能创建一个包含[]>的CDATA节:

var myEl = xmlDoc.getElementById("cdata-wrapper");
myEl.appendChild(xmlDoc.createCDATASection("This section cannot contain ]]>"));

这个DOM操作代码要么抛出一个异常(在Firefox中)，要么导致一个结构糟糕的XML文档:http://jsfiddle.net/9NNHA/

2010-05-06 20:35:12

CDATA代表字符数据。您可以使用它来转义某些字符，否则这些字符将被视为常规XML。其中的数据将不会被解析。例如，如果你想传递一个包含&的URL，你可以使用CDATA来实现。否则，您将得到一个错误，因为它将被解析为常规XML。

2012-05-10 04:52:52

Cdata是您可能希望传递给xml解析器但仍然不解释为xml的数据。

Say for eg :- You have an xml which has encapsulates question/answer object . Such open fields can have any data which does not strictly fall under basic data type or xml defined custom data types. Like --Is this a correct tag for xml comment ? .-- You may have a requirement to pass it as it is without being interpreted by the xml parser as another child element. Here Cdata comes to your rescue . By declaring as Cdata you are telling the parser don't treat the data wrapped as an xml (though it may look like one )

2013-11-19 19:12:02

一个大的用例:你的xml包含一个程序，作为数据(例如Java的网页教程)。在这种情况下，您的数据包含大量字符，其中包括'&'和'<'，但这些字符并不是xml字符。

比较:

<example-code>
while (x &lt; len &amp;&amp; !done) {
    print( &quot;Still working, &apos;zzz&apos;.&quot; );
    ++x;
    }
</example-code>

with

<example-code><![CDATA[
while (x < len && !done) {
    print( "Still working, 'zzzz'." );
    ++x;
    }
]]></example-code>

Especially if you are copy/pasting this code from a file (or including it, in a pre-processor), it's nice to just have the characters you want in your xml file, w/o confusing them with XML tags/attributes. As @paary mentioned, other common uses include when you're embedding URLs that contain ampersands. Finally, even if the data only contains a few special characters but the data is very very long (the text of a chapter, say), it's nice to not have to be en/de-coding those few entities as you edit your xml file.

(我怀疑所有与评论的比较都有点误导/没有帮助。)

2014-05-28 13:26:19

另一个例子是:

如果你有一个RSS提要(xml文档)，并且想在描述的显示中包含一些基本的HTML编码，你可以使用CData对它进行编码:

<item>
  <title>Title of Feed Item</title>
  <link>/mylink/article1</link>
  <description>
    <![CDATA[
      <p>
      <a href="/mylink/article1"><img style="float: left; margin-right: 5px;" height="80" src="/mylink/image" alt=""/></a>
      Author Names
      <br/><em>Date</em>
      <br/>Paragraph of text describing the article to be displayed</p>
    ]]>
  </description>
</item>

RSS阅读器拉入描述并在CDATA中呈现HTML。

注意，不是所有的HTML标签都有效——我认为这取决于你使用的RSS阅读器。

解释一下为什么这个例子使用CData(而不是适当的pubData和dc:creator标签):这是用于使用RSS小部件的网站显示，我们没有真正的格式控制。

这使我们能够指定所包含图像的高度和位置，正确地格式化作者姓名和日期，等等，而不需要一个新的小部件。这也意味着我可以编写脚本，而不必手动添加它们。

2014-09-23 15:44:32

从维基百科:

在XML文档或外部解析实体中，CDATA节是一个元素内容的一部分，标记为供解析器解释仅作为字符数据，而不是标记。 http://en.wikipedia.org/wiki/CDATA

因此:CDATA中的文本被解析器视为字符，而不是XML节点。

2015-01-15 14:49:21

当我的xml元素需要存储HTML代码时，我曾经不得不使用CDATA。类似的

<codearea>
  <![CDATA[ 
  <div> <p> my para </p> </div> 
  ]]>
</codearea>

CDATA意味着它会忽略任何可以被解释为XML标签的字符，比如<和>等。

2015-05-14 04:28:52

它转义一个不能像往常一样传递给XML的字符串:

例子:

字符串中包含“&”。

你不能:

<FL val="Company Name">Dolce & Gabbana</FL>

因此，您必须使用CDATA:

<FL val="Company Name"> <![CDATA["Dolce & Gabbana"]]> </FL>

2019-03-25 03:28:39

注意，只有在直接将文本放入XML文本文件时才需要CDATA结构。

也就是说，只有在手动输入或以编程方式直接构建XML文本时才需要使用CDATA。

使用DOM处理器API或SimpleXML输入的任何文本都将自动转义，以防止违反XML内容规则。

尽管如此，有时使用CDATA可以减少原本由所有实体编码产生的文本大小，例如样式标记中的css或脚本标记中的javascript，其中许多语言构造使用HTML|XML中的字符，如<和>。

2020-05-06 02:11:17

<![CDATA[]]>在XML中的含义是什么?

推荐文章

最新文章

标签