XML属性vs XML元素

在工作中，我们被要求创建XML文件来将数据传递给另一个脱机应用程序，然后该应用程序将创建第二个XML文件来传递回去，以更新我们的一些数据。在这个过程中，我们一直在与另一个应用程序的团队讨论XML文件的结构。

我提出的样本基本上是这样的:

<INVENTORY>
   <ITEM serialNumber="something" location="something" barcode="something">
      <TYPE modelNumber="something" vendor="something"/> 
   </ITEM>
</INVENTORY>

另一个团队说，这不是行业标准，属性应该只用于元数据。他们建议:

<INVENTORY>
   <ITEM>
      <SERIALNUMBER>something</SERIALNUMBER>
      <LOCATION>something</LOCATION>
      <BARCODE>something</BARCODE>
      <TYPE>
         <MODELNUMBER>something</MODELNUMBER>
         <VENDOR>something</VENDOR>
      </TYPE>
   </ITEM>
</INVENTORY>

我建议使用第一个方法的原因是，创建的文件的大小要小得多。在传输过程中，文件中将有大约80000个项目。事实上，他们的建议比我的建议大三倍。我搜索了提到的神秘的“行业标准”，但我能找到的最接近的是XML属性应该只用于元数据，但争论的焦点是什么才是实际的元数据。

在冗长的解释(抱歉)之后，如何确定什么是元数据，以及在设计XML文档的结构时，如何决定何时使用属性或元素?

当前回答

这个问题没有统一的答案(我曾大量参与W3C规范的创建)。XML可以用于许多目的——类似文本的文档、数据和声明性代码是其中最常见的三种。我也经常把它用作数据模型。在这些应用程序的某些方面，属性更常见，而在其他方面，子元素更自然。各种工具的特性也使它们的使用变得更容易或更困难。

XHTML is one area where attributes have a natural use (e.g. in class='foo'). Attributes have no order and this may make it easier for some people to develop tools. OTOH attributes are harder to type without a schema. I also find namespaced attributes (foo:bar="zork") are often harder to manage in various toolsets. But have a look at some of the W3C languages to see the mixture that is common. SVG, XSLT, XSD, MathML are some examples of well-known languages and all have a rich supply of attributes and elements. Some languages even allow more-than-one-way to do it, e.g.

<foo title="bar"/>;

<foo>
  <title>bar</title>;
</foo>;

注意，它们在语法上是不等价的，需要处理工具的显式支持)

我的建议是，查看与您的应用程序最接近的领域的常见实践，并考虑您可能希望应用什么工具集。

最后，确保将名称空间与属性区分开来。一些XML系统(例如Linq)在API中将名称空间表示为属性。在我看来，这很丑陋，而且可能令人困惑。

2009-07-05 11:58:27

其他回答

只是对一些不好的信息做一些更正:

属性可以包含任何字符数据。< > & " '需要转义到<>,“;事情就让它,,,分别。如果您使用XML库，它将为您处理这些问题。

一个属性可以包含二进制数据，如图像，如果你真的需要，只需通过base64编码并将其设置为data: URL。

@feenster:在IDS或NAMES的情况下，属性可以包含空格分隔的多个项，其中包括数字。吹毛求疵，但这最终可以节省空间。

使用属性可以保持XML与JSON的竞争力。参见脂肪加价:每次减少一卡路里的脂肪加价神话。

2009-07-23 21:38:28

这是个价值百万美元的问题!

首先，现在不要太担心性能。您会惊讶于优化的XML解析器解析XML的速度有多快。更重要的是，您对未来的设计是什么:随着XML的发展，您将如何保持松耦合和互操作性?

更具体地说，您可以使元素的内容模型更加复杂，但扩展属性则更加困难。

2008-08-29 01:24:43

“XML”代表“可扩展标记语言”。标记语言意味着数据是文本，用关于结构或格式的元数据标记。

XHTML是按预期方式使用XML的一个例子:

<p><span lang="es">El Jefe</span> insists that you
    <em class="urgent">MUST</em> complete your project by Friday.</p>

这里，元素和属性之间的区别很明显。文本元素显示在浏览器中，属性是关于如何显示它们的指令(尽管有一些标记不是这样工作的)。

当XML不是作为标记语言，而是作为数据序列化语言使用时，就会出现混淆，在这种情况下，“数据”和“元数据”之间的区别更加模糊。因此，元素和属性之间的选择或多或少是任意的，除非不能用属性表示(参见feenster的回答)。

2010-06-24 04:02:05

在我的模式设计中，我使用了以下关于属性和元素的指导原则:

Use elements for long running text (usually those of string or normalizedString types) Do not use an attribute if there is grouping of two values (e.g. eventStartDate and eventEndDate) for an element. In the previous example, there should be a new element for "event" which may contain the startDate and endDate attributes. Business Date, DateTime and numbers (e.g. counts, amount and rate) should be elements. Non-business time elements such as last updated, expires on should be attributes. Non-business numbers such as hash codes and indices should be attributes.* Use elements if the type will be complex. Use attributes if the value is a simple type and does not repeat. xml:id and xml:lang must be attributes referencing the XML schema Prefer attributes when technically possible.

属性的优先级是它提供了以下内容:

唯一的(该属性不能出现多次) 顺序不重要上面的属性是可继承的(这是“所有”内容模型在当前模式语言中不支持的) 额外的好处是它们不那么冗长，占用的带宽也更少，但这并不是更喜欢属性而不是元素的真正原因。

我在技术上可能的情况下添加了属性，因为有时不可能使用属性。例如，属性集选择。例如，对于当前的模式语言，使用(startDate和endDate) xor (startTS和endTS)是不可能的

如果XML Schema开始允许限制或扩展“所有”内容模型，那么我可能会放弃它

2010-12-16 07:33:25

这很大程度上是个人偏好的问题。在可能的情况下，我使用元素进行分组，并使用属性进行数据，因为我认为这比替代方法更紧凑。

例如，我更喜欢.....

<?xml version="1.0" encoding="utf-8"?>
<data>
    <people>
         <person name="Rory" surname="Becker" age="30" />
        <person name="Travis" surname="Illig" age="32" />
        <person name="Scott" surname="Hanselman" age="34" />
    </people>
</data>

.．.而不是……

<?xml version="1.0" encoding="utf-8"?>
<data>
    <people>
        <person>
            <name>Rory</name>
            <surname>Becker</surname>
            <age>30</age>
        </person>
        <person>
            <name>Travis</name>
            <surname>Illig</surname>
            <age>32</age>
        </person>
        <person>
            <name>Scott</name>
            <surname>Hanselman</surname>
            <age>34</age>
        </person>
    </people>
</data>

然而，如果我的数据不容易在20-30个字符内表示，或者包含许多引号或其他需要转义的字符，那么我会说，是时候拆分元素了……可能使用CData块。

<?xml version="1.0" encoding="utf-8"?>
<data>
    <people>
        <person name="Rory" surname="Becker" age="30" >
            <comment>A programmer whose interested in all sorts of misc stuff. His Blog can be found at http://rorybecker.blogspot.com and he's on twitter as @RoryBecker</comment>
        </person>
        <person name="Travis" surname="Illig" age="32" >
            <comment>A cool guy for who has helped me out with all sorts of SVn information</comment>
        </person>
        <person name="Scott" surname="Hanselman" age="34" >
            <comment>Scott works for MS and has a great podcast available at http://www.hanselminutes.com </comment>
        </person>
    </people>
</data>

2008-09-30 09:23:18

XML属性vs XML元素

推荐文章

最新文章

标签