从文本JavaScript中删除HTML

有没有一种简单的方法可以在JavaScript中获取一个html字符串并去掉html？

当前回答

我认为最简单的方法就是像上面提到的那样使用正则表达式。虽然没有理由使用一堆。尝试：

stringWithHTML = stringWithHTML.replace(/<\/?[a-z][a-z0-9]*[^<>]*>/ig, "");

2011-01-10 05:40:34

其他回答

将HTML转换为纯文本电子邮件，保持超链接（a href）完整

hypoxide发布的上述功能运行良好，但我所追求的是基本上转换在WebRichText编辑器（例如FCKEditor）中创建的HTML并清除所有HTML，但保留所有链接，因为我希望HTML和纯文本版本都能帮助创建STMP电子邮件的正确部分（HTML和纯文字）。

经过长时间的谷歌搜索，我和我的同事使用Javascript中的正则表达式引擎得出了这个结论：

str='this string has <i>html</i> code i want to <b>remove</b><br>Link Number 1 -><a href="http://www.bbc.co.uk">BBC</a> Link Number 1<br><p>Now back to normal text and stuff</p>
';
str=str.replace(/<br>/gi, "\n");
str=str.replace(/<p.*>/gi, "\n");
str=str.replace(/<a.*href="(.*?)".*>(.*?)<\/a>/gi, " $2 (Link->$1) ");
str=str.replace(/<(?:.|\s)*?>/g, "");

str变量的开头如下：

this string has <i>html</i> code i want to <b>remove</b><br>Link Number 1 -><a href="http://www.bbc.co.uk">BBC</a> Link Number 1<br><p>Now back to normal text and stuff</p>

然后在代码运行之后，它看起来像这样：-

this string has html code i want to remove
Link Number 1 -> BBC (Link->http://www.bbc.co.uk)  Link Number 1


Now back to normal text and stuff

正如你所看到的，所有HTML都被删除了，链接也被保留了下来，超链接文本仍然完好无损。此外，我还将＜p＞和＜br＞标记替换为\n（换行符），以便保留某种视觉格式。

更改链接格式（例如，BBC（链接->http://www.bbc.co.uk)）只需编辑$2（Link->$1），其中$1是href URL/URI，$2是超链接文本。由于链接直接位于纯文本正文中，大多数SMTP邮件客户端都会转换这些链接，以便用户能够单击它们。

希望你觉得这很有用。

2009-08-06 08:30:22

const-htmlParser=new DOMParser（）.parseFromString（“<h6>用户<p>名称</p></h6>”，'text/html'）；const textString=htmlParser.body.textContent；console.log（textString）

2022-07-14 06:25:29

对于转义字符，也可以使用模式匹配：

myString.replace(/((&lt)|(<)(?:.|\n)*?(&gt)|(>))/gm, '');

2016-11-08 10:44:34

myString.replace(/<[^>]*>?/gm, '');

2009-05-04 22:42:52

我只需要去掉＜a＞标签，并用链接的文本替换它们。

这似乎很有效。

htmlContent= htmlContent.replace(/<a.*href="(.*?)">/g, '');
htmlContent= htmlContent.replace(/<\/a>/g, '');

2013-08-19 16:12:49

从文本JavaScript中删除HTML

推荐文章

最新文章

标签