您可以在此处查看完整代码:HTTPS://帕萨特斌.com/X GN QJ QS7
将使用PowerShell代码修改的HTML页面的代码示例。 将上述文本复制到.html文件,将其保存到位置C:\ folder1
xmlns= "http://www.w3.org/1999/xhtml" dir= "ltr" lang= "ro" >
如何使用PowerShell和Regex创建批处理处理器来替换HTML标记(解析)| Neculai Fantanaru.
rel= "canonical" href= "https://MY-WEBSITE.COM" />
name= "description" content= "I LOVE HTML and CSS" />
name= "keywords" content= "abordarea frontala a lucrurilor neelucidate" />
name= "abstract" content= "My laptop works just fine" />
name= "Subject" content= "I think I need a new car." />
property= "og:url" content= "https://otherwebsite.com" />
property= "og:title" content= "Nobody is here?" />
property= "og:description" content= "Dance is my passion." />
下面的PowerShell代码将通过解析数据将HTML标记的内容复制到其他标记。 您只需要填写标签 四
$sourcedir = "C:\Folder1\"
$resultsdir = "C:\Folder1\"
Get-ChildItem -Path $sourcedir -Filter *.html | ForEach -Object {
$content = Get-Content -Path $_ .FullName -Raw
# Copy the content of the tag
$replaceValue = (Select-String -InputObject $content -Pattern '(?<= ).Matches.Value
$content = $content -replace '(?<= ,$replaceValue
$content = $content -replace '(?<="@id": ").*(")' ,$replaceValue
# Copy the content of the tag in the tags ABSTRACT, SUBJECT, OG:TITLE, HEADLINE, KEYWORDS #
$replaceValue = (Select-String -InputObject $content -Pattern '(?<=如何使用PowerShell和Regex创建批处理处理器来替换HTML标记(解析)| Neculai Fantanaru. )' ).Matches.Value
$content = $content -replace '(?<= ,$replaceValue
$content = $content -replace '(?<= ,$replaceValue
$content = $content -replace '(?<= ,$replaceValue
$content = $content -replace '(?<= ,$replaceValue
$content = $content -replace '(?<="headline": ").+(?=")' ,$replaceValue
$content = $content -replace '(?<="keywords": "如何使用PowerShell和Regex创建批处理处理器来替换HTML标记(解析)| Neculai Fantanaru.",
# Copy the content of the tag $replaceValue = (Select-String -InputObject $content -Pattern '(?<= ).Matches.Value
$content = $content -replace '(?<= ,$replaceValue
$content = $content -replace '(?<="description": "如何使用PowerShell和Regex创建批处理处理器来替换HTML标记(解析)| Neculai Fantanaru.",
Set-Content -Path $resultsdir \$($_ .name) $content
}
可选的。 这是一个正则表达式表达式,它将在HTML页面中更改“关键字”标记,在每个单词后添加逗号。
使用Notepad ++ - > Ctr + F - >检查:正则表达式
SEARCH: (? s)< title>.*?<\/ title>.*?< meta\ x20name= "keywords" \ x20content= " \ K(\ w+)|\ G[^\ w\ r\ n]+(\ w+)
REPLACE BY: ? 1 \ l\ 1 :,\ x20\ l\ 2
That's all folks.
If you like my code, please SHARE IT
您还可以查看代码版本Python
Latest articles accessed by readers:
An Eye To See And A Mind To Understand
Turn Towards Me With An Eye Full Of Your Own Gaze
The Snapshot Of Magic In God's Universe
Rhythm Of My Heart
* Note: If you want to read all my articles in real time, please check the romanian version !