ro  fr  en  es  pt  ar  zh  hi  de  ru
ART 2.0 ART 3.0 ART 4.0 ART 5.0 ART 6.0 Pinterest

Nigute wakora ibikoresho byateguwe hamwe na powershell na regex kugirango usimbuze html tags (parsing)

On Iunie 16, 2021, in Python Scripts Examples, by Neculai Fantanaru

Urashobora kureba kode yuzuye hano:HTTPS: // Passatbin .com / x Gn QJ Qs7

Urugero rwa kode ya HTML ruzahindurwa hamwe na kode yububasha. Gukoporora inyandiko yavuzwe haruguru kuri dosiye .html, uyikize ahantuC: \ Ububiko1

   

 xmlns="http://www.w3.org/1999/xhtml" dir="ltr" lang="ro">

Nigute wakora ibikoresho byateguwe hamwe nububasha na Regex kugirango usimbuze HTML Tagi (Parsing)
 rel="canonical" href="https://MY-WEBSITE.COM" />
 name="description" content="I LOVE HTML and CSS"/>

 name="keywords" content="abordarea frontala a lucrurilor neelucidate"/>
 name="abstract" content="My laptop works just fine"/>
 name="Subject" content="I think I need a new car."/>
 property="og:url" content="https://otherwebsite.com"/>
 property="og:title" content="Nobody is here?" />
 property="og:description" content="Dance is my passion."/>





Kode ya Powershell hepfo izakoporora ibiri muri tagi ya html kurindi tagi mugusenya amakuru. Ukeneye gusa kuzuza tagina

$sourcedir = "C:\Folder1\"
$resultsdir = "C:\Folder1\"

Get-ChildItem -Path $sourcedir -Filter *.html | ForEach-Object {
    $content = Get-Content -Path $_.FullName -Raw
	
# Copy the content of the tag 
	
    $replaceValue = (Select-String -InputObject $content -Pattern '(?<=).Matches.Value
    $content = $content -replace '(?<=,$replaceValue
    $content = $content -replace '(?<="@id": ").*(")',$replaceValue
	
# Copy the content of the tag   in the tags ABSTRACT, SUBJECT, OG:TITLE, HEADLINE, KEYWORDS            #</span>
	
    <span style="color: #996633">$replaceValue</span> = (<span style="color: #007020">Select-String</span> -InputObject <span style="color: #996633">$content</span> -Pattern <span style="background-color: #fff0f0">'(?<=<title>Nigute wakora ibikoresho byateguwe hamwe nububasha na Regex kugirango usimbuze HTML Tagi (Parsing))').Matches.Value
    $content = $content -replace '(?<=,$replaceValue
    $content = $content -replace '(?<=,$replaceValue
    $content = $content -replace '(?<=,$replaceValue
    $content = $content -replace '(?<=,$replaceValue
    $content = $content -replace '(?<="headline": ").+(?=")',$replaceValue
    $content = $content -replace '(?<="keywords": "Nigute wakora ibikoresho byateguwe hamwe nububasha na Regex kugirango usimbuze HTML Tagi (Parsing)",
	
# Copy the content of the tag $replaceValue = (Select-String -InputObject $content -Pattern '(?<=).Matches.Value
    $content = $content -replace '(?<=,$replaceValue
    $content = $content -replace '(?<="description": "Nigute wakora ibikoresho byateguwe hamwe nububasha na Regex kugirango usimbuze HTML Tagi (Parsing) |  Neculai Fantanaru",
  
   Set-Content -Path $resultsdir\$($_.name) $content
}
  

Bidashoboka. Hano hari imvugo ya regex izahindura "Ijambo ryibanze" kuri page ya HTML, ongeraho koma nyuma ya buri jambo.

Koresha hamwe na TIRESPad ++ -> CTR + F -> Reba: Imvugo isanzwe

SEARCH: (?s)<title>.*?<\/title>.*?<meta\x20name="keywords"\x20content="\K(\w+)|\G[^\w\r\n]+(\w+)  
REPLACE BY:  ?1\l\1:,\x20\l\2

That's all folks.

If you like my code, please SHARE IT

Urashobora kandi kubona verisiyo ya kode muriPython


Latest articles accessed by readers:

  1. An Eye To See And A Mind To Understand
  2. Turn Towards Me With An Eye Full Of Your Own Gaze
  3. The Snapshot Of Magic In God's Universe
  4. Rhythm Of My Heart

Donate via Paypal

Alternate Text

RECURRENT DONATION

Donate monthly to support
the NeculaiFantanaru.com project

SINGLE DONATION

Donate the desired amount to support
the NeculaiFantanaru.com project

Donate by Bank Transfer

Account Ron: RO34INGB0000999900448439

Open account at ING Bank

Join The Neculai Fantanaru Community



* Note: If you want to read all my articles in real time, please check the romanian version !

decoration
About | Site Map | Partners | Feedback | Terms & Conditions | Privacy | RSS Feeds
© Neculai Fântânaru - All rights reserved