ro  fr  en  es  pt  ar  zh  hi  de  ru
ART 2.0 ART 3.0 ART 4.0 ART 5.0 ART 6.0 Pinterest

Parsing Python: Ahoana ny fomba handikana fizarana data avy amin'ny rakitra HTML iray amin'ireo rakitra HTML hafa

June 20, 2021, in Leadership and Attitude, by Neculai Fantanaru

Azonao atao ny mijery ny kaody feno:Https: // passatin.com / a sy xm cg3

Ireto takelaka manaraka ireto dia tsy maintsy ao amin'ny rakitra HTML ao amin'ny Folder A sy ny rakitra HTML hafa ao amin'ny Folder B. Ny kaody python dia hanaratsy ireto marika manaraka ireto:HTML novokarina tamin'ny fampiasana hilite.me

 , Anarana ="Famaritana" Content =""/>, avy amin'nyny, avy amin'nyny, avy amin'nyny

This is the structure of the files. Both the file in Folder A must have the same html tags, respectively the same commented sections. From the html file in Folder A all these sections will be copied to the files in Folder B.

Zava-dehibe: The content of the tags and the content of the comments (Text Text) are different in the file in Folder A compared to the html files in Folder B. This is also the idea. I want the contents of these tags in Folder A to replace the contents of the same tags in the files in Folder B.

E.g. From the example.html file (from Folder a) the following sections will be copied to the one.html and two.html files (from Folder b)

   

Parsing Python: Ahoana ny fomba handikana fizarana data avy amin'ny rakitra HTML iray amin'ny rakitra HTML hafa

 name="description" content="I LOVE HTML and CSS"/>


	Text Text





	Text Text




 Text Text


Kaody python:

import requests
import re

# The folder that contains the file you want to parse
english_folder1 = r"d:\Downloads\A"

# The folder with the files you want to change
english_folder2 = r"d:\Downloads\B"

# The file you want to make parsing
file_to_parse_from = 'example.html'

extension_file = ".html"

use_parse_folder = True

import os

en1_directory = os.fsencode(english_folder1)
en2_directory = os.fsencode(english_folder2)

print('Going through english folder')
for file in os.listdir(en2_directory):
    filename = os.fsdecode(file)
    print(filename)
    if filename == 'y_key_e479323ce281e459.html' or filename == 'directory.html':
        continue
    if filename.endswith(extension_file):
        with open(os.path.join(english_folder1, file_to_parse_from), encoding='utf-8') as html:
            html = html.read()

            try:
                with open(os.path.join(english_folder2, filename), encoding='utf-8') as en_html:
                    en_html = en_html.read()
                    
                    title = re.search('Parsing Python: Ahoana ny fomba handikana fizarana data avy amin'ny rakitra HTML iray amin'ny rakitra HTML hafa', html)[0]
                    meta = re.search(']
                    comment_body = re.search('.+', html, flags=re.DOTALL)[0]

                    try:
                        comment_body2 = re.search('.+', html, flags=re.DOTALL)[0]
                        en_html = re.sub('.+', comment_body2, en_html, flags=re.DOTALL)
                    except:
                        pass

                    try:
                        comment_body3 = re.search('.+', html, flags=re.DOTALL)[0]
                        en_html = re.sub('.+', comment_body3, en_html, flags=re.DOTALL)
                    except:
                        pass
                    
                    en_html = re.sub('.+', comment_body, en_html, flags=re.DOTALL)
                    en_html = re.sub(', meta, en_html)
                    en_html = re.sub('Parsing Python: Ahoana ny fomba handikana fizarana data avy amin'ny rakitra HTML iray amin'ny rakitra HTML hafa', title, en_html)
            except FileNotFoundError:
                continue

        print(f'{filename} parsed')
        if use_parse_folder:
            try:
                with open(os.path.join(english_folder2+r'\parsed', 'parsed_'+filename), 'w', encoding='utf-8') as new_html:
                    new_html.write(en_html)
            except:
                os.mkdir(english_folder2+r'\parsed')
                with open(os.path.join(english_folder2+r'\parsed', 'parsed_'+filename), 'w', encoding='utf-8') as new_html:
                    new_html.write(en_html)
        else:
            with open(os.path.join(english_folder2, 'parsed_'+filename), 'w', encoding='utf-8') as html:
                html.write(en_html)

That's all folks.

Raha tianao ny code-ko, azafady zarao

Jereo ity kaody ity aoPowershellna python codesVersion 3naVersion 4naVersion 5

 


Latest articles accessed by readers:

  1. An Eye To See And A Mind To Understand
  2. Turn Towards Me With An Eye Full Of Your Own Gaze
  3. The Snapshot Of Magic In God's Universe
  4. Rhythm Of My Heart

Donate via Paypal

Alternate Text

RECURRENT DONATION

Donate monthly to support
the NeculaiFantanaru.com project

SINGLE DONATION

Donate the desired amount to support
the NeculaiFantanaru.com project

Donate by Bank Transfer

Account Ron: RO34INGB0000999900448439

Open account at ING Bank

Join The Neculai Fantanaru Community



* Note: If you want to read all my articles in real time, please check the romanian version !

decoration
About | Site Map | Partners | Feedback | Terms & Conditions | Privacy | RSS Feeds
© Neculai Fântânaru - All rights reserved