ro  fr  en  es  pt  ar  zh  hi  de  ru
ART 2.0 ART 3.0 ART 4.0 ART 5.0 ART 6.0 Pinterest

Python: Replaces the string ( ) with a space, and then removes all duplicate spaces from html tags

On Noiembrie 23, 2021, in Leadership and Attitude, by Neculai Fantanaru

Sampeyan bisa ndeleng kode lengkap ing kene:Https: // Passatin.com / 03 Aku ing Ve PX

InstalPythonWaca rangkeng-.

Kode python bakal ngganti senar ( ) Kanthi papan lan mbusak papan duplikat ing tag HTML sing ana ing antarane rong wateslanWaca rangkeng-.

Uga bakal mbusak ruang kosong ing wiwitan lan pungkasan saben baris sing ana ing tag HTML. Aku mung njupuk tag menyang akun

.. lan

..



<p class="obisnuit"><em>    Honor  your  moral and spiritual      obligations    .em>p>
<p class="nint">   Bishop  knew how to say the    most meaningful      of things  speech p>

Bakal dadi:



<p class="obisnuit"><em>Honor your moral and spiritual obligations.em>p>
<p class="nint">Bishop knew how to say the most meaningful of things speech.p>

Kode: Nyalin lan mbukak kode ing ngisor iki ing program penerjemah(Aku nggunakakePyscripterWaca rangkeng-.Aja lali ngganti dalan ing "direktori_name =".

import re
import os


def read_text_from_file(file_path):
    """
    Aceasta functie returneaza continutul unui fisier.
    file_path: calea catre fisierul din care vrei sa citesti
    """
    with open(file_path, encoding='utf8') as f:
        text = f.read()
        return text


def write_to_file(text, file_path):
    """
    Aceasta functie scrie un text intr-un fisier.
    text: textul pe care vrei sa il scrii
    file_path: calea catre fisierul in care vrei sa scrii
    """
    with open(file_path, 'wb') as f:
        f.write(text.encode('utf8', 'ignore'))


def replace_white_spaces(tag_name, file_path):
    """
    Aceasta functie modifica textul dintre un tag dat ca argument.
    """

    text = read_text_from_file(file_path)

    text = str(text)

    articol_pattern = re.compile('[\s\S]*?[\s\S]*?')
    text_articol = re.findall(articol_pattern, text)
    if len(text_articol) != 0:
        text_articol = str(text_articol[0])
        pattern = re.compile('<{} class=\".*?\">(.*?)'.format(tag_name, tag_name))  
       
        tag_texts = re.findall(pattern, text_articol)
       
        new_text_articol = text_articol
        for tag_text in tag_texts:
          
           
           
            new_text = tag_text.strip()
            m = re.findall('(.*?)', new_text)
            if len(m) >= 1:
                text_em = str(m[0])
                text_em_new = text_em
                text_em_new = text_em_new.replace(r' ', r' ')
                text_em_new = text_em_new.strip()
                new_text = new_text.replace(text_em, text_em_new)
            
            new_text = new_text.replace(r' ', r' ')
            new_text = " ".join(new_text.split())
            
            new_text_articol = new_text_articol.replace(tag_text, new_text)
        
        text = text.replace(text_articol, new_text_articol)
        write_to_file(text, file_path)
        print("Fisierul modificat cu succes este: ", file_path)
    else:
        print("Fisierul nu are structura corecta: ", file_path)




def gaseste_nbsp(file_path):
    text_Reg = f.read()


def replace_white_spaces_only_html_php(tag_name, directory_name):
    for root, dirs, files in os.walk(directory_name):
        for f in files:
            if f.endswith('html'):
                file_path = os.path.join(root, f)
                replace_white_spaces(tag_name, file_path)
            else:
                continue

if __name__ == '__main__':  
    
    
    directory_name = 'c:\\Folder1'
   
    tag_name = 'p'
    
    replace_white_spaces_only_html_php(tag_name, directory_name)

That's all folks.

If you like my code, then make me a favor: translate your website into Romanian, "ro".

Uga, anaVersi 2saka kode iki utawaVersi 3utawaVersi 4utawaVersi 5utawaVersi 6

 


Latest articles accessed by readers:

  1. An Eye To See And A Mind To Understand
  2. Turn Towards Me With An Eye Full Of Your Own Gaze
  3. The Snapshot Of Magic In God's Universe
  4. Rhythm Of My Heart

Donează prin Paypal

Alternate Text

DONAŢIE RECURENTĂ

Donează lunar pentru susţinerea proiectului NeculaiFantanaru.com

DONAŢIE SINGULARĂ

Donează suma dorită pentru susţinerea proiectului NeculaiFantanaru.com

Donează prin Transfer Bancar

Cont Lei: RO34INGB0000999900448439

Deschis la ING Bank

Alatura-te Comunitatii Neculai Fantanaru

decoration
About | Site Map | Partners | Feedback | Terms & Conditions | Privacy | RSS Feeds
© Neculai Fântânaru - All rights reserved