Skip to main content
0 votes
1 answer
96 views

What are the details of your problem? I am a teacher and I want to use Python to create a worksheet for my students. I have a vocabulary PDF with content like this: do your best duː jɔː best 33, 81 do ...
Hưng Phạm Đăng's user avatar
0 votes
2 answers
150 views

I'm trying to extract the part of a string that is outside parentheses using regular expressions. For example, from the input string: "Hello (this is extra) world (more text)" I want to ...
Kalyani Boyini's user avatar
0 votes
0 answers
102 views

ASP.NET Core 9 MVC / C# controller extracts texts from invoices using pdfpig based on code in answer How to group text to lines if there is small difference in Y position. Invoices can have multiple ...
Andrus's user avatar
  • 28.2k
2 votes
5 answers
200 views

I have a script that runs a grep command and formats the results nicely for me, asking if I want to open any of the resulting files in an editor etc. The core of my script is a command like this: grep ...
Robert Mark Bram's user avatar
0 votes
0 answers
43 views

I'm trying to use RegexBuilder/Swift to write a Swift method that extracts for example lists enclosed by <ul> and </ul> from an HTML-string. In this example let htmlText = ""&...
Patrick Bauers's user avatar
3 votes
3 answers
136 views

I need help with a regular expression in Notepad++. I want to: Find lines that contain a ( character but do not contain a ) character. Add a ) at the end of those lines. I tried using this regex to ...
Pubg Mobile's user avatar
1 vote
1 answer
91 views

How do I get rid of non-printing (escaped) characters from the output of the nltk.word_tokenize method? I am working through the book 'Natural Language Processing with Python' and am following the ...
green_ruby's user avatar
1 vote
1 answer
114 views

I'm processing CSV files in Python to extract and format data from another file. However, when writing the output, I get an extra newline before -----END CERTIFICATE-----. I want the output to have ...
Mbuyi Tuambilangana's user avatar
2 votes
2 answers
97 views

raw txt file contains these lines: cat raw.txt ID DESCRIPTION ----- -------------- 2 item2 4 item4 1 item1 3 item3 How can reorder it by ID as ...
showkey's user avatar
  • 375
0 votes
1 answer
114 views

I am working on a Flask-based web application that processes multilingual agenda documents. The documents are in DOC/DOCX format and contain structured agenda items that I need to extract and format ...
Binal Dalia's user avatar
3 votes
1 answer
151 views

Note that this is the follow up question of Parse text file, change some strings to camel case, add other strings . The parsing rules are similar but different: The input order in the output is ...
albertkao9's user avatar
1 vote
1 answer
96 views

The parsing rules are: Replace the string "public static final String" with the string "export const" if that string occurs only once. Replace the string "public static final ...
albertkao9's user avatar
2 votes
1 answer
97 views

Problem: I have a two-step bioinformatics pipeline where: Code 1 generates output files (.marked.bam) and places them into a directory structure. Code 2 processes annotated files (annotated....
Melissa's user avatar
  • 59
1 vote
1 answer
62 views

I am trying to return richness of NLTK text provided in NLTK book but for some reason I get None. Can someone please explain me what I am doing wrong? from nltk.book import * def ...
Dana Tussupbekova's user avatar
-5 votes
1 answer
120 views

I've encountered an issue in python. I have a string that contains both a message and code, and I need to separate them and pass each to different functions. An example: text = """ Can ...
Callme-Milad's user avatar

15 30 50 per page
1
2 3 4 5
133