Subscribe to RSS

Question 1

I'm not yet very familiar with the patterns in Lua's string.gsub function. If I have a string like this: Fishing Lure(+100 Fishing Skill)(1 hour) and I want extract only the string "1 hour"...

Question 2

I have a dataset that originally was a json file which is converted to a data.table with data.table(jsonlite::fromJSON(data)). The resulting data.table is complex with nested data containing not only ...

Question 3

I have a dataset where the column to unnest contains data with unequal rows and columns rather than data with equal dimensions. I'm looking for a fast approach to unnest this dataset using data.table. ...

Question 4

I'm currently trying to filter a dataset containing audio data on bird species. The data looks like this: head(audiomoth_sample) id park park_abbr am_no sci_name com_name start_s end_s conf date_time ...

Question 5

Currently I have a dataframe of bear detections that I want to convert into a binary detection history (14 columns of day1, day2, day3, etc. where: actual_date_out = the date the camera was deployed, ...

Question 6

I have data where, for each individual, the dates of event are related. Here is an example: id Date 1001 2025-06-20 1002 2025-06-24 1002 2025-06-20 1002 2025-06-19 What I would like to ...

Question 7

I am currently working in a project where multiple databses are available to check for specific conditions of a patient. Specifically, I have a "master" database in wide format, with one row ...

Question 8

Question 9

Currently I'm working with a large database using PySpark and stuck with a problem oh how to correctly set row numbers depending on condition My dataframe is: id_company id_client id_loan date c1 ...

Question 10

Currently I'm trying to execute some filtering procedures in PySpark (educational purposes). I'm new to PySpark, so decided to ask for a help. My dataframe look like this: ID ApplicationDate ...

Question 11

Can anyone please help me with this scenario where I have might have multiple OBJECT_CONSTRUCT nested within an ARRAY_CONSTRUCT. I am not able to update one value of an element within it. I am using ...

Question 12

I'm currently in the process of cleaning up a large questionnaire data base. I wanted to know, if in R or pandas, there was a way to graphically change the order of columns. I mostly used RStudio and ...

Question 13

Currently I'm making calculations using PySpark and trying to match data from multiple dataframes on a specific conditions. I'm new to PySpark and decided to ask for a help. My first dataframe ...

Question 14

Currently, I'm making calculations using PySpark on a dataframe where information on how loans are paid by borrowers is shown. I'm new to PySpark and decided to ask for help while trying to execute ...

Question 15

I have a dataframe containing incalculable rows and columns. The df is structured in such that until 6th row and 2nd column, I have string as input and the rest are numbers(floating points). I want to ...

Collectives™ on Stack Overflow

String manipulation: extract words under brackets

Unnest a complex and inconsistent dataset using data.table

Fast unnest complex column with data.table

How do I filter my data with a looped "if" statement while retaining data from both current, past and an average of current + past loops?

Mutating detection data into binary

Get the number of days since last event

Find conditions from multiple databases to have in a single database

Formatting csv file format in pyspark

Setting a row number for each row in PySpark Dataframe

Execution of complex filtering procedures in PySpark

Update Object_construct nested in an Array_construct in Snowflake

Graphically reorganizing columns in DataFrame

Merge dataframes with conditions using PySpark

Advanced Filtering Operations in PySpark

Dropping rows whose row sum = zero keeping the original structure same

Hot Network Questions