GPTZero PPL - ValueError: cannot convert float NaN to integer #11

MelaniaNitu · 2023-07-09T12:04:46Z

@BurhanUlTayyab Thanks for sharing the implementation. When running the GPTZero code, I get the following error:

[/content/DetectGPT/model.py](https://localhost:8080/#) in getPPL_1(self, sentence)
    374             if end_loc == seq_len:
    375                 break
--> 376         ppl = int(torch.exp(torch.stack(nlls).sum() / end_loc))
    377         return ppl
    378 

**ValueError: cannot convert float NaN to integer**
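For context, Python's `int()` raises exactly this error for a NaN float, and a NaN in any of the stacked losses propagates through the sum and the exponential. Below is a minimal sketch of a guarded conversion, written in plain Python rather than torch so it runs on its own. The name `safe_ppl` and its arguments are hypothetical and do not exist in `model.py`; returning `None` for an undefined perplexity is an assumption about what the caller would want.

```python
import math

def safe_ppl(nll_sum, n_tokens):
    """Return an integer perplexity, or None when it is undefined.

    nll_sum  -- summed negative log-likelihood (a plain float here)
    n_tokens -- number of tokens the sum was computed over
    """
    # Dividing by zero tokens has no meaningful result.
    if n_tokens <= 0:
        return None
    # NaN or infinite losses would make int() fail later.
    if not math.isfinite(nll_sum):
        return None
    try:
        ppl = math.exp(nll_sum / n_tokens)
    except OverflowError:
        # The average loss is too large to exponentiate as a float.
        return None
    return int(ppl)
```

With torch tensors, the same idea could be applied by checking `torch.isfinite` on the value before calling `int()`.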

The code I use to test GPTZero is:

  import pandas as pd
  from model import GPT2PPLV2
  import torch
  
  model = GPT2PPLV2()
  
  res_texts = []
  max_tokens = 512
  
  filtered_list = [text for text in mylist if len(text.split()) >= 100]  # Remove texts with less than 100 words

  for text in filtered_list:
      input_text = text[:max_tokens]  # note: this keeps the first 512 characters, not 512 tokens
      result = model(input_text, 300, "v1")
      res_texts.append(result)

I have pre-processed the input text to handle NaN values and empty lines as shown below, but I still get this error when running the GPTZero model.

import re
import numpy as np

df['text'] = df['text'].fillna('')
df['text'] = df['text'].apply(lambda x: re.sub(r'\n\s*\n', '\n', x.strip()) if isinstance(x, str) else np.nan)
df['text'] = df['text'].apply(lambda x: x.strip().replace('\n\n', '\n') if isinstance(x, str) else '')
new_df = df.dropna(subset=['text'])  # note: empty strings are not NaN, so dropna keeps them

Can you please change the model.py code to handle NaN or provide a workaround to "skip" any line containing NaN when running the model?

Thanks in advance.
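Until `model.py` itself is changed, one possible caller-side workaround is to skip any text for which scoring raises `ValueError`. The sketch below assumes that skipping is acceptable for the evaluation. `score_texts` and `score_fn` are hypothetical names introduced for illustration, not part of the repository.

```python
def score_texts(texts, score_fn):
    """Score each text, skipping empty inputs and inputs that raise ValueError.

    Returns (results, skipped): results is a list of (index, score) pairs,
    skipped is a list of indices that could not be scored.
    """
    results, skipped = [], []
    for i, text in enumerate(texts):
        # Non-string or whitespace-only entries cannot be scored.
        if not isinstance(text, str) or not text.strip():
            skipped.append(i)
            continue
        try:
            results.append((i, score_fn(text)))
        except ValueError:
            # e.g. "cannot convert float NaN to integer"
            skipped.append(i)
    return results, skipped
```

With the setup from this issue, the call might look like `score_texts(filtered_list, lambda t: model(t, 300, "v1"))`. Keeping the skipped indices makes it possible to inspect afterwards which inputs triggered the NaN.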


nick-tonjum · 2024-03-27T16:19:02Z

Any solution yet? I'm also facing this.

Kanishk-Kumar · 2024-11-26T13:12:52Z

Hi @BurhanUlTayyab , any ideas?

MelaniaNitu changed the title ~~GPTZero PPL error~~ GPTZero PPL - ValueError: cannot convert float NaN to integer Jul 9, 2023
