Probing conversational ai with publicly available dates

The large language models I’m working with include a disclaimer indicating date limits within its training data. One way I’ve found to directly assess these date ranges is by asking about specific versions of publicly published software packages.

I’ve been working with ruby recently and decided to ask both llama and ChatGPT-4:

what is the latest ruby version on the 2.x series?1

Why is this important?

Small patch versions can contain critical security patches. In this example 2.7.8 is the latest 2.x.x release and contains two security fixes. It’s important to check published first-hand resources and not rely on an llm for this specific of an answer.

Chat-GTP4 got the correct combination of release version and date. I’m not sure where llama got the March 21, 2023 date from. The only releases I see in GitHub from March 2023 are on March 30th. I’m sure I prefer outdated correct answers over false/fabricated assertions.

Ruby maintainers - 2.7.8

Primary sources of truth indicate 2.7.8 and was released on March 30, 2023.

  • https://github.com/ruby/ruby/releases/tag/v2_7_8
  • https://www.ruby-lang.org/en/downloads/releases/

meta llama ruby 2 version

llama - 2.7.7

The latest version of Ruby 2 is Ruby 2.7.7, which was released on March 21, 2023

meta llama ruby 2 version

llama also has a sources link but in this example it linked to a bing search result. Everything your history and/or english teacher told you about citing primary sources is true.

ChatGPT-42 - 2.7.6

The latest version of Ruby in the 2.x series is Ruby 2.7.6, which was released on April 12, 2022

chatgpt-4 ruby 2 version


Author’s Note

It’s very difficult for me to ascertain how large of a problem this is in the software field and beyond. Are the training sets going to be continuously updated? The biggest distinguishing mark from traditional search is the inclusion of publication dates from a variety of sites and publication sources.2 If I could wave a wand I would would get confidence and temperature settings back from the llm responses.

See below a “traditional search” result. Never mind that I had to do a full page scroll below the fold to go past Google’s own “quick answers” section… and of course reddit for the win.

google search ruby 2 version

Significant revisions

tags: 2024, technology, chatgpt, llm, llama, ai

EOF/Footnotes

  1. The original queries were placed within a few hours of each other on May 2nd 2024. 

  2. Citations are now available in both ChatGPT-4o and Alphabet Gemini as noted in ChatGPT-4o Initial impressions on latest model (with citations) Posted with direct answers to the open questions in Author's note 2