Skip to content
Discussion options

You must be logged in to vote

Training a Model with a Huge Database

Context of Big Data

When dealing with a real data mining problem, you may have a huge database, potentially containing
terabytes of data. This is likely to happen in the case of a multi-table database, where your main
analyzed instances are described with detailed records stored in secondary tables.

For example, if your entity is a customer, you may have many logs per customer:

  • Purchase details in marketing
  • Call detail records in telecommunications
  • Web navigation logs
  • ...

At Orange telecommunications, there are tens of millions of customers in France, each with
thousands of logs in secondary tables, amounting to tens of billions of logs at terabyte …

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by lucaurelien
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #1004 on April 24, 2026 13:36.