Suggestions requested, please.
For a database management system to handle a two record, free format
text data of highly variable record length (between 2 and 10,000
bytes), with an expected maximum of 500,000 record pairs.
Indexed within one record of the record pair on all ‘words’ not
contained within a ‘stop word’ list or perhaps comprising individual
records within another data file. Index of ‘words’ and also phrases to
correspond to the other record of the record pair.
Expected access to be around 50,000 requests per 24 hour period,
response time is important, disk space is at a premium and as such
solutions with fixed block text fields may be overly inefficient,
platform prefered Linux.
Sensible, thought out, suggestions by experienced database developers
should be welcomed please.
Matt