REAL database
The largest enumerated database of synthetically feasible molecules
The REAL database is one way to explore the REAL space: a classical database of enumerated structures. Use it to find new hit molecules through large-scale virtual screening and to search for analogs of your existing hits. The REAL database is available in SMILES and SDF formats and is searchable on Enaminestore.
The current release of the REAL database comprises over 1.36 billion molecules that comply with the “rule of 5” and Veber criteria: MW≤500, SlogP≤5, HBA≤10, HBD≤5, rotatable bonds≤10, and TPSA≤140.
- REAL database, 1.36Bn cpds, SMILES, part 1
- REAL database, 1.36Bn cpds, SMILES, part 2
- REAL database, 1.36Bn cpds, SMILES, part 3
- REAL database, 1.36Bn cpds, SMILES, part 4
- REAL database, 1.36Bn cpds, SMILES, part 5
- REAL database, 1.36Bn cpds, SMILES, part 6
- REAL database, 1.36Bn cpds, SMILES, part 7
- REAL database, 1.36Bn cpds, SMILES, part 8
- REAL database, 1.36Bn cpds, SMILES, part 9
- REAL database, 1.36Bn cpds, SMILES, part 10
- REAL database, 1.36Bn cpds, SMILES, part 11
- REAL database, 1.36Bn cpds, SMILES, part 12
- REAL database, 1.36Bn cpds, SMILES, part 13
- REAL database, 1.36Bn cpds, SMILES, part 14
- REAL database, 1.36Bn cpds, SMILES, part 15
- REAL database, 1.36Bn cpds, SMILES, part 16
- REAL database, 1.36Bn cpds, SMILES, part 17
- REAL database, 1.36Bn cpds, SMILES, part 18
- REAL database, 1.36Bn cpds, SMILES, part 19
- REAL database, 1.36Bn cpds, SMILES, part 20
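The property thresholds quoted for the current release can be written as a single predicate, which may be handy for checking your own structures against the same limits. The following is a minimal sketch, not Enamine's implementation: the `Descriptors` class, its field names, and the example values are all assumptions made for illustration, and the descriptor values are expected to be computed beforehand with a tool of your choice.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Descriptors:
    """Precomputed properties of one molecule (hypothetical field names)."""
    mw: float        # molecular weight
    slogp: float     # calculated logP
    hba: int         # hydrogen-bond acceptors
    hbd: int         # hydrogen-bond donors
    rot_bonds: int   # rotatable bonds
    tpsa: float      # topological polar surface area


def passes_real_filters(d: Descriptors) -> bool:
    """Return True if the molecule meets the rule-of-5 and Veber limits above."""
    rule_of_5 = d.mw <= 500 and d.slogp <= 5 and d.hba <= 10 and d.hbd <= 5
    veber = d.rot_bonds <= 10 and d.tpsa <= 140
    return rule_of_5 and veber


# Invented example values, used only to exercise the predicate.
compliant = Descriptors(mw=342.4, slogp=2.1, hba=5, hbd=2, rot_bonds=6, tpsa=78.5)
too_heavy = Descriptors(mw=612.7, slogp=4.3, hba=9, hbd=3, rot_bonds=9, tpsa=120.0)

print(passes_real_filters(compliant))
print(passes_real_filters(too_heavy))
```

All comparisons are inclusive, matching the "≤" signs in the stated criteria.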
Despite its size, the REAL database is easy to work with. Along with SMILES and catalog IDs, each REAL molecule is annotated with important physicochemical parameters (MW, SlogP, HBA, HBD, etc.), structural alerts (PAINS, Brenk, and Eli Lilly medchem rules), its relation to the REAL compound libraries, and the type of chemistry, and therefore the effort, required for its synthesis (“s”: simple chemistry, standard effort; “m”: advanced chemistry, high effort). The database also lets you link the final structures with the corresponding building blocks. Besides mining the entire REAL database, you can conveniently prepare a subset for further processing by first selecting the preferred building blocks and then extracting their REAL derivatives.
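Preparing a subset from preferred building blocks could look roughly like the sketch below, which streams a file row by row so that a multi-gigabyte part never has to fit in memory. The layout is an assumption: a tab-separated file with a header row and columns named `smiles`, `id`, `type`, `bb1`, and `bb2`. These names, the building-block codes, and the sample rows are hypothetical, so check the header of the downloaded files and adjust accordingly.

```python
import csv
import io
from typing import Dict, Iterable, Iterator, Optional, Sequence, Set


def select_by_building_blocks(
    lines: Iterable[str],
    wanted: Set[str],
    bb_columns: Sequence[str] = ("bb1", "bb2"),  # hypothetical column names
    effort: Optional[str] = None,                # "s" or "m"; None keeps both
) -> Iterator[Dict[str, str]]:
    """Yield rows that use at least one of the wanted building blocks."""
    reader = csv.DictReader(lines, delimiter="\t")
    for row in reader:
        if effort is not None and row.get("type") != effort:
            continue
        if any(row.get(col) in wanted for col in bb_columns):
            yield row


# Invented sample standing in for one downloaded part of the database.
SAMPLE = (
    "smiles\tid\ttype\tbb1\tbb2\n"
    "CCOC(=O)c1ccccc1\tID-0001\ts\tBB-A\tBB-B\n"
    "CCN(CC)C(=O)c1ccccc1\tID-0002\tm\tBB-C\tBB-D\n"
    "O=C(NCc1ccccc1)c1ccccc1\tID-0003\ts\tBB-E\tBB-A\n"
)

hits = list(select_by_building_blocks(io.StringIO(SAMPLE), {"BB-A"}))
print([row["id"] for row in hits])
```

For a real file, pass an open text handle instead of `io.StringIO`; because the function is a generator, results can be written out as they are found.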