Accuracy of Quantum Chemistry ML modelsCan quantum entanglement affect the chemistry of molecules?Mathematics...

Is right click on tables bad UX

Manager told a colleague of mine I was getting fired soon

How to refresh wired service getRecord manually?

Is there an in-universe explanation of how Frodo's arrival in Valinor was recorded in the Red Book?

The answer is a girl's name (my future granddaughter) - can anyone help?

Why did they use ultrafast diodes in a 50 or 60 Hz bridge?

If I travelled back in time to invest in X company to make a fortune, roughly what is the probability that it would fail?

Is it appropriate to "shop" through high-impact journals before sending the paper to more specialized journals?

Airport Security - advanced check, 4th amendment breach

How to protect bash function from being overridden?

Booting Ubuntu from USB drive on MSI motherboard -- EVERYTHING fails

Can my Beast Master ranger's baboon animal companion use her Wand of Magic Missiles?

Present participles of the verb esse

Citing CPLEX 12.9

Isn't the detector always measuring, and thus always collapsing the state?

Is "Ram married his daughter" ambiguous?

Is it possible for both sides of an encounter to be surprised?

Did Joe Biden "stop a prosecution" into his son in Ukraine? And did he brag about stopping the prosecution?

What action is recommended if your accommodation refuses to let you leave without paying additional fees?

Is the Basilisk Jaw a Slayer only drop?

Looking for circuit board material that can be dissolved

Why is music is taught by reading sheet music?

What did the Federation give the Prophets in exchange for access to the wormhole in DS9?

Is there a pattern for handling conflicting function parameters?



Accuracy of Quantum Chemistry ML models


Can quantum entanglement affect the chemistry of molecules?Mathematics in inorganic and quantum chemistryManifolds in quantum chemistryQuantum Chemistry: Small imaginary frequenciesCorrelation energy in quantum chemistryPerturbation theory in Quantum/Computational ChemistryCounting basis sets in quantum chemistry






.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty{
margin-bottom:0;
}








4












$begingroup$


I am trying to compare the performance of few Quantum Chemistry property prediction ML models. I was looking at the following table from DOI:
10.1039/c7sc02664a



enter image description here



The problem is that it does not contain any units. A preprint (https://arxiv.org/pdf/1909.00259.pdf, Table 4) suggests that the energy units are Hartree. Among energy properties, HOMO, LUMO, and gap have reasonable values for Hartree unit. However, values reported for U0, U, H, and G look absurd. As a comparison, look at MAE for mean value of each property:



enter image description here



These are in [eV] units (1 Ha = 27.212 eV). In other words, for instance MPNN models MAE for U property is 55.78 [eV] vs 8.25 [eV] for the mean baseline. This does not make sense at all. [k cal/mol] sounds more reasonable but then why half the energies in one unit the other half in other unit, and then why there are no units in a paper published by Royal Society of Chemistry. Seems like I'm missing something. Thanks!










share|improve this question









$endgroup$





















    4












    $begingroup$


    I am trying to compare the performance of few Quantum Chemistry property prediction ML models. I was looking at the following table from DOI:
    10.1039/c7sc02664a



    enter image description here



    The problem is that it does not contain any units. A preprint (https://arxiv.org/pdf/1909.00259.pdf, Table 4) suggests that the energy units are Hartree. Among energy properties, HOMO, LUMO, and gap have reasonable values for Hartree unit. However, values reported for U0, U, H, and G look absurd. As a comparison, look at MAE for mean value of each property:



    enter image description here



    These are in [eV] units (1 Ha = 27.212 eV). In other words, for instance MPNN models MAE for U property is 55.78 [eV] vs 8.25 [eV] for the mean baseline. This does not make sense at all. [k cal/mol] sounds more reasonable but then why half the energies in one unit the other half in other unit, and then why there are no units in a paper published by Royal Society of Chemistry. Seems like I'm missing something. Thanks!










    share|improve this question









    $endgroup$

















      4












      4








      4


      1



      $begingroup$


      I am trying to compare the performance of few Quantum Chemistry property prediction ML models. I was looking at the following table from DOI:
      10.1039/c7sc02664a



      enter image description here



      The problem is that it does not contain any units. A preprint (https://arxiv.org/pdf/1909.00259.pdf, Table 4) suggests that the energy units are Hartree. Among energy properties, HOMO, LUMO, and gap have reasonable values for Hartree unit. However, values reported for U0, U, H, and G look absurd. As a comparison, look at MAE for mean value of each property:



      enter image description here



      These are in [eV] units (1 Ha = 27.212 eV). In other words, for instance MPNN models MAE for U property is 55.78 [eV] vs 8.25 [eV] for the mean baseline. This does not make sense at all. [k cal/mol] sounds more reasonable but then why half the energies in one unit the other half in other unit, and then why there are no units in a paper published by Royal Society of Chemistry. Seems like I'm missing something. Thanks!










      share|improve this question









      $endgroup$




      I am trying to compare the performance of few Quantum Chemistry property prediction ML models. I was looking at the following table from DOI:
      10.1039/c7sc02664a



      enter image description here



      The problem is that it does not contain any units. A preprint (https://arxiv.org/pdf/1909.00259.pdf, Table 4) suggests that the energy units are Hartree. Among energy properties, HOMO, LUMO, and gap have reasonable values for Hartree unit. However, values reported for U0, U, H, and G look absurd. As a comparison, look at MAE for mean value of each property:



      enter image description here



      These are in [eV] units (1 Ha = 27.212 eV). In other words, for instance MPNN models MAE for U property is 55.78 [eV] vs 8.25 [eV] for the mean baseline. This does not make sense at all. [k cal/mol] sounds more reasonable but then why half the energies in one unit the other half in other unit, and then why there are no units in a paper published by Royal Society of Chemistry. Seems like I'm missing something. Thanks!







      quantum-chemistry machine-learning






      share|improve this question













      share|improve this question











      share|improve this question




      share|improve this question










      asked 9 hours ago









      BladeBlade

      1306 bronze badges




      1306 bronze badges

























          1 Answer
          1






          active

          oldest

          votes


















          4














          $begingroup$

          Yes, you're exactly right - multiple papers in chemistry ML drop the units.



          There are even comparisons (usually by statistics, ML or comp. sci. researchers) where models are compared by "averaging" errors down a column like that. Of course that's meaningless, since you can't average electron volts or Hartree (energies), Debye (dipole moments), and volume (polarizabilities).



          Worse, energies are often computed as atomization energies - so for large molecules, they can be enormous...



          In my opinion, a more relevant and meaningful statistic would be the Mean Absolute Percent Error (MAPE) which is unitless and easier to understand (e.g., 1% error? 0.01% error?)



          That said, the original QM9 paper gives the units Table 3:
          Scientific Data (2014) 1, art. 140022
          QM9 data table






          share|improve this answer









          $endgroup$











          • 1




            $begingroup$
            I'm adding an image of the data table - I believe this is under fair use, since one cannot copyright data - in this case the units of the QM9 set.
            $endgroup$
            – Geoff Hutchison
            8 hours ago










          • $begingroup$
            Thanks for the feedback @GeoffHutchison. I agree that it seems most reasonable to use datasets original units, but for atomization energies it doesn't seem to be right. I've decided to try recreate these results with deepchem. That seems to be the only way to find out.
            $endgroup$
            – Blade
            6 hours ago










          • $begingroup$
            Just to put it in perspective, the reason that I'm so suspicious is that if they are in fact using Ha units, then in a world that DFT error for U0 is 0.1 [eV] and people are already claiming to not only achieving this, but achieving chemical accuracy (0.04 [eV] I think), the Japanese paper is proudly reporting 1.35 [eV] and the benchmark method is 54.97 [eV]! So I believe that the Japanese paper is dead wrong (given that it's just on arxiv).
            $endgroup$
            – Blade
            6 hours ago






          • 1




            $begingroup$
            I have reviewed multiple manuscripts in the field that claim state-of-the-art accuracy when they're unaware of units or of other papers with better accuracy. I'm not sure what the "true" state is for U0 on QM9, but it's definitely < 1 kcal/mol
            $endgroup$
            – Geoff Hutchison
            5 hours ago













          Your Answer








          StackExchange.ready(function() {
          var channelOptions = {
          tags: "".split(" "),
          id: "431"
          };
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function() {
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled) {
          StackExchange.using("snippets", function() {
          createEditor();
          });
          }
          else {
          createEditor();
          }
          });

          function createEditor() {
          StackExchange.prepareEditor({
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: false,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: null,
          bindNavPrevention: true,
          postfix: "",
          imageUploader: {
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/4.0/"u003ecc by-sa 4.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          },
          onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          });


          }
          });















          draft saved

          draft discarded
















          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fchemistry.stackexchange.com%2fquestions%2f121744%2faccuracy-of-quantum-chemistry-ml-models%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown

























          1 Answer
          1






          active

          oldest

          votes








          1 Answer
          1






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes









          4














          $begingroup$

          Yes, you're exactly right - multiple papers in chemistry ML drop the units.



          There are even comparisons (usually by statistics, ML or comp. sci. researchers) where models are compared by "averaging" errors down a column like that. Of course that's meaningless, since you can't average electron volts or Hartree (energies), Debye (dipole moments), and volume (polarizabilities).



          Worse, energies are often computed as atomization energies - so for large molecules, they can be enormous...



          In my opinion, a more relevant and meaningful statistic would be the Mean Absolute Percent Error (MAPE) which is unitless and easier to understand (e.g., 1% error? 0.01% error?)



          That said, the original QM9 paper gives the units Table 3:
          Scientific Data (2014) 1, art. 140022
          QM9 data table






          share|improve this answer









          $endgroup$











          • 1




            $begingroup$
            I'm adding an image of the data table - I believe this is under fair use, since one cannot copyright data - in this case the units of the QM9 set.
            $endgroup$
            – Geoff Hutchison
            8 hours ago










          • $begingroup$
            Thanks for the feedback @GeoffHutchison. I agree that it seems most reasonable to use datasets original units, but for atomization energies it doesn't seem to be right. I've decided to try recreate these results with deepchem. That seems to be the only way to find out.
            $endgroup$
            – Blade
            6 hours ago










          • $begingroup$
            Just to put it in perspective, the reason that I'm so suspicious is that if they are in fact using Ha units, then in a world that DFT error for U0 is 0.1 [eV] and people are already claiming to not only achieving this, but achieving chemical accuracy (0.04 [eV] I think), the Japanese paper is proudly reporting 1.35 [eV] and the benchmark method is 54.97 [eV]! So I believe that the Japanese paper is dead wrong (given that it's just on arxiv).
            $endgroup$
            – Blade
            6 hours ago






          • 1




            $begingroup$
            I have reviewed multiple manuscripts in the field that claim state-of-the-art accuracy when they're unaware of units or of other papers with better accuracy. I'm not sure what the "true" state is for U0 on QM9, but it's definitely < 1 kcal/mol
            $endgroup$
            – Geoff Hutchison
            5 hours ago
















          4














          $begingroup$

          Yes, you're exactly right - multiple papers in chemistry ML drop the units.



          There are even comparisons (usually by statistics, ML or comp. sci. researchers) where models are compared by "averaging" errors down a column like that. Of course that's meaningless, since you can't average electron volts or Hartree (energies), Debye (dipole moments), and volume (polarizabilities).



          Worse, energies are often computed as atomization energies - so for large molecules, they can be enormous...



          In my opinion, a more relevant and meaningful statistic would be the Mean Absolute Percent Error (MAPE) which is unitless and easier to understand (e.g., 1% error? 0.01% error?)



          That said, the original QM9 paper gives the units Table 3:
          Scientific Data (2014) 1, art. 140022
          QM9 data table






          share|improve this answer









          $endgroup$











          • 1




            $begingroup$
            I'm adding an image of the data table - I believe this is under fair use, since one cannot copyright data - in this case the units of the QM9 set.
            $endgroup$
            – Geoff Hutchison
            8 hours ago










          • $begingroup$
            Thanks for the feedback @GeoffHutchison. I agree that it seems most reasonable to use datasets original units, but for atomization energies it doesn't seem to be right. I've decided to try recreate these results with deepchem. That seems to be the only way to find out.
            $endgroup$
            – Blade
            6 hours ago










          • $begingroup$
            Just to put it in perspective, the reason that I'm so suspicious is that if they are in fact using Ha units, then in a world that DFT error for U0 is 0.1 [eV] and people are already claiming to not only achieving this, but achieving chemical accuracy (0.04 [eV] I think), the Japanese paper is proudly reporting 1.35 [eV] and the benchmark method is 54.97 [eV]! So I believe that the Japanese paper is dead wrong (given that it's just on arxiv).
            $endgroup$
            – Blade
            6 hours ago






          • 1




            $begingroup$
            I have reviewed multiple manuscripts in the field that claim state-of-the-art accuracy when they're unaware of units or of other papers with better accuracy. I'm not sure what the "true" state is for U0 on QM9, but it's definitely < 1 kcal/mol
            $endgroup$
            – Geoff Hutchison
            5 hours ago














          4














          4










          4







          $begingroup$

          Yes, you're exactly right - multiple papers in chemistry ML drop the units.



          There are even comparisons (usually by statistics, ML or comp. sci. researchers) where models are compared by "averaging" errors down a column like that. Of course that's meaningless, since you can't average electron volts or Hartree (energies), Debye (dipole moments), and volume (polarizabilities).



          Worse, energies are often computed as atomization energies - so for large molecules, they can be enormous...



          In my opinion, a more relevant and meaningful statistic would be the Mean Absolute Percent Error (MAPE) which is unitless and easier to understand (e.g., 1% error? 0.01% error?)



          That said, the original QM9 paper gives the units Table 3:
          Scientific Data (2014) 1, art. 140022
          QM9 data table






          share|improve this answer









          $endgroup$



          Yes, you're exactly right - multiple papers in chemistry ML drop the units.



          There are even comparisons (usually by statistics, ML or comp. sci. researchers) where models are compared by "averaging" errors down a column like that. Of course that's meaningless, since you can't average electron volts or Hartree (energies), Debye (dipole moments), and volume (polarizabilities).



          Worse, energies are often computed as atomization energies - so for large molecules, they can be enormous...



          In my opinion, a more relevant and meaningful statistic would be the Mean Absolute Percent Error (MAPE) which is unitless and easier to understand (e.g., 1% error? 0.01% error?)



          That said, the original QM9 paper gives the units Table 3:
          Scientific Data (2014) 1, art. 140022
          QM9 data table







          share|improve this answer












          share|improve this answer



          share|improve this answer










          answered 8 hours ago









          Geoff HutchisonGeoff Hutchison

          21.7k3 gold badges56 silver badges117 bronze badges




          21.7k3 gold badges56 silver badges117 bronze badges











          • 1




            $begingroup$
            I'm adding an image of the data table - I believe this is under fair use, since one cannot copyright data - in this case the units of the QM9 set.
            $endgroup$
            – Geoff Hutchison
            8 hours ago










          • $begingroup$
            Thanks for the feedback @GeoffHutchison. I agree that it seems most reasonable to use datasets original units, but for atomization energies it doesn't seem to be right. I've decided to try recreate these results with deepchem. That seems to be the only way to find out.
            $endgroup$
            – Blade
            6 hours ago










          • $begingroup$
            Just to put it in perspective, the reason that I'm so suspicious is that if they are in fact using Ha units, then in a world that DFT error for U0 is 0.1 [eV] and people are already claiming to not only achieving this, but achieving chemical accuracy (0.04 [eV] I think), the Japanese paper is proudly reporting 1.35 [eV] and the benchmark method is 54.97 [eV]! So I believe that the Japanese paper is dead wrong (given that it's just on arxiv).
            $endgroup$
            – Blade
            6 hours ago






          • 1




            $begingroup$
            I have reviewed multiple manuscripts in the field that claim state-of-the-art accuracy when they're unaware of units or of other papers with better accuracy. I'm not sure what the "true" state is for U0 on QM9, but it's definitely < 1 kcal/mol
            $endgroup$
            – Geoff Hutchison
            5 hours ago














          • 1




            $begingroup$
            I'm adding an image of the data table - I believe this is under fair use, since one cannot copyright data - in this case the units of the QM9 set.
            $endgroup$
            – Geoff Hutchison
            8 hours ago










          • $begingroup$
            Thanks for the feedback @GeoffHutchison. I agree that it seems most reasonable to use datasets original units, but for atomization energies it doesn't seem to be right. I've decided to try recreate these results with deepchem. That seems to be the only way to find out.
            $endgroup$
            – Blade
            6 hours ago










          • $begingroup$
            Just to put it in perspective, the reason that I'm so suspicious is that if they are in fact using Ha units, then in a world that DFT error for U0 is 0.1 [eV] and people are already claiming to not only achieving this, but achieving chemical accuracy (0.04 [eV] I think), the Japanese paper is proudly reporting 1.35 [eV] and the benchmark method is 54.97 [eV]! So I believe that the Japanese paper is dead wrong (given that it's just on arxiv).
            $endgroup$
            – Blade
            6 hours ago






          • 1




            $begingroup$
            I have reviewed multiple manuscripts in the field that claim state-of-the-art accuracy when they're unaware of units or of other papers with better accuracy. I'm not sure what the "true" state is for U0 on QM9, but it's definitely < 1 kcal/mol
            $endgroup$
            – Geoff Hutchison
            5 hours ago








          1




          1




          $begingroup$
          I'm adding an image of the data table - I believe this is under fair use, since one cannot copyright data - in this case the units of the QM9 set.
          $endgroup$
          – Geoff Hutchison
          8 hours ago




          $begingroup$
          I'm adding an image of the data table - I believe this is under fair use, since one cannot copyright data - in this case the units of the QM9 set.
          $endgroup$
          – Geoff Hutchison
          8 hours ago












          $begingroup$
          Thanks for the feedback @GeoffHutchison. I agree that it seems most reasonable to use datasets original units, but for atomization energies it doesn't seem to be right. I've decided to try recreate these results with deepchem. That seems to be the only way to find out.
          $endgroup$
          – Blade
          6 hours ago




          $begingroup$
          Thanks for the feedback @GeoffHutchison. I agree that it seems most reasonable to use datasets original units, but for atomization energies it doesn't seem to be right. I've decided to try recreate these results with deepchem. That seems to be the only way to find out.
          $endgroup$
          – Blade
          6 hours ago












          $begingroup$
          Just to put it in perspective, the reason that I'm so suspicious is that if they are in fact using Ha units, then in a world that DFT error for U0 is 0.1 [eV] and people are already claiming to not only achieving this, but achieving chemical accuracy (0.04 [eV] I think), the Japanese paper is proudly reporting 1.35 [eV] and the benchmark method is 54.97 [eV]! So I believe that the Japanese paper is dead wrong (given that it's just on arxiv).
          $endgroup$
          – Blade
          6 hours ago




          $begingroup$
          Just to put it in perspective, the reason that I'm so suspicious is that if they are in fact using Ha units, then in a world that DFT error for U0 is 0.1 [eV] and people are already claiming to not only achieving this, but achieving chemical accuracy (0.04 [eV] I think), the Japanese paper is proudly reporting 1.35 [eV] and the benchmark method is 54.97 [eV]! So I believe that the Japanese paper is dead wrong (given that it's just on arxiv).
          $endgroup$
          – Blade
          6 hours ago




          1




          1




          $begingroup$
          I have reviewed multiple manuscripts in the field that claim state-of-the-art accuracy when they're unaware of units or of other papers with better accuracy. I'm not sure what the "true" state is for U0 on QM9, but it's definitely < 1 kcal/mol
          $endgroup$
          – Geoff Hutchison
          5 hours ago




          $begingroup$
          I have reviewed multiple manuscripts in the field that claim state-of-the-art accuracy when they're unaware of units or of other papers with better accuracy. I'm not sure what the "true" state is for U0 on QM9, but it's definitely < 1 kcal/mol
          $endgroup$
          – Geoff Hutchison
          5 hours ago



















          draft saved

          draft discarded



















































          Thanks for contributing an answer to Chemistry Stack Exchange!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid



          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.


          Use MathJax to format equations. MathJax reference.


          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fchemistry.stackexchange.com%2fquestions%2f121744%2faccuracy-of-quantum-chemistry-ml-models%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          Taj Mahal Inhaltsverzeichnis Aufbau | Geschichte | 350-Jahr-Feier | Heutige Bedeutung | Siehe auch |...

          Baia Sprie Cuprins Etimologie | Istorie | Demografie | Politică și administrație | Arii naturale...

          Nicolae Petrescu-Găină Cuprins Biografie | Opera | In memoriam | Varia | Controverse, incertitudini...