pd.read_html() imports a list rather than a dataframe










7















I used pd.read_html() to import a table from a webpage but instead of structuring the data as a dataframe Python imported it as a list. How can I import the data as a dataframe? Thank you!



The code is the following:



import pandas as pd

import html5lib

url = 'http://www.fdic.gov/bank/individual/failed/banklist.html'

dfs = pd.read_html(url)

type(dfs)

Out[1]: list









share|improve this question




























    7















    I used pd.read_html() to import a table from a webpage but instead of structuring the data as a dataframe Python imported it as a list. How can I import the data as a dataframe? Thank you!



    The code is the following:



    import pandas as pd

    import html5lib

    url = 'http://www.fdic.gov/bank/individual/failed/banklist.html'

    dfs = pd.read_html(url)

    type(dfs)

    Out[1]: list









    share|improve this question


























      7












      7








      7


      1






      I used pd.read_html() to import a table from a webpage but instead of structuring the data as a dataframe Python imported it as a list. How can I import the data as a dataframe? Thank you!



      The code is the following:



      import pandas as pd

      import html5lib

      url = 'http://www.fdic.gov/bank/individual/failed/banklist.html'

      dfs = pd.read_html(url)

      type(dfs)

      Out[1]: list









      share|improve this question
















      I used pd.read_html() to import a table from a webpage but instead of structuring the data as a dataframe Python imported it as a list. How can I import the data as a dataframe? Thank you!



      The code is the following:



      import pandas as pd

      import html5lib

      url = 'http://www.fdic.gov/bank/individual/failed/banklist.html'

      dfs = pd.read_html(url)

      type(dfs)

      Out[1]: list






      python html pandas






      share|improve this question















      share|improve this question













      share|improve this question




      share|improve this question








      edited Sep 26 '16 at 19:34









      alecxe

      329k72656878




      329k72656878










      asked Sep 26 '16 at 19:31









      AlKAlK

      81515




      81515






















          1 Answer
          1






          active

          oldest

          votes


















          6














          .read_html() produces a list of dataframes (there could be multiple tables in an HTML source), get the desired one by index. In your case, there is a single dataframe:



          dfs = pd.read_html(url)
          df = dfs[0]
          print(df)


          Note that, if there are no tables in the HTML source, it would return an error and would never produce an empty list.






          share|improve this answer























          • Thank you! Indeed df is a dataframe. However when I tried the .head, .tail and .index parameters on df Python returned error messages. How can I correct this?

            – AlK
            Sep 26 '16 at 19:45











          • @AlexanderKonstantinidis interesting, df.tail, df.head, df.index work for me. What error(s) are you getting?

            – alecxe
            Sep 26 '16 at 19:48











          • AttributeError: 'DataFrame' object has no attribute 'heads', AttributeError: 'DataFrame' object has no attribute 'tails', TypeError: 'Index' object is not callable

            – AlK
            Sep 26 '16 at 19:51






          • 1





            @AlexanderKonstantinidis ah, it is tail, head and index - no s prefix and no () (these are not methods).

            – alecxe
            Sep 26 '16 at 19:56











          Your Answer






          StackExchange.ifUsing("editor", function ()
          StackExchange.using("externalEditor", function ()
          StackExchange.using("snippets", function ()
          StackExchange.snippets.init();
          );
          );
          , "code-snippets");

          StackExchange.ready(function()
          var channelOptions =
          tags: "".split(" "),
          id: "1"
          ;
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function()
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled)
          StackExchange.using("snippets", function()
          createEditor();
          );

          else
          createEditor();

          );

          function createEditor()
          StackExchange.prepareEditor(
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: true,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: 10,
          bindNavPrevention: true,
          postfix: "",
          imageUploader:
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          ,
          onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          );



          );













          draft saved

          draft discarded


















          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f39710903%2fpd-read-html-imports-a-list-rather-than-a-dataframe%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown

























          1 Answer
          1






          active

          oldest

          votes








          1 Answer
          1






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes









          6














          .read_html() produces a list of dataframes (there could be multiple tables in an HTML source), get the desired one by index. In your case, there is a single dataframe:



          dfs = pd.read_html(url)
          df = dfs[0]
          print(df)


          Note that, if there are no tables in the HTML source, it would return an error and would never produce an empty list.






          share|improve this answer























          • Thank you! Indeed df is a dataframe. However when I tried the .head, .tail and .index parameters on df Python returned error messages. How can I correct this?

            – AlK
            Sep 26 '16 at 19:45











          • @AlexanderKonstantinidis interesting, df.tail, df.head, df.index work for me. What error(s) are you getting?

            – alecxe
            Sep 26 '16 at 19:48











          • AttributeError: 'DataFrame' object has no attribute 'heads', AttributeError: 'DataFrame' object has no attribute 'tails', TypeError: 'Index' object is not callable

            – AlK
            Sep 26 '16 at 19:51






          • 1





            @AlexanderKonstantinidis ah, it is tail, head and index - no s prefix and no () (these are not methods).

            – alecxe
            Sep 26 '16 at 19:56
















          6














          .read_html() produces a list of dataframes (there could be multiple tables in an HTML source), get the desired one by index. In your case, there is a single dataframe:



          dfs = pd.read_html(url)
          df = dfs[0]
          print(df)


          Note that, if there are no tables in the HTML source, it would return an error and would never produce an empty list.






          share|improve this answer























          • Thank you! Indeed df is a dataframe. However when I tried the .head, .tail and .index parameters on df Python returned error messages. How can I correct this?

            – AlK
            Sep 26 '16 at 19:45











          • @AlexanderKonstantinidis interesting, df.tail, df.head, df.index work for me. What error(s) are you getting?

            – alecxe
            Sep 26 '16 at 19:48











          • AttributeError: 'DataFrame' object has no attribute 'heads', AttributeError: 'DataFrame' object has no attribute 'tails', TypeError: 'Index' object is not callable

            – AlK
            Sep 26 '16 at 19:51






          • 1





            @AlexanderKonstantinidis ah, it is tail, head and index - no s prefix and no () (these are not methods).

            – alecxe
            Sep 26 '16 at 19:56














          6












          6








          6







          .read_html() produces a list of dataframes (there could be multiple tables in an HTML source), get the desired one by index. In your case, there is a single dataframe:



          dfs = pd.read_html(url)
          df = dfs[0]
          print(df)


          Note that, if there are no tables in the HTML source, it would return an error and would never produce an empty list.






          share|improve this answer













          .read_html() produces a list of dataframes (there could be multiple tables in an HTML source), get the desired one by index. In your case, there is a single dataframe:



          dfs = pd.read_html(url)
          df = dfs[0]
          print(df)


          Note that, if there are no tables in the HTML source, it would return an error and would never produce an empty list.







          share|improve this answer












          share|improve this answer



          share|improve this answer










          answered Sep 26 '16 at 19:36









          alecxealecxe

          329k72656878




          329k72656878












          • Thank you! Indeed df is a dataframe. However when I tried the .head, .tail and .index parameters on df Python returned error messages. How can I correct this?

            – AlK
            Sep 26 '16 at 19:45











          • @AlexanderKonstantinidis interesting, df.tail, df.head, df.index work for me. What error(s) are you getting?

            – alecxe
            Sep 26 '16 at 19:48











          • AttributeError: 'DataFrame' object has no attribute 'heads', AttributeError: 'DataFrame' object has no attribute 'tails', TypeError: 'Index' object is not callable

            – AlK
            Sep 26 '16 at 19:51






          • 1





            @AlexanderKonstantinidis ah, it is tail, head and index - no s prefix and no () (these are not methods).

            – alecxe
            Sep 26 '16 at 19:56


















          • Thank you! Indeed df is a dataframe. However when I tried the .head, .tail and .index parameters on df Python returned error messages. How can I correct this?

            – AlK
            Sep 26 '16 at 19:45











          • @AlexanderKonstantinidis interesting, df.tail, df.head, df.index work for me. What error(s) are you getting?

            – alecxe
            Sep 26 '16 at 19:48











          • AttributeError: 'DataFrame' object has no attribute 'heads', AttributeError: 'DataFrame' object has no attribute 'tails', TypeError: 'Index' object is not callable

            – AlK
            Sep 26 '16 at 19:51






          • 1





            @AlexanderKonstantinidis ah, it is tail, head and index - no s prefix and no () (these are not methods).

            – alecxe
            Sep 26 '16 at 19:56

















          Thank you! Indeed df is a dataframe. However when I tried the .head, .tail and .index parameters on df Python returned error messages. How can I correct this?

          – AlK
          Sep 26 '16 at 19:45





          Thank you! Indeed df is a dataframe. However when I tried the .head, .tail and .index parameters on df Python returned error messages. How can I correct this?

          – AlK
          Sep 26 '16 at 19:45













          @AlexanderKonstantinidis interesting, df.tail, df.head, df.index work for me. What error(s) are you getting?

          – alecxe
          Sep 26 '16 at 19:48





          @AlexanderKonstantinidis interesting, df.tail, df.head, df.index work for me. What error(s) are you getting?

          – alecxe
          Sep 26 '16 at 19:48













          AttributeError: 'DataFrame' object has no attribute 'heads', AttributeError: 'DataFrame' object has no attribute 'tails', TypeError: 'Index' object is not callable

          – AlK
          Sep 26 '16 at 19:51





          AttributeError: 'DataFrame' object has no attribute 'heads', AttributeError: 'DataFrame' object has no attribute 'tails', TypeError: 'Index' object is not callable

          – AlK
          Sep 26 '16 at 19:51




          1




          1





          @AlexanderKonstantinidis ah, it is tail, head and index - no s prefix and no () (these are not methods).

          – alecxe
          Sep 26 '16 at 19:56






          @AlexanderKonstantinidis ah, it is tail, head and index - no s prefix and no () (these are not methods).

          – alecxe
          Sep 26 '16 at 19:56




















          draft saved

          draft discarded
















































          Thanks for contributing an answer to Stack Overflow!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid


          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.

          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f39710903%2fpd-read-html-imports-a-list-rather-than-a-dataframe%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          Use pre created SQLite database for Android project in kotlin

          Darth Vader #20

          Ondo