To pull only the specific ul inside div
I have below html text -
<div class="a-fixed-left-grid-col a-col-left" id="zg-left-col" style="width:200px;margin-left:-200px;float:none;">
<ul id="zg_browseRoot">
<li class="zg_browseUp"> ‹
<a href="https://www.amazon.com/Best-Sellers/zgbs">Any Department</a>
</li>
<ul>
<li class="zg_browseUp"> ‹
<a href="https://www.amazon.com/Best-Sellers/zgbs/amazon-devices">Amazon Devices & Accessories</a>
</li>
<ul>
<li>
<span class="zg_selected"> Amazon Devices</span>
</li>
<ul>
<li><a href="https://www.amazon.com/Best-Sellers-Home-Security-Amazon/zgbs/amazon-devices/17386948011">Home Security from Amazon</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Amazon-Echo-Alexa-Devices/zgbs/amazon-devices/9818047011">Amazon Echo & Alexa Devices</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Dash-Buttons/zgbs/amazon-devices/10667898011">Dash Buttons</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Fire-TV/zgbs/amazon-devices/8521791011">Fire TV</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Fire-Tablets/zgbs/amazon-devices/6669703011">Fire Tablets</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Kindle-readers/zgbs/amazon-devices/6669702011">Kindle E-readers</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Amazon-Device-Bundles/zgbs/amazon-devices/16926003011">Device Bundles</a></li>
</ul>
</ul>
</ul>
</ul>
</div>
I want to pull somewhat like this -
https://www.amazon.com/Best-Sellers-Home-Security-Amazon/zgbs/amazon-devices/17386948011
https://www.amazon.com/Best-Sellers-Amazon-Echo-Alexa-Devices/zgbs/amazon-devices/9818047011
https://www.amazon.com/Best-Sellers-Dash-Buttons/zgbs/amazon-devices/10667898011
https://www.amazon.com/Best-Sellers-Fire-TV/zgbs/amazon-devices/8521791011
https://www.amazon.com/Best-Sellers-Fire-Tablets/zgbs/amazon-devices/6669703011
https://www.amazon.com/Best-Sellers-Kindle-readers/zgbs/amazon-devices/6669702011
https://www.amazon.com/Best-Sellers-Amazon-Device-Bundles/zgbs/amazon-devices/16926003011
I tried by using below code and its working, But not giving the result what I want.
soup.find('div', class_= 'a-fixed-left-grid-col a-col-left').find_all('ul')[3]
python beautifulsoup
add a comment |
I have below html text -
<div class="a-fixed-left-grid-col a-col-left" id="zg-left-col" style="width:200px;margin-left:-200px;float:none;">
<ul id="zg_browseRoot">
<li class="zg_browseUp"> ‹
<a href="https://www.amazon.com/Best-Sellers/zgbs">Any Department</a>
</li>
<ul>
<li class="zg_browseUp"> ‹
<a href="https://www.amazon.com/Best-Sellers/zgbs/amazon-devices">Amazon Devices & Accessories</a>
</li>
<ul>
<li>
<span class="zg_selected"> Amazon Devices</span>
</li>
<ul>
<li><a href="https://www.amazon.com/Best-Sellers-Home-Security-Amazon/zgbs/amazon-devices/17386948011">Home Security from Amazon</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Amazon-Echo-Alexa-Devices/zgbs/amazon-devices/9818047011">Amazon Echo & Alexa Devices</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Dash-Buttons/zgbs/amazon-devices/10667898011">Dash Buttons</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Fire-TV/zgbs/amazon-devices/8521791011">Fire TV</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Fire-Tablets/zgbs/amazon-devices/6669703011">Fire Tablets</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Kindle-readers/zgbs/amazon-devices/6669702011">Kindle E-readers</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Amazon-Device-Bundles/zgbs/amazon-devices/16926003011">Device Bundles</a></li>
</ul>
</ul>
</ul>
</ul>
</div>
I want to pull somewhat like this -
https://www.amazon.com/Best-Sellers-Home-Security-Amazon/zgbs/amazon-devices/17386948011
https://www.amazon.com/Best-Sellers-Amazon-Echo-Alexa-Devices/zgbs/amazon-devices/9818047011
https://www.amazon.com/Best-Sellers-Dash-Buttons/zgbs/amazon-devices/10667898011
https://www.amazon.com/Best-Sellers-Fire-TV/zgbs/amazon-devices/8521791011
https://www.amazon.com/Best-Sellers-Fire-Tablets/zgbs/amazon-devices/6669703011
https://www.amazon.com/Best-Sellers-Kindle-readers/zgbs/amazon-devices/6669702011
https://www.amazon.com/Best-Sellers-Amazon-Device-Bundles/zgbs/amazon-devices/16926003011
I tried by using below code and its working, But not giving the result what I want.
soup.find('div', class_= 'a-fixed-left-grid-col a-col-left').find_all('ul')[3]
python beautifulsoup
add a comment |
I have below html text -
<div class="a-fixed-left-grid-col a-col-left" id="zg-left-col" style="width:200px;margin-left:-200px;float:none;">
<ul id="zg_browseRoot">
<li class="zg_browseUp"> ‹
<a href="https://www.amazon.com/Best-Sellers/zgbs">Any Department</a>
</li>
<ul>
<li class="zg_browseUp"> ‹
<a href="https://www.amazon.com/Best-Sellers/zgbs/amazon-devices">Amazon Devices & Accessories</a>
</li>
<ul>
<li>
<span class="zg_selected"> Amazon Devices</span>
</li>
<ul>
<li><a href="https://www.amazon.com/Best-Sellers-Home-Security-Amazon/zgbs/amazon-devices/17386948011">Home Security from Amazon</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Amazon-Echo-Alexa-Devices/zgbs/amazon-devices/9818047011">Amazon Echo & Alexa Devices</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Dash-Buttons/zgbs/amazon-devices/10667898011">Dash Buttons</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Fire-TV/zgbs/amazon-devices/8521791011">Fire TV</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Fire-Tablets/zgbs/amazon-devices/6669703011">Fire Tablets</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Kindle-readers/zgbs/amazon-devices/6669702011">Kindle E-readers</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Amazon-Device-Bundles/zgbs/amazon-devices/16926003011">Device Bundles</a></li>
</ul>
</ul>
</ul>
</ul>
</div>
I want to pull somewhat like this -
https://www.amazon.com/Best-Sellers-Home-Security-Amazon/zgbs/amazon-devices/17386948011
https://www.amazon.com/Best-Sellers-Amazon-Echo-Alexa-Devices/zgbs/amazon-devices/9818047011
https://www.amazon.com/Best-Sellers-Dash-Buttons/zgbs/amazon-devices/10667898011
https://www.amazon.com/Best-Sellers-Fire-TV/zgbs/amazon-devices/8521791011
https://www.amazon.com/Best-Sellers-Fire-Tablets/zgbs/amazon-devices/6669703011
https://www.amazon.com/Best-Sellers-Kindle-readers/zgbs/amazon-devices/6669702011
https://www.amazon.com/Best-Sellers-Amazon-Device-Bundles/zgbs/amazon-devices/16926003011
I tried by using below code and its working, But not giving the result what I want.
soup.find('div', class_= 'a-fixed-left-grid-col a-col-left').find_all('ul')[3]
python beautifulsoup
I have below html text -
<div class="a-fixed-left-grid-col a-col-left" id="zg-left-col" style="width:200px;margin-left:-200px;float:none;">
<ul id="zg_browseRoot">
<li class="zg_browseUp"> ‹
<a href="https://www.amazon.com/Best-Sellers/zgbs">Any Department</a>
</li>
<ul>
<li class="zg_browseUp"> ‹
<a href="https://www.amazon.com/Best-Sellers/zgbs/amazon-devices">Amazon Devices & Accessories</a>
</li>
<ul>
<li>
<span class="zg_selected"> Amazon Devices</span>
</li>
<ul>
<li><a href="https://www.amazon.com/Best-Sellers-Home-Security-Amazon/zgbs/amazon-devices/17386948011">Home Security from Amazon</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Amazon-Echo-Alexa-Devices/zgbs/amazon-devices/9818047011">Amazon Echo & Alexa Devices</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Dash-Buttons/zgbs/amazon-devices/10667898011">Dash Buttons</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Fire-TV/zgbs/amazon-devices/8521791011">Fire TV</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Fire-Tablets/zgbs/amazon-devices/6669703011">Fire Tablets</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Kindle-readers/zgbs/amazon-devices/6669702011">Kindle E-readers</a></li>
<li><a href="https://www.amazon.com/Best-Sellers-Amazon-Device-Bundles/zgbs/amazon-devices/16926003011">Device Bundles</a></li>
</ul>
</ul>
</ul>
</ul>
</div>
I want to pull somewhat like this -
https://www.amazon.com/Best-Sellers-Home-Security-Amazon/zgbs/amazon-devices/17386948011
https://www.amazon.com/Best-Sellers-Amazon-Echo-Alexa-Devices/zgbs/amazon-devices/9818047011
https://www.amazon.com/Best-Sellers-Dash-Buttons/zgbs/amazon-devices/10667898011
https://www.amazon.com/Best-Sellers-Fire-TV/zgbs/amazon-devices/8521791011
https://www.amazon.com/Best-Sellers-Fire-Tablets/zgbs/amazon-devices/6669703011
https://www.amazon.com/Best-Sellers-Kindle-readers/zgbs/amazon-devices/6669702011
https://www.amazon.com/Best-Sellers-Amazon-Device-Bundles/zgbs/amazon-devices/16926003011
I tried by using below code and its working, But not giving the result what I want.
soup.find('div', class_= 'a-fixed-left-grid-col a-col-left').find_all('ul')[3]
python beautifulsoup
python beautifulsoup
edited Nov 12 '18 at 12:19
Suraj Rao
22.8k75469
22.8k75469
asked Nov 12 '18 at 12:18
Bhajesh BharathanBhajesh Bharathan
1913
1913
add a comment |
add a comment |
2 Answers
2
active
oldest
votes
using .select()
catLinks = soup.select('#zg_browseRoot ul ul ul li a')
for link in catLinks:
print(link.get('href'))
add a comment |
You need to get all the href
inside all the anchor
tags.
Try this:
print([a['href'] for a in soup.find('div', class_= 'a-fixed-left-grid-col a-col-left').find_all('ul')[3].find_all('a')])
Thank you stackoverflow.com/users/4915798/drec4s, this is working
– Bhajesh Bharathan
Nov 14 '18 at 7:23
add a comment |
Your Answer
StackExchange.ifUsing("editor", function ()
StackExchange.using("externalEditor", function ()
StackExchange.using("snippets", function ()
StackExchange.snippets.init();
);
);
, "code-snippets");
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "1"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53262047%2fto-pull-only-the-specific-ul-inside-div%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
2 Answers
2
active
oldest
votes
2 Answers
2
active
oldest
votes
active
oldest
votes
active
oldest
votes
using .select()
catLinks = soup.select('#zg_browseRoot ul ul ul li a')
for link in catLinks:
print(link.get('href'))
add a comment |
using .select()
catLinks = soup.select('#zg_browseRoot ul ul ul li a')
for link in catLinks:
print(link.get('href'))
add a comment |
using .select()
catLinks = soup.select('#zg_browseRoot ul ul ul li a')
for link in catLinks:
print(link.get('href'))
using .select()
catLinks = soup.select('#zg_browseRoot ul ul ul li a')
for link in catLinks:
print(link.get('href'))
answered Nov 12 '18 at 12:49
ewwinkewwink
11.4k22237
11.4k22237
add a comment |
add a comment |
You need to get all the href
inside all the anchor
tags.
Try this:
print([a['href'] for a in soup.find('div', class_= 'a-fixed-left-grid-col a-col-left').find_all('ul')[3].find_all('a')])
Thank you stackoverflow.com/users/4915798/drec4s, this is working
– Bhajesh Bharathan
Nov 14 '18 at 7:23
add a comment |
You need to get all the href
inside all the anchor
tags.
Try this:
print([a['href'] for a in soup.find('div', class_= 'a-fixed-left-grid-col a-col-left').find_all('ul')[3].find_all('a')])
Thank you stackoverflow.com/users/4915798/drec4s, this is working
– Bhajesh Bharathan
Nov 14 '18 at 7:23
add a comment |
You need to get all the href
inside all the anchor
tags.
Try this:
print([a['href'] for a in soup.find('div', class_= 'a-fixed-left-grid-col a-col-left').find_all('ul')[3].find_all('a')])
You need to get all the href
inside all the anchor
tags.
Try this:
print([a['href'] for a in soup.find('div', class_= 'a-fixed-left-grid-col a-col-left').find_all('ul')[3].find_all('a')])
edited Nov 12 '18 at 13:28
answered Nov 12 '18 at 12:23
drec4sdrec4s
1,6062621
1,6062621
Thank you stackoverflow.com/users/4915798/drec4s, this is working
– Bhajesh Bharathan
Nov 14 '18 at 7:23
add a comment |
Thank you stackoverflow.com/users/4915798/drec4s, this is working
– Bhajesh Bharathan
Nov 14 '18 at 7:23
Thank you stackoverflow.com/users/4915798/drec4s, this is working
– Bhajesh Bharathan
Nov 14 '18 at 7:23
Thank you stackoverflow.com/users/4915798/drec4s, this is working
– Bhajesh Bharathan
Nov 14 '18 at 7:23
add a comment |
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53262047%2fto-pull-only-the-specific-ul-inside-div%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown