Update README.md

moehmeni · Oct 18, 2021 · fdf2e74 · fdf2e74
1 parent d8c2e74
commit fdf2e74
Showing 1 changed file with 2 additions and 250 deletions.
diff --git a/README.md b/README.md
@@ -39,253 +39,5 @@ Output :
     "comments": ""
 }
 ```
-
-
-## Available properties and methods
- ```python
- # You can use any of below properties and methods instead `a_tags_mp3`
- page.a_tags_mp3
- ```
-<details>
-
-<summary>Click to expand!</summary>
-
-
-#### <kbd>property</kbd> a_tag_hrefs
-
-
-
-
-
----
-
-#### <kbd>property</kbd> a_tag_texts
-
-
-
-
-
----
-
-#### <kbd>property</kbd> a_tags_mp3
-
-
-
-
-
----
-
-#### <kbd>property</kbd> a_tags_rar
-
-
-
-
-
----
-
-#### <kbd>property</kbd> a_tags_with_href
-
-
-
-
-
----
-
-#### <kbd>property</kbd> article_tag
-
-returns an article tag which has the most text length 
-
----
-
-#### <kbd>property</kbd> children
-
-returns a list of `EzSoup` instances from `self.important_hrefs` ##### using `ThreadPoolExecutor` to crawl children much faster than normal `for` loop 
-
----
-
-#### <kbd>property</kbd> favicon_href
-
-
-
-
-
----
-
-#### <kbd>property</kbd> important_a_tags
-
-returns `a` tags that includes header (h2, h3) inside or `a` tags inside headers or elements with class `item` or `post` I call these important becuase they're most likely to be crawlable contentful webpages 
-
----
-
-#### <kbd>property</kbd> important_hrefs
-
-
-
-
-
----
-
-#### <kbd>property</kbd> json_summary
-
-
-
-
-
----
-
-#### <kbd>property</kbd> main_html
-
-
-
-
-
----
-
-#### <kbd>property</kbd> main_image_src
-
-
-
-
-
----
-
-#### <kbd>property</kbd> main_text
-
-
-
-
-
----
-
-#### <kbd>property</kbd> meta_article_modified_time
-
-
-
-
-
----
-
-#### <kbd>property</kbd> meta_article_published_time
-
-
-
-
-
----
-
-#### <kbd>property</kbd> meta_description
-
-
-
-
-
----
-
-#### <kbd>property</kbd> meta_image_src
-
-
-
-
-
----
-
-#### <kbd>property</kbd> possible_topic_names
-
-returns possible topic/breadcrump names of webpage ### values can be unreliable since they aren't generated with NLP methods yet . 
-
----
-
-#### <kbd>property</kbd> summary_dict
-
-
-
-
-
----
-
-#### <kbd>property</kbd> text
-
-
-
-
-
----
-
-#### <kbd>property</kbd> title
-
-usually the `<h1>` tag content of a web page is cleaner than original page `<title>` text so if the h1 or h2 text is similar to the title  it is better to return it instead of original title text 
-
----
-
-#### <kbd>property</kbd> title_tag_text
-
-
-
-
-
-
-
----
-
-### <kbd>method</kbd> `from_url`
-
-```python
-from_url(url: str)
-```
-
-
-
-
-
----
-
-### <kbd>method</kbd> `get_important_children_soups`
-
-```python
-get_important_children_soups(multithread: bool = True, limit: int = None)
-```
-
-returns a list of `EzSoup` instances from `self.important_hrefs`  ## Parameters : 
---- `multithread` : True by default , using `ThreadPoolExecutor` to crawl children much faster 
---- `limit`: limit children count that will be crawled 
-
----
-
-### <kbd>method</kbd> `save_content_summary_html`
-
-```python
-save_content_summary_html(path: str = None)
-```
-
-
-
-
-
----
-
-### <kbd>method</kbd> `save_content_summary_json`
-
-```python
-save_content_summary_json(path: str = None)
-```
-
-
-
-
-
----
-
-### <kbd>method</kbd> `save_content_summary_txt`
-
-```python
-save_content_summary_txt(path: str = None)
-```
-
-</details>
-
----
-
-<sub>
-This README.md was automatically generated via https://github.com/ml-tooling/lazydocs
-</sub>
-
+## Documentation
+Soon...