Skip to content

Commit 049a5ad

Browse files
committed
update
1 parent 5729d4c commit 049a5ad

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

66 files changed

+176808
-11
lines changed

Diff for: IMOOCSpider/readme.md

+35
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,35 @@
1+
### 什么是网络爬虫?
2+
#### 维基百科
3+
>**[网络爬虫](https://zh.wikipedia.org/wiki/%E7%B6%B2%E8%B7%AF%E7%88%AC%E8%9F%B2)**(英语:**web crawler**),也叫网络蜘蛛(spider),是一种用来自动浏览万维网的网络机器人。其目的一般为编纂网络索引。
4+
5+
#### 百度百科
6+
>**网络爬虫**(又被称为网页蜘蛛,网络机器人,在FOAF社区中间,更经常的称为网页追逐者),是一种按照一定的规则,自动地抓取万维网信息的程序或者脚本。另外一些不常使用的名字还有蚂蚁、自动索引、模拟程序或者蠕虫。
7+
8+
### java网络爬虫练习项目
9+
一个比较简单的java练习项目。
10+
#### 需要知识
11+
* 什么是网络爬虫
12+
* Java基础
13+
* 正则表达式
14+
15+
#### 项目思路
16+
[慕课网](http://www.imooc.com/wenda)的猿问页面,寻找一个你喜欢的问题页面,利用http请求获取问题页面的源码,然后通过正则表达式,不断爬取相关问题页面。
17+
18+
#### 项目介绍
19+
该项目存在三个类
20+
`Imooc.java :` `imooc问题bean类`
21+
`Spider.java:` `封装用来爬取页面的方法类`
22+
`Main.java:` `运行起始页面`
23+
24+
* Imooc.java类是要爬取内容的对象类,属性成员有问题,问题链接,问题描述,答案列表和下一个问题链接。
25+
* Spider.java类封装了2个可能用到的方法,getSource方法获取网页源代码和getImoocPage获取页面Url列表。
26+
27+
#### 项目源码
28+
IMOOCSpider项目已上传到我的github上----[传送门](https://github.com/HelloWorld521/Java.git)
29+
欢迎Star
30+
31+
下载下来后导入eclipse即可运行。
32+
33+
#### 项目运行结果
34+
35+
![结果.png](./woss/src/images/result.png)

Diff for: README.md

+6-6
Original file line numberDiff line numberDiff line change
@@ -1,19 +1,19 @@
11
# Java
22

3-
## 项目介绍
3+
## 项目介绍
44

55
本项目是个人的一些java项目练习源码,分享给大家,希望可以和大家一起进步!
66

77
## Java项目实例
88

9-
* JspChat jsp聊天室
9+
* [JspChat jsp聊天室](./JspChat/)
1010

11-
* tomcatServlet3.0 Web服务器
11+
* [tomcatServlet3.0 Web服务器](./tomcatServlet3/)
1212

13-
* Woss2.0 电信采集系统
13+
* [Woss2.0 电信采集系统](./woss/)
1414

15-
* eStore   图书管理系统
15+
* [eStore   图书管理系统](./estore/)
1616

17-
* [checkcode Java制作验证码](checkcode/README.md)
17+
* [checkcode Java制作验证码](./checkcode/)
1818

1919
* [IMOOCSpider 简易网络爬虫](./IMOOCSpider/)

Diff for: tomcatServer3.0/readme.md

+20
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,20 @@
1+
## 介绍
2+
本项目模仿tomcat,封装request和response,帮助理解tomcat源码。
3+
4+
5+
6+
7+
## 需要知识
8+
9+
* 了解http协议
10+
* java多线程和网络编程
11+
12+
13+
## 源码
14+
见本项目
15+
16+
17+
## 收获
18+
使用Java模拟浏览器发送请求给服务器,服务器接受请求并相应,锻炼java知识,帮助理解http协议和tomcat工作原理。
19+
20+
Please feel free to contact us with any questions you may have.

Diff for: woss2.0/.classpath renamed to woss/.classpath

+4-1
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,9 @@
11
<?xml version="1.0" encoding="UTF-8"?>
22
<classpath>
33
<classpathentry kind="src" path="src"/>
4-
<classpathentry kind="con" path="org.eclipse.jdt.launching.JRE_CONTAINER/org.eclipse.jdt.internal.debug.ui.launcher.StandardVMType/jdk1.8"/>
4+
<classpathentry kind="lib" path="jar/dom4j-1.6.1.jar"/>
5+
<classpathentry kind="lib" path="jar/log4j-1.2.11.jar"/>
6+
<classpathentry kind="lib" path="jar/ojdbc5.jar"/>
7+
<classpathentry kind="con" path="org.eclipse.jdt.launching.JRE_CONTAINER"/>
58
<classpathentry kind="output" path="bin"/>
69
</classpath>

Diff for: woss2.0/.project renamed to woss/.project

+1-1
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
<?xml version="1.0" encoding="UTF-8"?>
22
<projectDescription>
3-
<name>woss2.0</name>
3+
<name>woss</name>
44
<comment></comment>
55
<projects>
66
</projects>

Diff for: woss/.settings/org.eclipse.core.resources.prefs

+2
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,2 @@
1+
eclipse.preferences.version=1
2+
encoding/<project>=UTF-8

Diff for: woss/.settings/org.eclipse.core.runtime.prefs

+2
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,2 @@
1+
eclipse.preferences.version=1
2+
line.separator=\r\n
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,11 @@
11
eclipse.preferences.version=1
22
org.eclipse.jdt.core.compiler.codegen.inlineJsrBytecode=enabled
3-
org.eclipse.jdt.core.compiler.codegen.targetPlatform=1.8
3+
org.eclipse.jdt.core.compiler.codegen.targetPlatform=1.7
44
org.eclipse.jdt.core.compiler.codegen.unusedLocal=preserve
5-
org.eclipse.jdt.core.compiler.compliance=1.8
5+
org.eclipse.jdt.core.compiler.compliance=1.7
66
org.eclipse.jdt.core.compiler.debug.lineNumber=generate
77
org.eclipse.jdt.core.compiler.debug.localVariable=generate
88
org.eclipse.jdt.core.compiler.debug.sourceFile=generate
99
org.eclipse.jdt.core.compiler.problem.assertIdentifier=error
1010
org.eclipse.jdt.core.compiler.problem.enumIdentifier=error
11-
org.eclipse.jdt.core.compiler.source=1.8
11+
org.eclipse.jdt.core.compiler.source=1.7

Diff for: woss/bin/com.zip

813 KB
Binary file not shown.

Diff for: woss/bin/com/briup/client/Client.class

308 Bytes
Binary file not shown.

Diff for: woss/bin/com/briup/client/Gather.class

308 Bytes
Binary file not shown.

Diff for: woss/bin/com/briup/client/imp/ClientImp.class

2.86 KB
Binary file not shown.

Diff for: woss/bin/com/briup/client/imp/GatherImp.class

5.31 KB
Binary file not shown.

Diff for: woss/bin/com/briup/common/Backup.class

408 Bytes
Binary file not shown.

Diff for: woss/bin/com/briup/common/Configuration.class

423 Bytes
Binary file not shown.

Diff for: woss/bin/com/briup/common/ConfigurationAWare.class

199 Bytes
Binary file not shown.

Diff for: woss/bin/com/briup/common/DBUtils.class

1.49 KB
Binary file not shown.

Diff for: woss/bin/com/briup/common/Log.class

264 Bytes
Binary file not shown.

Diff for: woss/bin/com/briup/common/WossModule.class

161 Bytes
Binary file not shown.

Diff for: woss/bin/com/briup/common/imp/BackupImp.class

2.16 KB
Binary file not shown.

Diff for: woss/bin/com/briup/common/imp/ConfigurationImp.class

3.88 KB
Binary file not shown.

Diff for: woss/bin/com/briup/common/imp/LogImp.class

1.84 KB
Binary file not shown.

0 commit comments

Comments
 (0)