当前所在位置:珠峰网资料 >> 计算机 >> IT教育 >> 正文
Java抓取网页的内容
发布时间:2010/10/19 11:10:38 来源:www.xue.net 编辑:城市总裁吧

    public static String getHtmlReadLine(String httpurl){

    String CurrentLine=”";

    String TotalString=”";

    InputStream urlStream;

    String content=”";

    try {

    URL url = new URL(httpurl);

    // URL url = new URL(“http://www.sugarinfo.net/dissertation/gctinfo/“);

    HttpURLConnection connection = (HttpURLConnection)url.openConnection();

    connection.connect();

    urlStream = connection.getInputStream();

    BufferedReader reader = new BufferedReader(

    new InputStreamReader(urlStream,”utf-8″));

    while ((CurrentLine = reader.readLine()) != null) {

    TotalString += CurrentLine+”\n”;

    /**换行的地方主要是在这里**/

    }

    content = TotalString;

    // System.out.println(content);

    } catch (Exception e) {

    e.printStackTrace();

    }

    return content;

    }

广告合作:400-664-0084 全国热线:400-664-0084
Copyright 2010 - 2017 www.my8848.com 珠峰网 粤ICP备15080520号-20
珠峰网 版权所有 All Rights Reserved