It does not use source code analysis, some at HTTP WWW are small things, such as the inside of the problem is that it seems to some url is even without a directory of the previous paragraph ( is not a static page, seems to be the asp dynamic web page like this ), such as the following, then don't use regular extraction ah, have no way? :
<script>
Map_category [" keyno=38 "]="is";
</script>
<script>
Map_category [" keyno=37 "]="face";
</script>
<script>
Map_category [" keyno=43 "]="attachment";
</script>
<script>
Map_category [" keyno=44 "]="tool";
</script>
<script>
Map_category [" keyno=45 "]="packaging";