Merged
24 changes: 0 additions & 24 deletions README.md
@@ -43,30 +43,6 @@ python scripts/import_corpus.py /absolute/path/to/file.xlsx
- The Excel sheet names used are `jd_embed_corpus` and `question_embed_corpus`.
- Rows are loaded with `INSERT ... ON CONFLICT DO UPDATE`, keyed on `source_analysis_id` and `source_question_id`.

## Corpus Embedding Sync Script

Instead of the admin API, you can bulk-sync corpus embeddings with a Python script.

Run:

```bash
source .venv/bin/activate
pip install -r scripts/requirements-corpus-import.txt
python scripts/sync_corpus_embeddings.py --env-file .env
```

Option examples:

```bash
python scripts/sync_corpus_embeddings.py --env-file .env --limit 100
python scripts/sync_corpus_embeddings.py --env-file .env --job-only
python scripts/sync_corpus_embeddings.py --env-file .env --question-only --batch-size 16
```

- Uses `DB_URL`, `DB_USERNAME`, `DB_PASSWORD`, and `COHERE_API_KEY` from `.env`.
- The default model is `embed-v4.0` and the default batch size is `32`.
- Rows are loaded into the `mock_job_posting_embeddings` and `mock_question_embeddings` tables with `INSERT ... ON CONFLICT DO UPDATE`.

## CI/CD

- `CI`: Runs tests and the Docker image build on pushes and PRs to the `main` and `develop` branches.
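
The `INSERT ... ON CONFLICT DO UPDATE` loading mentioned in the README hunk above can be illustrated with a minimal sketch. It is not the project's script: it uses an in-memory SQLite database as a stand-in for PostgreSQL, and the table `jd_corpus_demo`, the `content` column, and the `upsert_rows` helper are hypothetical names chosen only for this example. Only the key column `source_analysis_id` comes from the README.

```python
import sqlite3


def upsert_rows(conn, rows):
    """Insert rows, or update the existing row when the key already exists."""
    conn.executemany(
        """
        INSERT INTO jd_corpus_demo (source_analysis_id, content)
        VALUES (?, ?)
        ON CONFLICT (source_analysis_id) DO UPDATE
        SET content = excluded.content
        """,
        rows,
    )
    conn.commit()


# Hypothetical schema: the real corpus tables are not shown in this diff.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE jd_corpus_demo ("
    "source_analysis_id INTEGER PRIMARY KEY, content TEXT NOT NULL)"
)

upsert_rows(conn, [(1, "first"), (2, "second")])
upsert_rows(conn, [(1, "updated")])  # same key: updates instead of failing

rows = conn.execute(
    "SELECT source_analysis_id, content FROM jd_corpus_demo "
    "ORDER BY source_analysis_id"
).fetchall()
print(rows)
```

Because the conflict target is the source key, re-running an import over the same spreadsheet would overwrite existing rows rather than duplicate them, which is presumably why the README describes the load this way.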
1 change: 0 additions & 1 deletion scripts/requirements-corpus-import.txt
@@ -1,3 +1,2 @@
openpyxl>=3.1.0
psycopg[binary]>=3.1.0
requests>=2.31.0
276 changes: 0 additions & 276 deletions scripts/sync_corpus_embeddings.py

This file was deleted.

@@ -11,14 +11,6 @@

public interface DetailClassificationRepository extends JpaRepository<DetailClassification, Long> {
    List<DetailClassification> findAllByMiddleClassificationId(Long middleClassificationId);
    @Query("""
            SELECT dc
            FROM DetailClassification dc
            JOIN FETCH dc.middleClassification mc
            JOIN FETCH mc.classification
            WHERE dc.id = :id
            """)
    Optional<DetailClassification> findWithHierarchyById(@Param("id") Long id);
    Optional<DetailClassification> findByDetailNameIgnoreCase(String detailName);
    long countByDetailNameIgnoreCase(String detailName);


This file was deleted.
